Publications
Academic Publications
-
Qiuqiang Kong, Bochen Li, Jitong Chen, and Yuxuan Wang,
GiantMIDI-Piano: a large-scale MIDI dataset for classical piano music,
Transactions of the International Society for Music Information Retrieval, 5(1), pp. 87-98, 2022.
DOI: 10.5334/tismir.80
-
Bochen Li, Yuxuan Wang, and Zhiyao Duan,
Audiovisual singing voice separation,
Transactions of the International Society for Music Information Retrieval, 4(1), pp. 195-209, 2021.
DOI: 10.5334/tismir.108
<pdf>
<project>
-
Qiuqiang Kong, Bochen Li, Xuchen Song, Yuan Wan, and Yuxuan Wang,
High-resolution piano transcription with pedals by regressing onset and offset times,
IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 29, pp. 3707-3717, 2021.
DOI: 10.1109/TASLP.2021.3121991
-
Bochen Li, Karthik Dinesh, Chenliang Xu, Gaurav Sharma, and Zhiyao Duan,
Online Audio-Visual Source Association for Chamber Music Performances,
Transactions of the International Society for Music Information Retrieval (TISMIR), vol. 2, no. 2, pp. 29-42, 2019.
DOI: 10.5334/tismir.25
-
Bochen Li and Aparna Kumar, Query by video: cross-modal music retrieval, in Proc. International Society for Music Information Retrieval Conference (ISMIR), 2019.
-
Bochen Li*, Xinzhao Liu*, Karthik Dinesh, Zhiyao Duan, and Gaurav Sharma, Creating a multi-track classical music performance dataset for multi-modal music analysis: challenges, insights, and applications, IEEE Transactions on Multimedia, vol. 21, no. 2, pp. 522-535, 2019. (* equal contribution)
<pdf>
<project>
-
Yapeng Tian, Jing Shi, Bochen Li, Zhiyao Duan, and Chenliang Xu, Audio-visual event localization in unconstrained videos, in Proc. European Conference on Computer Vision (ECCV), 2018.
<pdf>
-
Bochen Li, Akira Maezawa, and Zhiyao Duan, Skeleton plays piano: online generation of pianist body movements from MIDI performance, in Proc. International Society for Music Information Retrieval Conference (ISMIR), 2018.
<pdf>
<demo>
-
Bochen Li and Akira Maezawa, MIDI2Pose: Online keyboard performance motion generation from performance data, in Proc. Information Processing Society of Japan, 2018.
<link>
-
Xueyang Wang, Ryan Stables, Bochen Li, and Zhiyao Duan, Score-aligned polyphonic microtiming estimation, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018.
<pdf>
<poster>
-
Bochen Li, Karthik Dinesh, Gaurav Sharma, and Zhiyao Duan, Video-based vibrato detection and analysis for polyphonic string music, in Proc. International Society for Music Information Retrieval Conference (ISMIR), 2017, pp. 123-130. (best paper nomination)
<pdf>
<slides>
-
Bochen Li, Chenliang Xu, and Zhiyao Duan, Audio-visual source association for string ensembles through multi-modal vibrato analysis, in Proc. The 14th Sound and Music Computing Conference (SMC), 2017, pp. 159-166. (best paper award)
<pdf>
<slides>
-
Bochen Li, Karthik Dinesh, Zhiyao Duan, and Gaurav Sharma, See and listen: score-informed association of sound tracks to players in chamber music performance videos, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017, pp. 2906-2910.
<pdf>
<slides>
-
Karthik Dinesh*, Bochen Li*, Xinzhao Liu, Zhiyao Duan, and Gaurav Sharma, Visually informed multi-pitch analysis of string ensembles, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017, pp. 3021-3025. (* equal contribution)
<pdf>
<slides>
-
Bochen Li and Zhiyao Duan, An approach to score following for piano performances with the sustained effect, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 24, no. 12, pp. 2425-2438, 2016.
<pdf>
<project>
-
Bochen Li and Zhiyao Duan, Score following for piano performances with sustain-pedal effects, in Proc. International Society for Music Information Retrieval Conference (ISMIR), 2015, pp. 469-475.
<pdf>
<poster>
-
Li, N., Wang, R., Deng, Y., Liu, Y., Li, B., Wang, C., and Balz, T. Unsupervised polarimetric synthetic aperture radar classification of large-scale landslides caused by Wenchuan earthquake in hue-saturation-intensity color space. Journal of Applied Remote Sensing, vol. 8, no. 1, 2014.
-
Li, N., Wang, R., Deng, Y., Liu, Y., Wang, C., Balz, T., and Li, B. Polarimetric Response of Landslides at X-Band Following the Wenchuan Earthquake. IEEE Geoscience and Remote Sensing Letters, vol. 11, no. 10, pp. 1722-1726, 2014.
Patents
-
Bochen Li, Vibert Thio, Haonan Chen, Xuefan Hu, and Jitong Chen,
Approach to automatic music remix based on style templates,
Publication of US20230360619A1,
November 2023.
-
Vibert Thio, Bochen Li, Haonan Chen, and Jitong Chen,
Automatic and interactive mashup system,
Publication of US20230360618A1,
November 2023.
-
Bochen Li, Andrew Shaw, and Jitong Chen,
Converting audio samples to full song arrangements,
Publication of WO2023214937A1,
November 2023.
-
Yufan Xue, Qiang Zheng, Dong Niu, Liangqin Xu, Xiaochan Wang, Jitong Chen, Bochen Li, and Naihan Li,
Music generation method, apparatus and system, and storage medium,
Publication of WO2023211386A2,
September 2023.
-
Bochen Li, Rodrigo Castellon, Daiyu Zhang, and Jitong Chen,
Beatboxing transcription,
Publication of US20230282188A1,
September 2023.
-
Zhihao Ouyang, Daiyu Zhang, Bochen Li, Baoman Liu, and Liuqing Yang,
Automatic and fast generation of music audio content for videos,
Publication of US11763849B1,
September 2023.
-
Bochen Li, Daiyu Zhang, Shawn Chan, and Jitong Chen,
Interactive movement audio engine,
Publication of US20230197040A1,
June 2023.
-
Shuai Yuan, Bochen Li, Qiuhong Xu, Na Zhao, Zhengyi Fang, Peidao Li, and Shengli Wang,
Method and device for determining audio frequency, electronic equipment and storage medium,
Publication of CN115831080A,
March 2023.
-
Chenyu Sun, Jitong Chen, Nathanael Schager, Maryyann Crichton, Josiah John Serrano, Bochen Li, Xuefan Hu, Fraser Smith, Huangui Jin, David Trevelyan, Suiyu Feng, Brandon Wu, and Tao Xiong,
Special effect processing method and apparatus,
Publication of WO2022169418A1,
August 2022.
-
Bochen Li and Aparna Kumar,
Systems, methods & computer program products for associating media content having different modalities,
Publication of US20200394213A1,
December 2020.
-
Akira Maezawa and Bochen Li,
Information processing method,
Publication of US20200365126A1,
November 2020.