Bridging Music & Text with Pre-trained Models for Music Captioning and QA

Sep 29, 2023

07/2023 – present

Supervised by Dr Emmanouil Benetos, Centre for Digital Music, Queen Mary University of London

Developed Music Instruct (MI) query-response dataset based on captions & well-designed prompts to GPT-4.
Achieved cutting-edge performance in question answering on both MusicQA and Music Instruct datasets.
Employed instruct fine-tuning techniques on MI to attain state-of-the-art (SOTA) results in captioning.

MA Yinghao, PhD student in C4DM, QMUL. Research interests include music information retireval, self-supervised learning, music-related multimodal machine learning, and audio signal processing and matter.

Bridging Music & Text with Pre-trained Models for Music Captioning and QA

马英浩 (Nicolaus) MA Yinghao

PhD Student in AI & Music

Related

Bridging Music & Text with Pre-trained Models for Music Captioning and QA

马英浩 (Nicolaus) MA Yinghao

PhD Student in AI & Music

Related

Publications

MusiLingo: Bridging Music and Text with Pre-trained Language Models for Music Captioning and Query Response