Bridging Music & Text with Pre-trained Models for Music Captioning and QA

07/2023 – present

Supervised by Dr Emmanouil Benetos, Centre for Digital Music, Queen Mary University of London

  • Developed Music Instruct (MI) query-response dataset based on captions & well-designed prompts to GPT-4.
  • Achieved cutting-edge performance in question answering on both MusicQA and Music Instruct datasets.
  • Employed instruct fine-tuning techniques on MI to attain state-of-the-art (SOTA) results in captioning.
Avatar
马英浩 (Nicolaus) MA Yinghao
PhD Student in AI & Music

MA Yinghao, PhD student in C4DM, QMUL. Research interests include music information retireval, self-supervised learning, music-related multimodal machine learning, and audio signal processing and matter.

Related