Bridging Music & Text with Pre-trained Models for Music Captioning and QA
07/2023 – present
Supervised by Dr Emmanouil Benetos, Centre for Digital Music, Queen Mary University of London
- Developed Music Instruct (MI) query-response dataset based on captions & well-designed prompts to GPT-4.
- Achieved cutting-edge performance in question answering on both MusicQA and Music Instruct datasets.
- Employed instruct fine-tuning techniques on MI to attain state-of-the-art (SOTA) results in captioning.