Research & implementation of Chinese flute playing technique recognition based on machine learning

  • Feb. 2020 – May 2020. Beijing, CHN.
  • One of the graduation theses that awarded the outstanding paper honor of School of Mathematical Science, Peking University.
  • Research Assistant for prof. CHEN Xiaoou in Wangxuan Institute of Computer Technology at Peking University.

Music object recognition and recording is the essential component of music information retrieval. Different from other fields of melody extraction and music transcription, the research on musical instrument technique detection is still in the early stage. The existing work mainly focuses on technique detection of individual notes or frame, and there is a lack of effective data set.

This article constructed audio dataset(extraction code “jvbk” if needed) on ten kinds of techniques of transverse Chinese bamboo flute (Di), with 101 minutes of audio, using Mel frequency spectrum as audio feature, and put forward a model based on the fully convolutional neural network (FCNN). This model is an end-to-end sound event detector for variable length of the input and can be used in the detection of instruments technology.

Compared to the model based on VGG13, VGG16 and LeNet-5 baseline to evaluate the effectiveness of the proposed framework, this model got the average accuracy 94%, much better than all the others, on input of the different length of audio, which suggest good generalization ability.

Key Words: sound event detection, music information retrieval, fully convolutional neural network, musical instrumet technique detection

Avatar
马英浩 (Nicolaus) MA Yinghao
PhD Student in AI & Music

MA Yinghao, PhD student in C4DM, QMUL. Research interests include music information retireval, self-supervised learning, music-related multimodal machine learning, and audio signal processing and matter.

Related