Jan Jug (2015) Transcription of piano music with deep learning. EngD thesis.
Abstract
Transcription of music is a complex process of transcribing an audio recording into a symbolic notation. The goal of this thesis was to examine transcription of piano music with deep learning, for which three models of deep neural networks were implemented: multilayer perceptron, convolutional neural network and deep belief network. Through the use of deep belief network, unsupervised pretraining for automatic extraction of musical features from audio signals was also tested. Learning of these models and evaluation of transcription was performed with MAPS database for piano music transcription. A comparison between Fast Fourier Transform and Constant Q Transform for data pre-processing was also carried out. Final results show that deep learning with an appropriate learning schedule is potentially a powerful tool for automatic transcription of music.
Item Type: | Thesis (EngD thesis) |
Keywords: | automatic music transcription, deep neural networks, piano music, deep learning, multilayer perceptron, convolutional neural network, deep belief network, Fast Fourier Transform, Constant Q Transform |
Number of Pages: | 43 |
Language of Content: | Slovenian |
Mentor / Comentors: | Name and Surname | ID | Function |
---|
doc. dr. Matija Marolt | 271 | Mentor |
|
Link to COBISS: | http://www.cobiss.si/scripts/cobiss?command=search&base=51012&select=(ID=1536477635) |
Institution: | University of Ljubljana |
Department: | Faculty of Computer and Information Science |
Item ID: | 3044 |
Date Deposited: | 04 Sep 2015 16:17 |
Last Modified: | 17 Sep 2015 12:52 |
URI: | http://eprints.fri.uni-lj.si/id/eprint/3044 |
---|
Actions (login required)