ePrints.FRI - University of Ljubljana, Faculty of Computer and Information Science

Implementation of a Vamp plugin for segmentation of audio recordings

Timotej Fartek (2018) Implementation of a Vamp plugin for segmentation of audio recordings. EngD thesis.

[img]
Preview
PDF
Download (3125Kb)

    Abstract

    Digitalization is very important for audio data archives as it increases the lifespan and persistence of stored data. In the process multiple options for semantic analysis emerge. This thesis is about segmentation of audio data, specifically the separation between speech and music in audio files which can be useful for instance for radio stations or streaming services such as Spotify and Netflix. Within the scope of this thesis a working segmentation algorithm, which takes a frequency-domain (meaning it is transformed using a discrete fourier transform) input and returns a list of features with their appropriate time stamps and probablities that the input signal at that specific time belongs to the class music, was developed. It is implemented as a Vamp plugin and with the help of Vampy, a wrapper plugin, it is programmed in Python. Performance of the developed plugin was also analysed and compared to other pre-existing implementations in Matlab and C#.

    Item Type: Thesis (EngD thesis)
    Keywords: digital audio processing, digital signal processing, Vamp, Vampy, segmentation, Sonic Visualiser
    Number of Pages: 46
    Language of Content: Slovenian
    Mentor / Comentors:
    Name and SurnameIDFunction
    izr. prof. dr. Matija Marolt271Mentor
    Link to COBISS: http://www.cobiss.si/scripts/cobiss?command=search&base=51012&select=(ID=1537728707)
    Institution: University of Ljubljana
    Department: Faculty of Computer and Information Science
    Item ID: 4074
    Date Deposited: 19 Feb 2018 15:13
    Last Modified: 07 Mar 2018 10:34
    URI: http://eprints.fri.uni-lj.si/id/eprint/4074

    Actions (login required)

    View Item