ePrints.FRI - University of Ljubljana, Faculty of Computer and Information Science

Data preparation for municipal virtual assistant using machine learning

Leon Noe Jovan (2016) Data preparation for municipal virtual assistant using machine learning. MSc thesis.

Download (3443Kb)


    The main goal of this master’s thesis was to develop a procedure that will automate the construction of the knowledge base for a virtual assistant that answers questions about municipalities in Slovenia. The aim of the procedure is to replace or facilitate manual preparation of the virtual assistant's knowledge base. Theoretical backgrounds of different machine learning fields, such as multilabel classification, text mining and learning from weakly labeled data were examined to gain a better understanding of the topic. In this thesis, we present a procedure that finds the most relevant websites to provide answers on various questions relating to the municipality's activities. The procedure's parameters were first optimized using test data, and then the procedure was evaluated manually using data of new municipalities. In this way, we acquired real estimation of the quality of the implemented procedure. The results show that the procedure recommends more relevant answers in comparison to a commercial search engine. The developed procedure therefore effectively speeds up and simplifies data preparation for the municipal virtual assistant. In this way, we facilitate the work of municipality staff who until now had to insert answers into the municipal virtual assistant's knowledge base manually.

    Item Type: Thesis (MSc thesis)
    Keywords: municipal virtual assistant, multi-label classification, weakly labeled data, clustering, text mining
    Number of Pages: 79
    Language of Content: Slovenian
    Mentor / Comentors:
    Name and SurnameIDFunction
    izr. prof. dr. Matjaž Kukar267Mentor
    prof. dr. Matjaž Gams1109Comentor
    Link to COBISS: http://www.cobiss.si/scripts/cobiss?command=search&base=51012&select=(ID=1537011139)
    Institution: University of Ljubljana
    Department: Faculty of Computer and Information Science
    Item ID: 3365
    Date Deposited: 21 Jun 2016 09:35
    Last Modified: 01 Jul 2016 13:06
    URI: http://eprints.fri.uni-lj.si/id/eprint/3365

    Actions (login required)

    View Item