ePrints.FRI - University of Ljubljana, Faculty of Computer and Information Science

System for sentiment analysis of comments about mobile applications

Luka Kacil (2016) System for sentiment analysis of comments about mobile applications. EngD thesis.

[img]
Preview
PDF
Download (2513Kb)

    Abstract

    The goal of this thesis was to build a sentiment analysis system, which can tag exuberant reviews in the Google Play store. First we gave an overview of the sentiment analysis field and analysis of input comments to better understand our problem domain. We described theoretical foundations of every method used to build our system. We started by transforming input reviews into tokens which were then normalized, negated and transformed in n-grams. After that we used stemming, spell correction, part of speech tagging and adding other attributes to generate eight different collections of features. We selected best features from every collection with χ2 method. For classification we used naive Bayes, logistic regression and support vector machine to classify reviews. After that we evaluated classifiers by using internal cross-validation and computing classification accuracy, recall, precision, F1 score and statistical tests. In the end we tested tagging reviews from our problem domain with existing solutions for sentiment analysis and compared the results. Results revealed that there were statistically significant differences between classifiers. There were also statistically significant differences between some feature collections. Results also revealed that there were statistically significant differences between existing solutions and some of our models.

    Item Type: Thesis (EngD thesis)
    Keywords: sentiment analysis, supervised machine learning, support vector machine, naive Bayes, logistic regression
    Number of Pages: 79
    Language of Content: Slovenian
    Mentor / Comentors:
    Name and SurnameIDFunction
    izr. prof. dr. Zoran Bosnić3826Mentor
    Link to COBISS: http://www.cobiss.si/scripts/cobiss?command=search&base=51012&select=(ID=1537016515 )
    Institution: University of Ljubljana
    Department: Faculty of Computer and Information Science
    Item ID: 3369
    Date Deposited: 21 Jun 2016 16:31
    Last Modified: 05 Jul 2016 13:14
    URI: http://eprints.fri.uni-lj.si/id/eprint/3369

    Actions (login required)

    View Item