ePrints.FRI - University of Ljubljana, Faculty of Computer and Information Science

Discovery and analysis of advertisements from textual data

Kristjan Pičulin (2018) Discovery and analysis of advertisements from textual data. EngD thesis.

[img]
Preview
PDF
Download (994Kb)

    Abstract

    For my thesis i made a program, that recognizes if a web article is a pai advertisement or if it is a real news article and also analized the results that were made by the program. I analized why articles are classified the way they are, why are some articles misclassified and what things affect how program is recognizing articles. I was especially interested in a way to separate news articles and advertisements. The program was made in Python programming language. I used libraries such as: pyqt, sklearn and similar. I was quite successful in making the program work the way i wanted and i also found out many interesting things about articles and advertisements.

    Item Type: Thesis (EngD thesis)
    Keywords: machine learning, paid articles, advertisement, SVM, Naive Bayes, SGD, TF-IDF.
    Number of Pages: 40
    Language of Content: Slovenian
    Mentor / Comentors:
    Name and SurnameIDFunction
    doc. dr. Aleksander Sadikov934Mentor
    Link to COBISS: http://www.cobiss.si/scripts/cobiss?command=search&base=51012&select=(ID=1537963459)
    Institution: University of Ljubljana
    Department: Faculty of Computer and Information Science
    Item ID: 4290
    Date Deposited: 09 Oct 2018 10:12
    Last Modified: 10 Oct 2018 08:31
    URI: http://eprints.fri.uni-lj.si/id/eprint/4290

    Actions (login required)

    View Item