ePrints.FRI - University of Ljubljana, Faculty of Computer and Information Science

Using deep neural networks for differentiating automatically generated from manually written articles

Amon Stopinšek (2019) Using deep neural networks for differentiating automatically generated from manually written articles. EngD thesis.

[img]
Preview
PDF
Download (1491Kb)

    Abstract

    This thesis deals with the text classification on the problem of classifying manually written and automatically generated articles. We tested various convolutional and recurrent deep neural network architectures and various text representations. Models were tested on a dataset of manually written and automatically generated articles about weight loss. Best results were achieved with a model using the BLSTM architecture and word2vec word embeddings. With this model, we achieved 96,71% classification accuracy on the test dataset of manually written articles, 100% classification accuracy on articles generated with the bad model and 97,41% classification accuracy on articles generated with the good model.

    Item Type: Thesis (EngD thesis)
    Keywords: artifical inteligence, deep neural networks, text classification, natural language processing
    Number of Pages: 61
    Language of Content: Slovenian
    Mentor / Comentors:
    Name and SurnameIDFunction
    prof. dr. Igor Kononenko237Mentor
    Link to COBISS: http://www.cobiss.si/scripts/cobiss?command=search&base=51012&select=(ID=1538116803)
    Institution: University of Ljubljana
    Department: Faculty of Computer and Information Science
    Item ID: 4341
    Date Deposited: 28 Jan 2019 12:34
    Last Modified: 07 Feb 2019 12:58
    URI: http://eprints.fri.uni-lj.si/id/eprint/4341

    Actions (login required)

    View Item