Using deep neural networks for differentiating automatically generated from manually written articles

Amon Stopinšek (2019) Using deep neural networks for differentiating automatically generated from manually written articles. EngD thesis.

Preview

PDF
Download (1491Kb)

Abstract

This thesis deals with the text classification on the problem of classifying manually written and automatically generated articles. We tested various convolutional and recurrent deep neural network architectures and various text representations. Models were tested on a dataset of manually written and automatically generated articles about weight loss. Best results were achieved with a model using the BLSTM architecture and word2vec word embeddings. With this model, we achieved 96,71% classification accuracy on the test dataset of manually written articles, 100% classification accuracy on articles generated with the bad model and 97,41% classification accuracy on articles generated with the good model.

Item Type:

Thesis (EngD thesis)

Keywords:

artifical inteligence, deep neural networks, text classification, natural language processing

Number of Pages:

Language of Content:

Slovenian

Mentor / Comentors:

Name and Surname	ID	Function
prof. dr. Igor Kononenko	237	Mentor

Link to COBISS:

http://www.cobiss.si/scripts/cobiss?command=search&base=51012&select=(ID=1538116803)

Institution:

University of Ljubljana

Department:

Faculty of Computer and Information Science

Item ID:

4341

Date Deposited:

28 Jan 2019 12:34

Last Modified:

07 Feb 2019 12:58

URI:

http://eprints.fri.uni-lj.si/id/eprint/4341

Actions (login required)

View Item