ePrints.FRI - University of Ljubljana, Faculty of Computer and Information Science

A textual analytics based writing aid

Gašper Mlakar (2012) A textual analytics based writing aid. EngD thesis.

    Abstract

    We surveyed the most common techniques for natural language processing and text mining and, on the basis of text analysis, built an application that provides writing aids. The application embeds tools that classify texts by content and by writing style, using the Naive Bayes classifier and its improved version, AODE. We compared texts by the cosine similarity of their vector representations and used that similarity for clustering. We used several reference text corpora and built a simple tool for viewing and editing them. We also built a few utilities based on dictionaries and synonyms, e.g., a simple spelling checker.
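
    The abstract does not say how the vector text representation was built. As a minimal sketch, assuming a plain bag-of-words term-count vector (the function names `term_frequencies` and `cosine_similarity` are illustrative and not taken from the thesis), the cosine similarity of two texts could be computed as follows:

```python
import math
import re
from collections import Counter


def term_frequencies(text):
    """Return a bag-of-words vector: lowercase word -> occurrence count.

    Assumption: simple word tokenization; the thesis may use a different
    representation (e.g. weighting or lemmatization).
    """
    return Counter(re.findall(r"\w+", text.lower()))


def cosine_similarity(text_a, text_b):
    """Cosine of the angle between the term-count vectors of two texts."""
    a = term_frequencies(text_a)
    b = term_frequencies(text_b)
    # Only terms present in both texts contribute to the dot product.
    dot = sum(a[term] * b[term] for term in a.keys() & b.keys())
    norm_a = math.sqrt(sum(count * count for count in a.values()))
    norm_b = math.sqrt(sum(count * count for count in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty text is treated as similar to nothing
    return dot / (norm_a * norm_b)
```

    With raw counts the result lies between 0 (no shared words) and 1 (identical word distributions), so pairwise values of this kind can serve as the similarity measure for a clustering step.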

    Item Type: Thesis (EngD thesis)
    Keywords: natural language processing, text mining, classification, Naive Bayes classifier, AODE, cosine similarity, clustering, text corpus
    Number of Pages: 60
    Language of Content: Slovenian
    Mentor / Comentors:
    Name and Surname                  ID     Function
    prof. dr. Marko Robnik Šikonja    276    Mentor
    Link to COBISS: http://www.cobiss.si/scripts/cobiss?command=search&base=50070&select=(ID=00009487444)
    Institution: University of Ljubljana
    Department: Faculty of Computer and Information Science
    Item ID: 1823
    Date Deposited: 19 Sep 2012 17:24
    Last Modified: 05 Nov 2012 14:30
    URI: http://eprints.fri.uni-lj.si/id/eprint/1823