ePrints.FRI - University of Ljubljana, Faculty of Computer and Information Science

Contextual matching and search using web-based bulletin board model

Vasja Laharnar (2013) Contextual matching and search using web-based bulletin board model. EngD thesis.

    Abstract

    Bulletin boards have for many years been a well-known and established way of providing information to different groups of people. We present an improved internet-based form of a bulletin board, which first required us to study the basic natural language processing tasks. Among other things, we tokenized the published content, lemmatized the resulting tokens, and built a structure of semantically similar words in a non-relational database. We also classified the texts using a naive Bayes classifier, which enables contextual matching of posts. We successfully tested the implemented search and matching systems on an internship problem domain, in the form of a web service based on content supplied by educational institutions as well as companies and organizations.
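The pipeline named in the abstract (tokenization, lemmatization, naive Bayes classification) can be illustrated with a minimal sketch. This is not the thesis implementation: the `LEMMAS` table, the class and function names, the category labels, and the training posts are all hypothetical, and the add-one (Laplace) smoothing is an assumption about how unseen words might be handled.

```python
import math
import re
from collections import Counter, defaultdict

# Hypothetical lemma table; a real system would use a full lemmatizer.
LEMMAS = {"internships": "internship", "students": "student",
          "developers": "developer", "companies": "company"}

def tokenize(text):
    """Lowercase the text and return its alphabetic tokens (Unicode-aware)."""
    return re.findall(r"[^\W\d_]+", text.lower())

def lemmatize(tokens):
    """Map each token to its lemma, falling back to the token itself."""
    return [LEMMAS.get(t, t) for t in tokens]

class NaiveBayes:
    """Multinomial naive Bayes with add-one smoothing (an assumed choice)."""

    def __init__(self):
        self.word_counts = defaultdict(Counter)  # label -> lemma counts
        self.doc_counts = Counter()              # label -> number of posts
        self.vocab = set()

    def train(self, text, label):
        tokens = lemmatize(tokenize(text))
        self.word_counts[label].update(tokens)
        self.doc_counts[label] += 1
        self.vocab.update(tokens)

    def classify(self, text):
        tokens = lemmatize(tokenize(text))
        total_docs = sum(self.doc_counts.values())
        best_label, best_score = None, -math.inf
        for label, n_docs in self.doc_counts.items():
            # Log prior plus the sum of smoothed log likelihoods.
            score = math.log(n_docs / total_docs)
            total_words = sum(self.word_counts[label].values())
            for t in tokens:
                count = self.word_counts[label][t]
                score += math.log((count + 1) / (total_words + len(self.vocab)))
            if score > best_score:
                best_label, best_score = label, score
        return best_label

# Hypothetical training posts for two made-up categories.
clf = NaiveBayes()
clf.train("Internships for software developers in Python and Java", "software")
clf.train("Company seeks students for web programming", "software")
clf.train("Marketing internship with social media campaigns", "marketing")
clf.train("Students wanted for advertising and sales promotion", "marketing")

def match(query, posts):
    """Return the posts assigned to the same category as the query."""
    category = clf.classify(query)
    return [p for p in posts if clf.classify(p) == category]
```

In this sketch, two posts count as a contextual match when the classifier assigns them to the same category. That is one simple reading of the abstract; the matching criterion used in the thesis may differ.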

    Item Type: Thesis (EngD thesis)
    Keywords: natural language processing, semantic similarity, lemmatization, classification, Naive Bayes classifier, information retrieval, search, matching
    Number of Pages: 60
    Language of Content: Slovenian
    Mentor / Comentors:
    Name and Surname              ID     Function
    izr. prof. dr. Marko Bajec    245    Mentor
    Link to COBISS: http://www.cobiss.si/scripts/cobiss?command=search&base=50070&select=(ID=9758804)
    Institution: University of Ljubljana
    Department: Faculty of Computer and Information Science
    Item ID: 2014
    Date Deposited: 27 Mar 2013 14:30
    Last Modified: 08 Apr 2013 14:31
    URI: http://eprints.fri.uni-lj.si/id/eprint/2014
