ePrints.FRI - University of Ljubljana, Faculty of Computer and Information Science

Search and classification of web shops

Aron Birsa (2017) Search and classification of web shops. EngD thesis.

[img]
Preview
PDF
Download (737Kb)

    Abstract

    The aim of the thesis was to develop a tool for automatic classification of online stores depending on the type of products they offer. Websites are classified into seven predefined categories: antiques and collectibles, cloth- ing, consumer electronics, furniture, home and garden, jewelry and office products. The main problem was getting relevant data to build a learning and test data set and classifying web sites. The following machine learning methods were used: naive Bayesian classifier, k-nearest neighbors algorithm, random forests, neural networks and support vector machine. The most promising result were obtained using the support vector machine classifier.

    Item Type: Thesis (EngD thesis)
    Keywords: specialized search engine, data mining, machine learning, e- commerce, text analysis, naive Bayesian classifier, k-nearest neighbors algo- rithm, random forests, neural networks, support vector machine.
    Number of Pages: 24
    Language of Content: Slovenian
    Mentor / Comentors:
    Name and SurnameIDFunction
    izr. prof. dr. Marko Robnik Šikonja276Mentor
    Link to COBISS: http://www.cobiss.si/scripts/cobiss?command=search&base=51012&select=(ID=1537344451)
    Institution: University of Ljubljana
    Department: Faculty of Computer and Information Science
    Item ID: 3757
    Date Deposited: 20 Jan 2017 13:28
    Last Modified: 02 Feb 2017 10:27
    URI: http://eprints.fri.uni-lj.si/id/eprint/3757

    Actions (login required)

    View Item