Aron Birsa (2017) Search and classification of web shops. EngD thesis.
Abstract
The aim of the thesis was to develop a tool for automatic classification of online stores depending on the type of products they offer. Websites are classified into seven predefined categories: antiques and collectibles, cloth- ing, consumer electronics, furniture, home and garden, jewelry and office products. The main problem was getting relevant data to build a learning and test data set and classifying web sites. The following machine learning methods were used: naive Bayesian classifier, k-nearest neighbors algorithm, random forests, neural networks and support vector machine. The most promising result were obtained using the support vector machine classifier.
Item Type: | Thesis (EngD thesis) |
Keywords: | specialized search engine, data mining, machine learning, e- commerce, text analysis, naive Bayesian classifier, k-nearest neighbors algo- rithm, random forests, neural networks, support vector machine. |
Number of Pages: | 24 |
Language of Content: | Slovenian |
Mentor / Comentors: | Name and Surname | ID | Function |
---|
izr. prof. dr. Marko Robnik Šikonja | 276 | Mentor |
|
Link to COBISS: | http://www.cobiss.si/scripts/cobiss?command=search&base=51012&select=(ID=1537344451) |
Institution: | University of Ljubljana |
Department: | Faculty of Computer and Information Science |
Item ID: | 3757 |
Date Deposited: | 20 Jan 2017 13:28 |
Last Modified: | 02 Feb 2017 10:27 |
URI: | http://eprints.fri.uni-lj.si/id/eprint/3757 |
---|
Actions (login required)