ePrints.FRI - University of Ljubljana, Faculty of Computer and Information Science

Distributed representation based classification

Ivan Drljepan (2013) Distributed representation based classification. EngD thesis.

    Abstract

    Machine learning increasingly faces datasets that require learning from a large number of learning samples. On such problems, some otherwise successful methods require too much time and/or space to be viable. The aim of the thesis was to implement and test the distributed representation based classification method, whose classification speed is independent of the number of learning samples. We show that, for high-dimensional problems, an implementation that preserves a constant classification time requires too much space to be practical. By using hash tables we preserved almost constant, fast classification for low-dimensional problems. This is made possible by low memory consumption, which is crucial for the method's classification speed. However, in low-dimensional problems a high number of learning samples causes learning saturation, which results in a drop in the classification rate. With more dimensions the classification rate improves, but at the cost of higher memory consumption and longer classification time. Empirical evaluation has shown that, compared to the related nearest neighbors method, distributed representation based classification is faster and uses less space, while the classification rates show no statistically significant differences. We determined that the method is suitable for sequential problems and that there are problems for which it is entirely unsuitable. Thus the method does not offer a general solution; however, under certain circumstances it can solve problems faster and with less space while maintaining a comparable classification rate.
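
The abstract does not spell out the algorithm, so the following is only an illustrative sketch and not the thesis's implementation. It assumes a simplified scheme in which each sample is quantized into a grid cell and per-cell class counts are kept in a hash table; the class name `GridHashClassifier`, the `cell_width` parameter, and the majority-class fallback are all hypothetical. It is meant to show why a hash lookup makes classification time independent of the number of learning samples, and why memory depends on the number of occupied cells.

```python
from collections import Counter, defaultdict


class GridHashClassifier:
    """Hypothetical sketch: class counts per quantized cell, stored in a dict."""

    def __init__(self, cell_width=0.25):
        self.cell_width = cell_width          # assumed quantization step
        self.cells = defaultdict(Counter)     # cell key -> class counts
        self.class_totals = Counter()         # overall class counts (fallback)

    def _key(self, x):
        # Map a feature vector to the integer coordinates of its grid cell.
        return tuple(int(v // self.cell_width) for v in x)

    def fit(self, X, y):
        # One hash-table update per learning sample.
        for x, label in zip(X, y):
            self.cells[self._key(x)][label] += 1
            self.class_totals[label] += 1
        return self

    def predict(self, x):
        # A single hash lookup; the cost does not grow with the sample count.
        counts = self.cells.get(self._key(x))
        if not counts:
            # Unseen cell: fall back to the overall majority class.
            return self.class_totals.most_common(1)[0][0]
        return counts.most_common(1)[0][0]


clf = GridHashClassifier(cell_width=0.5).fit(
    [(0.1, 0.1), (0.2, 0.3), (0.9, 0.8)], ["a", "a", "b"]
)
print(clf.predict((0.3, 0.2)), clf.predict((0.7, 0.9)))
```

In this simplified scheme only occupied cells consume memory, and the number of possible cells grows quickly with the number of dimensions, which is in line with the space concerns the abstract raises for high-dimensional problems.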

    Item Type: Thesis (EngD thesis)
    Keywords: machine learning, classification, hash tables, big data, distributed representation
    Number of Pages: 45
    Language of Content: Slovenian
    Mentor / Comentors:
    Name and Surname | ID | Function
    izr. prof. dr. Marko Robnik Šikonja | 276 | Mentor
    Link to COBISS: http://www.cobiss.si/scripts/cobiss?command=search&base=50070&select=(ID=9689684)
    Institution: University of Ljubljana
    Department: Faculty of Computer and Information Science
    Item ID: 1981
    Date Deposited: 13 Feb 2013 08:41
    Last Modified: 04 Mar 2013 08:39
    URI: http://eprints.fri.uni-lj.si/id/eprint/1981
