ePrints.FRI - University of Ljubljana, Faculty of Computer and Information Science

Python data mining environments

Aleš Mrak (2012) Python data mining environments. EngD thesis.

Download (1432Kb)


    In the thesis we compare the systems for data mining that have an interface in the programming language Python. Many open-source systems for data mining and library had implemented their software interfaces to the Python programming language. They choose Python because it is fast and provides object-oriented programming, allows for the integration of other software libraries in Python and is implemented in all major operating systems (Windows, Linux / Unix, OS / 2, Mac, etc..). Our analysis systems for data mining covers seven most used systems (Elefant, MDP, OpenCVLibrary, Orange, Pybrain, Pyml and Shogun). The analysis covered the following properties of the systems data formats, application programming interface (GUI and API), multitasking, support for databases, response times and other aspects such as installation, documentation and support for the users. From this analysis, we also find out what are the common shortcomings of the analyzed libraries and we give some recommendations to developers.

    Item Type: Thesis (EngD thesis)
    Keywords: Python, data mining, Elefant, MDP, OpenCV, Orange, PyBrain, PyML, Shogun, Machine learning.
    Number of Pages: 96
    Language of Content: Slovenian
    Mentor / Comentors:
    Name and SurnameIDFunction
    prof. dr. Blaž Zupan106Mentor
    Link to COBISS: http://www.cobiss.si/scripts/cobiss?command=search&base=50070&select=(ID=00009400660)
    Institution: University of Ljubljana
    Department: Faculty of Computer and Information Science
    Item ID: 1819
    Date Deposited: 19 Sep 2012 16:14
    Last Modified: 26 Sep 2012 12:01
    URI: http://eprints.fri.uni-lj.si/id/eprint/1819

    Actions (login required)

    View Item