ePrints.FRI - University of Ljubljana, Faculty of Computer and Information Science

Data visualization using machine learning

Gregor Leban (2007) Data visualization using machine learning. PhD thesis.

[img] PDF (Disertacija)
Download (3064Kb)


    Data visualization is a tool that has an enormous potential for extracting knowledge from data. Visualizing the right set of features in a right way can clearly identify interesting and potentially useful patterns. However, not all data projections are equally interesting and the task of a data miner is to find the most insightful ones. To help the user we developed a method called VizRank, which can automatically compute an estimate of interestingness for each of possible projections of class labeled data. We can rank projections according to this score and then focus only on a small subset of best ranked projections, that will provide the greatest insight into the data. VizRank can be applied on any visualization method that maps attribute values to the position of a shown symbol. Examples of such methods are scatterplot, radviz, polyviz and general linear projections. We also extended the concept of projection ranking to parallel coordinates method and to mosaic diagrams. To demonstrate the usefulness of the developed algorithms we present results on data sets from UCI repository and from cancer microarray data analysis.

    Item Type: Thesis (PhD thesis)
    Language of Content: Slovenian
    Mentor / Comentors:
    Name and SurnameIDFunction
    prof. dr. Ivan Bratko77Mentor
    izr. prof. dr. Blaž Zupan106Comentor
    Link to COBISS: http://www.cobiss.si/scripts/cobiss?command=search&base=50070&select=(ID=6128212)
    Institution: University of Ljubljana
    Department: Faculty of Computer and Information Science
    Item ID: 706
    Date Deposited: 08 Dec 2008 18:52
    Last Modified: 13 Aug 2011 00:34
    URI: http://eprints.fri.uni-lj.si/id/eprint/706

    Actions (login required)

    View Item