ePrints.FRI - University of Ljubljana, Faculty of Computer and Information Science

VizRank: Data Visualization Guided by Machine Learning

Gregor Leban and Blaz Zupan and Gaj Vidmar and Ivan Bratko (2006) VizRank: Data Visualization Guided by Machine Learning. Data Mining and Knowledge Discovery, 13 (2). pp. 119-136.

[img]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
382Kb

Abstract

Data visualization plays a crucial role in identifying interesting patterns in exploratory data analysis. Its use is, however, made difficult by the large number of possible data projections showing different attribute subsets that must be evaluated by the data analyst. In this paper, we introduce a method called VizRank, which is applied on classified data to automatically select the most useful data projections. VizRank can be used with any visualization method that maps attribute values to points in a two-dimensional visualization space. It assesses possible data projections and ranks them by their ability to visually discriminate between classes. The quality of class separation is estimated by computing the predictive accuracy of k-nearest neighbor classifier on the data set consisting of x and y positions of the projected data points and their class information. The paper introduces the method and presents experimental results which show that VizRank's ranking of projections highly agrees with subjective rankings by data analysts. The practical use of VizRank is also demonstrated by an application in the field of functional genomics.

Item Type:Article
Keywords:Data Visualization, Data Mining, Visual Data Mining, Machine Learning, Exploratory Data Analysis
Language of Content:English
Institution:University of Ljubljana
Department:Faculty of Computer and Information Science
Divisions:Faculty of Computer and Information Science > Artificial Intelligence Laboratory
ID Code:210
Deposited On:25 May 2006
Last Modified:29 Oct 2009 11:27

Repository Staff Only: item control page