Rok Gomišček (2015) Visualization and interpretation of models obtained with non-negative matrix factorization. MSc thesis.
Abstract
Attributes that describe data in the databases present themselves in large numbers. For this reason defining truly important attributes for classification and establishing their mutual dependence poses a significant challenge. One way of reducing the dimensionality of the space and defining important attributes and examples is by using non-negative matrix factorization. In this master thesis we first examined the basics of non-negative matrix factorization and a few ways of visualizing the data and factor models in matrices. We propose a few ways of presenting and understanding the models acquired with factorization. We evaluated the effectiveness of the methods on several databases and learnt that each method reveals useful information about a model. Clustering of the factorized matrices can produce purer clusters than clustering of the source data. By projecting examples to the factor space we can see which factors affect certain classes. Adding attributes to this projection makes it possible to deduce the link between the examples and the attributes of the source space.
Actions (login required)