Marinka Zitnik and Blaz Zupan (2014) Matrix factorization-based data fusion for gene function prediction in baker's yeast and slime mold. In: PSB, Jan 2014, The Big Island of Hawaii.
Abstract
The development of effective methods for the characterization of gene functions that are able to combine diverse data sources in a sound and easily-extendible way is an important goal in computational biology. We have previously developed a general matrix factorization-based data fusion approach for gene function prediction. In this manuscript, we show that this data fusion approach can be applied to gene function prediction and that it can fuse various heterogeneous data sources, such as gene expression profiles, known protein annotations, interaction and literature data. The fusion is achieved by simultaneous matrix tri-factorization that shares matrix factors between sources. We demonstrate the effectiveness of the approach by evaluating its performance on predicting ontological annotations in slime mold D. discoideum and on recognizing proteins of baker's yeast S. cerevisiae that participate in the ribosome or are located in the cell membrane. Our approach achieves predictive performance comparable to that of the state-of-the-art kernel-based data fusion, but requires fewer data preprocessing steps.
Item Type: | Conference or Workshop Item (Paper) |
Keywords: | gene function prediction, data fusion, matrix factorization, Gene Ontology annotation, membrane protein, ribosomal protein |
Related URLs: | |
Institution: | University of Ljubljana |
Department: | Faculty of Computer and Information Science |
Divisions: | Faculty of Computer and Information Science > Bioinformatics Laboratory |
Item ID: | 2334 |
Date Deposited: | 30 Jan 2014 17:35 |
Last Modified: | 26 Mar 2014 13:42 |
URI: | http://eprints.fri.uni-lj.si/id/eprint/2334 |
---|
Actions (login required)