ePrints.FRI - University of Ljubljana, Faculty of Computer and Information Science

Modeling chemical reactions of RNA-binding proteins with machine learning

Jernej Henigman (2015) Modeling chemical reactions of RNA-binding proteins with machine learning. EngD thesis.

[img]
Preview
PDF
Download (1148Kb)

    Abstract

    In this thesis machine learning methods are used to classify chemical reactions. At the same time the most important changes in molecular structure are identified that are typical for chemical reactions of RNA-binding proteins. In the first part, six basic groups of chemical reactions were used to determine the optimal set of parameters for modeling and prediction. Three groups of parameter sets were tested: methods for balancing the learning set (seven methods), methods for molecular fingerprinting (seven methods) and predictive models (five methods). Empirically is shown that the best combination consists of the following parameters: random undersampling as balancing method, Morgan+MorganBitVector for molecular fingerprinting and random forest as predictive model, with which average AUC 0.97 was achieved. For the second part, the optimal set of parameters is used to discriminate between chemical reactions associated with RNA-binding proteins and those chemical reactions associated with non RNA-binding proteins. AUC score 0.77 was achieved.

    Item Type: Thesis (EngD thesis)
    Keywords: machine learning, chemical reactions, balancing methods, molecular fingerprints, predictive models, AUC, RNA, molecular structure
    Number of Pages: 56
    Language of Content: Slovenian
    Mentor / Comentors:
    Name and SurnameIDFunction
    doc. dr. Tomaž Curk299Mentor
    Link to COBISS: http://www.cobiss.si/scripts/cobiss?command=search&base=51012&select=(ID=1536484803)
    Institution: University of Ljubljana
    Department: Faculty of Computer and Information Science
    Item ID: 3021
    Date Deposited: 18 Aug 2015 16:48
    Last Modified: 18 Sep 2015 12:18
    URI: http://eprints.fri.uni-lj.si/id/eprint/3021

    Actions (login required)

    View Item