ePrints.FRI - University of Ljubljana, Faculty of Computer and Information Science

Constructing survival curves from censored data with machine learning methods

Aljaž Košmerlj (2008) Constructing survival curves from censored data with machine learning methods. EngD thesis.

[img] PDF
Download (466Kb)


    In the present thesis I introduce and evaluate a new machine learning method for estimating survival functions from survival analysis data. Firstly, I describe the field of survival analysis and the problems it deals with. I introduce and define the basic terms of survival analysis, like survival function and survival curve. I also define censored data, a speciallity of survival analysis data, and explain their importance and the learning problems they cause. As a reference method I describe the Kaplan-Meier estimator, a well-known statistical method for estimating survival curves, that serves as a conceptual basis for the new proposed method. I close the introduction with a short overview of the advances of machine learning in the field of survival analysis, concluding that so far there are no well established meachine learning methods in this field. I continue with an in depth description of the proposed method and it's potential advantages. To test the new method thoroughly I start with a series of tests on artificially generated data from a physics domain. The new method proves itself useful and can match the accuracy of the Kaplan-Meier estimator. I discuss the problem of nonmonotonic survival curve estimations, that can be obtained using the proposed method. All the tests are repeated on a set of real medical data describing the prognostic value of protein markers for survival of metastatic breast cancer patients. The results further confirm the proposed method as useful. In conclusion I present the possibilities of improving the proposed method and suggest other prospects of using machine learning techniques in survival analysis.

    Item Type: Thesis (EngD thesis)
    Keywords: Survival analysis, survival curve, censored data, time to event data, Kaplan-Meier estimator, machine learning
    Number of Pages: 56
    Language of Content: Slovenian
    Mentor / Comentors:
    Name and SurnameIDFunction
    prof. dr. Ivan Bratko77Mentor
    Link to COBISS: http://www.cobiss.si/scripts/cobiss?command=search&base=50070&select=(ID=6750292)
    Institution: University of Ljubljana
    Department: Faculty of Computer and Information Science
    Item ID: 271
    Date Deposited: 23 Oct 2008 12:20
    Last Modified: 13 Aug 2011 00:32
    URI: http://eprints.fri.uni-lj.si/id/eprint/271

    Actions (login required)

    View Item