ePrints.FRI - University of Ljubljana, Faculty of Computer and Information Science

Modelling multivariate discrete data with latent Gaussian processes

Aleksandar Dimitriev (2017) Modelling multivariate discrete data with latent Gaussian processes. MSc thesis.

Download (560Kb)


    Multivariate count data are common in some fields, such as sports, neuroscience, and text mining. Models that can accurately perform factor analysis are required, especially for structured data, such as time-series count matrices. We present Poisson Factor Analysis using Latent Gaussian Processes, a novel method for analyzing multivariate count data. Our approach allows for non-i.i.d observations, which are linked in the latent space using a Gaussian Process. Due to an exponential non-linearity in the model, there is no closed form solution. Thus, we resort to an expectation maximization approach with a Laplace approximation for tractable inference. We present results on several data sets, both synthetic and real, of a comparison with other factor analysis methods. Our method is both qualitatively and quantitatively superior for non-i.i.d Poisson data, because the assumptions it makes are well suited for the data.

    Item Type: Thesis (MSc thesis)
    Keywords: factor analysis, Gaussian process, latent space, Poisson, count data
    Number of Pages: 44
    Language of Content: English
    Mentor / Comentors:
    Name and SurnameIDFunction
    doc. dr. Erik Štrumbelj5570Mentor
    Link to COBISS: http://www.cobiss.si/scripts/cobiss?command=search&base=51012&select=(ID=1537369283)
    Institution: University of Ljubljana
    Department: Faculty of Computer and Information Science
    Item ID: 3790
    Date Deposited: 14 Feb 2017 08:58
    Last Modified: 06 Mar 2017 10:16
    URI: http://eprints.fri.uni-lj.si/id/eprint/3790

    Actions (login required)

    View Item