ePrints.FRI - University of Ljubljana, Faculty of Computer and Information Science

Analysis of catastrophic forgetting during incremental learning of classificational convolutional neural network of classificational convolutional neural network

Jakob Božič (2019) Analysis of catastrophic forgetting during incremental learning of classificational convolutional neural network of classificational convolutional neural network. EngD thesis.

[img]
Preview
PDF
Download (4Mb)

    Abstract

    Catastrophic forgetting is phenomenon when an artificial neural network immediately and almost completely forgets previously learned tasks when trained incrementally on new ones. It is a well-known problem and although there are many approaches to alleviating it, none of them solves it completely. We experimentally check for main causes of catastrophic forgetting. Analysis is performed on a deep convolutional neural network for image classification. Results are interpreted by confusion matrices and classification accuracy graphs, we also visualize changes of weights and biases of network. Analytical findings serve as a basis for designing different approaches to updating network parameters, aiming to prevent or alleviate catastrophic forgetting. We also evaluate effects of availability of Oracle, capable of determining subset of all possible classes for classification, when using the network. We implement one of existing approaches to preventing catastrophic forgetting and adapt it to work without Oracle. Findings, presented in thesis serve as a starting point for design of new approaches aimed at preventing catastrophic forgetting.

    Item Type: Thesis (EngD thesis)
    Keywords: catastrophic forgetting, incremental learning, convolutional neural networks, classification
    Number of Pages: 69
    Language of Content: Slovenian
    Mentor / Comentors:
    Name and SurnameIDFunction
    izr. prof. dr. Danijel Skočaj296Mentor
    Link to COBISS: http://www.cobiss.si/scripts/cobiss?command=search&base=51012&select=(ID=1538240963)
    Institution: University of Ljubljana
    Department: Faculty of Computer and Information Science
    Item ID: 4433
    Date Deposited: 22 May 2019 14:54
    Last Modified: 07 Jun 2019 11:11
    URI: http://eprints.fri.uni-lj.si/id/eprint/4433

    Actions (login required)

    View Item