ePrints.FRI - University of Ljubljana, Faculty of Computer and Information Science

Forecasting backup storage consumption

Blaž Koncilja (2016) Forecasting backup storage consumption. EngD thesis.

[img] PDF
Download (491Kb)


    Storage needs for archiving data are increasing. Companies need to store more and more data to function normally. Storing this data can be costly, that is why we want to provide sufficient storage capacity to meet the demands and not exceed them which brings additional costs. With the help of data mining we are trying to forecast trends in storage consumption. We acquired data from two environments for archiving and saved them to a database. We analysed data consumption trends with linear regression, piecewise linear regression and k-nearest neighbours. Piecewise linear regression proved to be the most accurate and reliable. Even though results are good enough to be implemented into production, we should be cautious as the two environments have different characteristics and this influences the forecasting.

    Item Type: Thesis (EngD thesis)
    Keywords: data mining, procedure CRISP-DM, piecewise linear regression, data preparation
    Number of Pages: 41
    Language of Content: Slovenian
    Mentor / Comentors:
    Name and SurnameIDFunction
    izr. prof. dr. Marko Robnik Šikonja276Mentor
    Link to COBISS: http://www.cobiss.si/scripts/cobiss?command=search&base=51012&select=(ID=1537116867)
    Institution: University of Ljubljana
    Department: Faculty of Computer and Information Science
    Item ID: 3509
    Date Deposited: 06 Sep 2016 14:51
    Last Modified: 19 Sep 2016 11:23
    URI: http://eprints.fri.uni-lj.si/id/eprint/3509

    Actions (login required)

    View Item