ePrints.FRI - University of Ljubljana, Faculty of Computer and Information Science

Data deduplication in a data store

Dejan Kovač (2014) Data deduplication in a data store. EngD thesis.


    In our thesis we presented the area of data deduplication and implemented an object storage algorithm that eliminates duplicate chunks within the stored objects. In the first part we presented a storage system as a tree-like structure of directories and files. We described the features of storage systems and simple ways of storing data on a medium. We examined in detail the properties of the distributed storage system Ceph, its components, and its operation. In the second part we presented deduplication as an important feature of modern storage systems. We surveyed deduplication techniques for centralized as well as distributed systems. In the last part we implemented an example deduplication technique together with a simple object storage system. Using the described techniques, we implemented detection of variable-length duplicate chunks within objects and added CLI tools for manipulating objects in the store.
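
    The idea of variable-length chunk deduplication in an object store can be illustrated with a minimal sketch. It is not the implementation from the thesis: the rolling-hash parameters (WINDOW, MASK, MIN_CHUNK, MAX_CHUNK, BASE, MOD) and the names split_chunks and DedupStore are assumptions chosen only for illustration. Chunk boundaries are placed where a polynomial rolling hash over the last WINDOW bytes matches a bit pattern, and each chunk is stored once under its SHA-256 digest.

```python
import hashlib

# All constants below are illustrative assumptions, not values from the thesis.
WINDOW = 16            # bytes covered by the rolling hash
MIN_CHUNK = 16         # never cut a chunk shorter than this
MAX_CHUNK = 256        # always cut once a chunk reaches this length
MASK = (1 << 6) - 1    # cut when the low 6 hash bits are all ones
BASE = 257
MOD = (1 << 31) - 1
POW = pow(BASE, WINDOW, MOD)  # weight of the byte that leaves the window


def split_chunks(data: bytes) -> list[bytes]:
    """Split data into variable-length chunks at content-defined boundaries."""
    chunks = []
    start = 0
    h = 0
    for i, b in enumerate(data):
        h = (h * BASE + b) % MOD
        if i - start >= WINDOW:
            # Drop the byte that just slid out of the window.
            h = (h - data[i - WINDOW] * POW) % MOD
        length = i - start + 1
        if (length >= MIN_CHUNK and (h & MASK) == MASK) or length >= MAX_CHUNK:
            chunks.append(data[start:i + 1])
            start = i + 1
            h = 0
    if start < len(data):
        chunks.append(data[start:])  # trailing partial chunk
    return chunks


class DedupStore:
    """Hypothetical in-memory object store that keeps each distinct chunk once."""

    def __init__(self):
        self.chunks = {}    # SHA-256 hex digest -> chunk bytes
        self.objects = {}   # object name -> ordered list of digests

    def put(self, name: str, data: bytes) -> None:
        recipe = []
        for chunk in split_chunks(data):
            digest = hashlib.sha256(chunk).hexdigest()
            self.chunks.setdefault(digest, chunk)  # store only if unseen
            recipe.append(digest)
        self.objects[name] = recipe

    def get(self, name: str) -> bytes:
        return b"".join(self.chunks[d] for d in self.objects[name])
```

    In this sketch, storing the same data under two names with put leaves the number of entries in chunks unchanged, and get reassembles an object from its list of digests. Because boundaries depend on content rather than on fixed offsets, objects that differ by a small insertion can typically still share most of their chunks.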

    Item Type: Thesis (EngD thesis)
    Keywords: deduplication, filesystem, data store
    Number of Pages: 45
    Language of Content: Slovenian
    Mentor / Comentors:
    Name and Surname           ID      Function
    doc. dr. Andrej Brodnik    5540    Mentor
    Link to COBISS: http://www.cobiss.si/scripts/cobiss?command=search&base=51012&select=(ID=1536186307)
    Institution: University of Ljubljana
    Department: Faculty of Computer and Information Science
    Item ID: 2845
    Date Deposited: 29 Oct 2014 16:50
    Last Modified: 05 Feb 2015 10:23
    URI: http://eprints.fri.uni-lj.si/id/eprint/2845
