ePrints.FRI - University of Ljubljana, Faculty of Computer and Information Science

Simplification of ETL processes using Talend Platform

Tomaž Čufer (2015) Simplification of ETL processes using Talend Platform. EngD thesis.

[img]
Preview
PDF
Download (1931Kb)

    Abstract

    The ETL process presents a broad concept of extracting, transforming and loading data. Each of these phases needs to be well defined to transfer the data efficiently to a different location or transform it into the demanded form. Unstructured forms of data along with its huge volume, which is common nowadays, makes this process even more difficult, and is reflected in the longer execution time. With a suitable ETL tool it is possible to simplify the implementation process and assure better control over it. The thesis describes how to complete such simplifications using an appropriate tool in practice. Two commercial and open source tools were compared. Talend tool was chosen and its workflow was later presented in detail. Handling management and integration problems of data is described, where the used data came from web scraping and the Twitter social network. At the end, a SWOT analysis was made for Talend tool.

    Item Type: Thesis (EngD thesis)
    Keywords: job, process, data integration, Talend, tool, data warehouse
    Number of Pages: 42
    Language of Content: Slovenian
    Mentor / Comentors:
    Name and SurnameIDFunction
    doc. dr. Dejan Lavbič302Mentor
    Link to COBISS: http://www.cobiss.si/scripts/cobiss?command=search&base=51012&select=(ID=1536264131)
    Institution: University of Ljubljana
    Department: Faculty of Computer and Information Science
    Item ID: 2961
    Date Deposited: 19 Mar 2015 15:41
    Last Modified: 09 Apr 2015 14:18
    URI: http://eprints.fri.uni-lj.si/id/eprint/2961

    Actions (login required)

    View Item