ePrints.FRI - University of Ljubljana, Faculty of Computer and Information Science

Visualization of baby names combined with IMDb and Wikipedia data

Aleks Kobentar (2019) Visualization of baby names combined with IMDb and Wikipedia data. EngD thesis.

Download (1600Kb)


    This Bachelor’s Thesis describes the process of collecting and visualizing data from different sources. There are three different data sources. The first source is from the Statistical office of Slovenia where there is data about the number of baby names occurring from 1992 to 2017. The second sourse is the IMDb database, which has data about actors and movies. The third data source is the free Wikipedia encyclopedia , which holds interesting data about names. To be able to merge all the datasources requires a great range of frameworks. For importing the data, the programming language Python is used. For data storage about the number of babynames, the unrelation database Elasticsearch is used. For the exchange of data which is stored on the inter- net or on local machine servers, either Python or Node.js. are implemented. In addition, the basic web technologies JavaScript and D3.js are the main tools for data visualization.

    Item Type: Thesis (EngD thesis)
    Keywords: Web server, database, source of data, visualization, baby names.
    Number of Pages: 41
    Language of Content: Slovenian
    Mentor / Comentors:
    Name and SurnameIDFunction
    viš. pred. dr. Alenka Kavčič264Mentor
    as. dr. Matevž PesekComentor
    Link to COBISS: http://www.cobiss.si/scripts/cobiss?command=search&base=51012&select=(ID=1538207171)
    Institution: University of Ljubljana
    Department: Faculty of Computer and Information Science
    Item ID: 4421
    Date Deposited: 29 Mar 2019 14:43
    Last Modified: 17 Apr 2019 11:08
    URI: http://eprints.fri.uni-lj.si/id/eprint/4421

    Actions (login required)

    View Item