ePrints.FRI - University of Ljubljana, Faculty of Computer and Information Science

Extraction and processing data from the web

Marko Balažic (2015) Extraction and processing data from the web. EngD thesis.

Download (1776Kb)


    Web users are searching information on the internet on daily basis. Easiest way to acquire data from the internet is by using search engines, that provide us with many different results. Despite the fact these results are being well chosen for us, there is still a great deal of filtering involved when looking for vital information. In my thesis I have also dealt with solving this problem myself. I wrote a web application in which I merged the information from several domains. The application’s dashboard enables the user a complete overview of snippets from various websites. The user can add and edit the snippets themselves. One can set up alarms through which the system informs the user of changes that have occurred. I implemented the system with the help of web crawlers that scrape data and saves it to a database.

    Item Type: Thesis (EngD thesis)
    Keywords: spider, scraping, application, servis, extension, notification
    Number of Pages: 53
    Language of Content: Slovenian
    Mentor / Comentors:
    Name and SurnameIDFunction
    prof. dr. Marko Bajec245Mentor
    Link to COBISS: http://www.cobiss.si/scripts/cobiss?command=search&base=51012&select=(ID=1536570819 )
    Institution: University of Ljubljana
    Department: Faculty of Computer and Information Science
    Item ID: 3114
    Date Deposited: 15 Sep 2015 17:18
    Last Modified: 16 Oct 2015 13:01
    URI: http://eprints.fri.uni-lj.si/id/eprint/3114

    Actions (login required)

    View Item