ePrints.FRI - University of Ljubljana, Faculty of Computer and Information Science

String similarity measures

Mojca Lorber (2016) String similarity measures. EngD thesis.



    The thesis String similarity measures examines the string matching problem in the setting where matches are allowed to contain errors. This problem is also called approximate string matching, and an essential part of it is the definition of the error model, which in turn determines the type of similarity or dissimilarity measure. The thesis opens with a general overview of measures and then focuses on the group of measures based on edit operations on strings. For these measures, the distance between two strings is defined as the cost of the operations needed for an optimal transformation of one string into the other. We then describe several algorithms based on dynamic programming, followed by a couple of improved versions. Using an example, we demonstrate their performance and analyse their computational complexity.
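
    As a rough illustration of the kind of dynamic-programming algorithm the abstract refers to, the following Python sketch computes an edit distance with insertion, deletion and substitution. It is a textbook formulation and is not taken from the thesis; the function name `edit_distance`, the parameter names and the default unit costs are assumptions made for this example.

```python
def edit_distance(source, target, ins_cost=1, del_cost=1, sub_cost=1):
    """Minimal total cost of turning `source` into `target`.

    Illustrative sketch only: names and default costs are assumptions,
    not the notation used in the thesis.
    """
    m, n = len(source), len(target)

    # dist[i][j] holds the cheapest cost of transforming
    # source[:i] into target[:j].
    dist = [[0] * (n + 1) for _ in range(m + 1)]

    # Turning a prefix into the empty string needs only deletions.
    for i in range(1, m + 1):
        dist[i][0] = i * del_cost

    # Building a prefix from the empty string needs only insertions.
    for j in range(1, n + 1):
        dist[0][j] = j * ins_cost

    for i in range(1, m + 1):
        for j in range(1, n + 1):
            same = source[i - 1] == target[j - 1]
            dist[i][j] = min(
                dist[i - 1][j] + del_cost,      # delete source[i-1]
                dist[i][j - 1] + ins_cost,      # insert target[j-1]
                dist[i - 1][j - 1] + (0 if same else sub_cost),  # match or substitute
            )

    return dist[m][n]


if __name__ == "__main__":
    print(edit_distance("kitten", "sitting"))
```

    For strings of lengths m and n, this table-based version uses O(mn) time and O(mn) space. Setting `sub_cost` to the sum of the insertion and deletion costs makes substitutions no cheaper than a deletion followed by an insertion, which ties the result to the length of the longest common subsequence listed among the keywords.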

    Item Type: Thesis (EngD thesis)
    Keywords: similarity, dissimilarity, similarity measure, string matching, string alignment, edit distance, longest common subsequence
    Number of Pages: 91
    Language of Content: Slovenian
    Mentor / Comentors:
        Name and Surname: doc. dr. Jurij Mihelič
        ID: 305
        Function: Mentor
    Link to COBISS: http://www.cobiss.si/scripts/cobiss?command=search&base=51012&select=(ID=1536792259 )
    Institution: University of Ljubljana
    Department: Faculty of Computer and Information Science
    Item ID: 3250
    Date Deposited: 15 Feb 2016 15:26
    Last Modified: 25 Feb 2016 13:07
    URI: http://eprints.fri.uni-lj.si/id/eprint/3250
