String similarity measures

Mojca Lorber (2016) String similarity measures. EngD thesis.

Abstract

The thesis String similarity measures examines the string matching problem in the setting where matches are allowed to contain errors. This problem is also called the approximate string matching problem, and an essential part of it is the definition of the error model, which in turn determines the type of similarity or dissimilarity measure. The thesis begins with a general overview of measures and then focuses on the group of measures based on edit operations on strings. The distance between two strings is defined as the cost of the operations needed for an optimal transformation of one string into the other. We then describe a few algorithms based on dynamic programming, followed by a couple of improved versions. With the help of an example, we demonstrate their performance and analyse their computational complexity.
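As an illustration of the kind of measure the abstract refers to, the following is a minimal sketch of an edit distance computed with dynamic programming. It is not taken from the thesis: the function name `edit_distance` and the unit cost assigned to every insertion, deletion, and substitution are assumptions made for this example, and the thesis may use different cost models or algorithm variants.

```python
def edit_distance(source: str, target: str) -> int:
    """Unit-cost edit distance between two strings (illustrative sketch).

    Assumption: insertion, deletion, and substitution each cost 1.
    """
    # previous[j] is the distance between the already processed prefix
    # of source and the prefix target[:j]; initially the prefix is empty,
    # so the distance is j insertions.
    previous = list(range(len(target) + 1))
    for i, s_char in enumerate(source, start=1):
        # Transforming source[:i] into the empty string takes i deletions.
        current = [i]
        for j, t_char in enumerate(target, start=1):
            substitution = previous[j - 1] + (s_char != t_char)
            deletion = previous[j] + 1
            insertion = current[j - 1] + 1
            current.append(min(substitution, deletion, insertion))
        previous = current
    return previous[-1]


print(edit_distance("kitten", "sitting"))
```

For strings of lengths m and n this sketch takes O(mn) time and, because only two rows of the table are kept, O(n) space.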

Item Type:

Thesis (EngD thesis)

Keywords:

similarity, dissimilarity, similarity measure, string matching, string alignment, edit distance, longest common subsequence

Number of Pages:

Language of Content:

Slovenian

Mentor / Comentors:

Name and Surname	ID	Function
doc. dr. Jurij Mihelič	305	Mentor

Link to COBISS:

http://www.cobiss.si/scripts/cobiss?command=search&base=51012&select=(ID=1536792259 )

Institution:

University of Ljubljana

Department:

Faculty of Computer and Information Science

Item ID:

3250

Date Deposited:

15 Feb 2016 15:26

Last Modified:

25 Feb 2016 13:07

URI:

http://eprints.fri.uni-lj.si/id/eprint/3250
