Thesis (Selection of subject) (version: 368)
Thesis details
Adaptive Similarity of XML Data
Thesis title in Czech: Adaptive Similarity of XML Data
Thesis title in English: Adaptive Similarity of XML Data
Key words: rozhodovací strom, podobnost schémat, PIM schéma, PSM schéma, XML Schema
English key words: decision tree, schema similarity, PIM schema, PSM schema, XML Schema
Academic year of topic announcement: 2010/2011
Thesis type: diploma thesis
Thesis language: English
Department: Department of Software Engineering (32-KSI)
Supervisor: doc. RNDr. Irena Holubová, Ph.D.
Author: hidden - assigned and confirmed by the Study Dept.
Date of registration: 09.11.2010
Date of assignment: 09.11.2010
Date and time of defence: 27.01.2014 10:00
Date of electronic submission: 04.12.2013
Date of submission of printed version: 06.12.2013
Date of defence held: 27.01.2014
Opponents: RNDr. Martin Svoboda, Ph.D.
 
 
 
Guidelines
Exploiting the similarity of XML data is currently a common optimization strategy in many areas of XML processing. However, most approaches share one problem: different similarity measures suit different types of data. Current strategies are either fixed, or their calibration for a particular type of data has to be done manually, which is not an easy task.
The aim of this work is to research various aspects of (semi-)automatic adaptive evaluation of the similarity of XML data. First, it is necessary to analyze existing solutions for both XML and non-XML data and to discuss their advantages and disadvantages. The core of the work is the proposal and implementation of a new similarity evaluation method that addresses the disadvantages and shortcomings found. The work will include suitable experimental results.
References
Mlynkova, I. - Necasky, M. - Pokorny, J. - Richta, K. - Toman, K. - Toman, V.: Technologie XML - Principy a aplikace v praxi. Grada Publishing, Prague, Czech Republic, September 2008. ISBN 978-80-247-2725-7.

W3C. W3C Technical Reports and Publications. http://www.w3.org/TR/

Jakub Stárka. Similarity of XML Data. Master thesis. Charles University in Prague, Czech Republic, 2010. http://www.ksi.mff.cuni.cz/~mlynkova/dp/Starka.pdf

E. Rahm and P. A. Bernstein. A Survey of Approaches to Automatic Schema Matching. The VLDB Journal, 10(4):334-350, 2001.

H. Do, S. Melnik, and E. Rahm. Comparison of Schema Matching Evaluations. In Revised Papers from the NODe 2002 Web and Database-Related Workshops on Web, Web-Services, and Database Systems, pages 221-237, London, UK, 2003. Springer-Verlag.
 
Charles University | Information system of Charles University | http://www.cuni.cz/UKEN-329.html