Adaptive Similarity of XML Data
Thesis title in Czech: | Adaptive Similarity of XML Data |
---|---|
Thesis title in English: | Adaptive Similarity of XML Data |
Key words: | rozhodovací strom, podobnost schémat, PIM schéma, PSM schéma, XML Schema |
English key words: | decision tree, schema similarity, PIM schema, PSM schema, XML Schema |
Academic year of topic announcement: | 2010/2011 |
Thesis type: | diploma thesis |
Thesis language: | angličtina |
Department: | Department of Software Engineering (32-KSI) |
Supervisor: | doc. RNDr. Irena Holubová, Ph.D. |
Author: | hidden - assigned and confirmed by the Study Dept. |
Date of registration: | 09.11.2010 |
Date of assignment: | 09.11.2010 |
Date and time of defence: | 27.01.2014 10:00 |
Date of electronic submission: | 04.12.2013 |
Date of submission of printed version: | 06.12.2013 |
Date of proceeded defence: | 27.01.2014 |
Opponents: | RNDr. Martin Svoboda, Ph.D. |
Guidelines |
Exploitation of similarity of XML data is currently a typical optimization strategy for many related areas of their processing. However, most of the approaches suffer from the same problem caused by the fact that different similarity evaluations are suitable for different types of data. The current strategies are either fixed, or their calibration for particular type of data has to be done manually which is not an easy task.
The aim of this work is a research on various aspects of (semi-)automatic adaptive evaluation of similarity of XML data. Firstly, it is necessary to analyze existing solutions in the area of both XML and non-XML data and to discuss their advantages and disadvantages. The core of the work is a proposal and implementation of own method for similarity evaluation strategy focusing on the found disadvantages and shortcomings. The work will include suitable experimental results. |
References |
Mlynkova, I. - Necasky, M. - Pokorny, J. - Richta, K. - Toman, K. - Toman, V.: Technologie XML - Principy a aplikace v praxi. Grada Publishing, Prague, Czech Republic, zari 2008. ISBN 978-80-247-2725-7.
W3C. W3C Technical Reports and Publications. http://www.w3.org/TR/ Jakub Stárka. Similarity of XML Data. Master thesis. Charles University in Prague, Czech Republic, 2010. http://www.ksi.mff.cuni.cz/~mlynkova/dp/Starka.pdf E. Rahm and P. A. Bernstein. A Survey of Approaches to Automatic Schema Matching. The VLDB Journal, 10(4):334-350, 2001. H. Do, S. Melnik, and E. Rahm. Comparison of Schema Matching Evaluations. In Revised Papers from the NODe 2002 Web and Database-Related Workshops on Web, Web-Services, and Database Systems, pages 221-237, London, UK, 2003. Springer-Verlag. |