Similarity of XML Data
Thesis title in Czech: | Podobnost XML dat |
---|---|
Thesis title in English: | Similarity of XML Data |
Academic year of topic announcement: | 2006/2007 |
Thesis type: | diploma thesis |
Thesis language: | English |
Department: | Department of Software Engineering (32-KSI) |
Supervisor: | doc. RNDr. Irena Holubová, Ph.D. |
Author: | hidden |
Date of registration: | 09.11.2006 |
Date of assignment: | 09.11.2006 |
Date and time of defence: | 26.05.2008 00:00 |
Date of electronic submission: | 26.05.2008 |
Date of submission of printed version: | 26.05.2008 |
Date of completed defence: | 26.05.2008 |
Opponents: | doc. Mgr. Martin Nečaský, Ph.D. |
Guidelines |
One way to enhance XML data management tools is to store and manage similar XML documents in the same or a similar way, i.e. to exploit the idea of clustering. This requires a suitable technique for measuring similarity among XML documents, among XML schemas, or between documents and schemas.
The aim of this work is to investigate the various aspects of this problem and its limitations. First, existing solutions must be analyzed and their advantages and disadvantages discussed. The core of the work is the proposal and implementation of an original method for similarity evaluation that addresses the disadvantages and shortcomings found. The work will include suitable experimental results. |
References |
1. Extensible Markup Language (XML) 1.0 (Fourth Edition). 2000. W3C Recommendation, 16 August 2006. http://www.w3.org/TR/REC-xml
2. W3C. W3C Technical Reports and Publications. http://www.w3.org/TR/
3. Mlýnková, I., Pokorný, J., Richta, K., Toman, K., Toman, V.: Technologie XML. Lecture notes. Karlova Univerzita, Praha, Czech Republic, September 2006.
4. A. Nierman and H. V. Jagadish. Evaluating Structural Similarity in XML Documents. In Proceedings of the Fifth International Workshop on the Web and Databases - WebDB 2002, Madison, Wisconsin, USA, 2002.
5. T. Jiang, L. Wang, and K. Zhang. Alignment of Trees - An Alternative to Tree Edit. Theor. Comput. Sci., 143(1):137-148, 1995.
6. Z. Zhang, R. Li, S. Cao, and Y. Zhu. Similarity Metric for XML Documents. In Proceedings of FGWM03: Workshop on Knowledge and Experience Management, Karlsruhe, Germany, 2003.
7. E. Bertino, G. Guerrini, and M. Mesiti. A Matching Algorithm for Measuring the Structural Similarity between an XML Document and a DTD and its Applications. Inf. Syst., 29(1):23-46, 2004.
8. P. K.L. Ng and V. T.Y. Ng. Structural Similarity between XML Documents and DTDs. In Springer Berlin / Heidelberg, pages 412-421. Lecture Notes in Computer Science, 2003.
9. M. L. Lee, L. H. Yang, W. Hsu, and X. Yang. XClust: Clustering XML Schemas for Effective Integration. In CIKM '02: Proceedings of the Eleventh International Conference on Information and Knowledge Management, pages 292-299, New York, NY, USA, 2002. ACM Press.
10. E. Rahm and P. A. Bernstein. A Survey of Approaches to Automatic Schema Matching. The VLDB Journal, 10(4):334-350, 2001.
11. H. Do, S. Melnik, and E. Rahm. Comparison of Schema Matching Evaluations. In Revised Papers from the NODe 2002 Web and Database-Related Workshops on Web, Web-Services, and Database Systems, pages 221-237, London, UK, 2003. Springer-Verlag. |