Témata prací (Výběr práce)Témata prací (Výběr práce)(verze: 368)
Detail práce
   Přihlásit přes CAS
Inference of Advanced Schemas for XML Documents Using Non-W3C Schema Languages
Název práce v češtině:
Název v anglickém jazyce: Inference of Advanced Schemas for XML Documents Using Non-W3C Schema Languages
Klíčová slova: XML, schema, inference, XML schema languages
Klíčová slova anglicky: XML, schema, inference, XML schema languages
Akademický rok vypsání: 2010/2011
Typ práce: diplomová práce
Jazyk práce: angličtina
Ústav: Katedra softwarového inženýrství (32-KSI)
Vedoucí / školitel: RNDr. Jakub Klímek, Ph.D.
Řešitel: skrytý - zadáno a potvrzeno stud. odd.
Datum přihlášení: 12.11.2010
Datum zadání: 12.11.2010
Zásady pro vypracování
Inference of XML schemas from sets of XML documents is not a new concept. However, with the development of recent, more effective and user friendlier languages such as Schematron and Relax NG, it may be possible to infer XML schemas with advanced properties. Firstly, it is necessary to analyze potential benefits of more powerful schema languages and discuss their advantages and disadvantages in contrast to existing approaches. The aim of this work is a research on potential improvements over current (semi)automatic methods of schema inference with regard to more accurate expression of data coherence. The core of the work is a proposal and implementation of a new method focusing on better expression of constraints and supplementation of weak points of grammar-based schema languages. The work will include suitable experimental results.
Seznam odborné literatury
[1] ISO/IEC 19757-3:2006: Information technology -- Document Schema Definition Language (DSDL) -- Part 3: Rule-based validation -- Schematron http://standards.iso.org/ittf/PubliclyAvailableStandards/c040833_ISO_IEC_19757-3_2006(E).zip

[2] ISO/IEC 19757-2:2008: Document Schema Definition Language (DSDL) -- Part 2: Regular-grammar-based validation -- RELAX NG http://standards.iso.org/ittf/PubliclyAvailableStandards/c052348_ISO_IEC_19757-2_2008(E).zip

[3] XML Path Language (XPath) 2.0, W3C Recommendation 23 January 2007. http://www.w3.org/TR/xpath20/

[4] Mlynkova, I.: An Analysis of Approaches to XML Schema Inference. SITIS'08, Bali, Indonesia, November/December 2008. IEEE Computer Society Press, 2008.

[5] Opocenska, K. - Kopecky, M.: Incox - a Language for XML Integrity Constraints Description. In DATESO'08, pages 1-12. CEUR-WS.org, 2008.

[6] Vošta, O.: Automatická konstrukce schématu pro množinu XML dokumentů. Diplomová práce, MFF UK, 2005. http://www.ksi.mff.cuni.cz/~mlynkova/dp/Vosta.pdf

[7] Vyhnanovská, J.: Automatic Construction of an XML Schema for a Given Set of XML Documents. Diplomová práce, MFF UK, 2009. http://www.ksi.mff.cuni.cz/~mlynkova/dp/Vyhnanovska.pdf

[8] Moh, C.-H. - Lim, E.-P. - Ng, W.-K.: Re-engineering Structures from Web Documents. In DL '00: Proc. of the 5th ACM Conf. on Digital Libraries, pages 67-76, New York, NY, USA, 2000. ACM Press.

[9] Fassetti. F. - Fazzinga, B.: FOX: Inference of Approximate Functional Dependencies from XML Data. In DEXA'07, pages 10-14, Washington, DC, USA, 2007. IEEE.

[10] Fan, W.: XML Constraints: Specification, Analysis, and Applications. In DEXA'05, pages 805-809, IEEE, 2005.
 
Univerzita Karlova | Informační systém UK