Thesis (Selection of subject)Thesis (Selection of subject)(version: 368)
Thesis details
   Login via CAS
Generating of Synthetic XML Data
Thesis title in Czech: Generování syntetických XML dokumentů
Thesis title in English: Generating of Synthetic XML Data
Key words: XML, DTD, XPath, generování dat, syntetické dokumenty
English key words: XML, DTD, XPath, data generating, synthetic documents
Academic year of topic announcement: 2012/2013
Thesis type: diploma thesis
Thesis language: angličtina
Department: Department of Software Engineering (32-KSI)
Supervisor: doc. RNDr. Irena Holubová, Ph.D.
Author: hidden - assigned and confirmed by the Study Dept.
Date of registration: 06.11.2012
Date of assignment: 06.11.2012
Confirmed by Study dept. on: 27.11.2012
Date and time of defence: 26.05.2014 00:00
Date of electronic submission:07.04.2014
Date of submission of printed version:07.04.2014
Date of proceeded defence: 26.05.2014
Opponents: Mgr. Marek Polák, Ph.D.
 
 
 
Guidelines
The aim of this work is a research on possibilities and limitations of automatic generating of synthetic XML documents for the purpose of testing of XML applications. First of all it is necessary to analyze existing data generators and to discuss their advantages and disadvantages. The core of the work will be a proposal and implementation of a system that will solve selected problems of the existing tools. The work will include suitable experimental results that will provide the proof of the concept.
References
W3C. W3C Technical Reports and Publications. http://www.w3.org/TR/

XML benchmarking projects:
XMark http://monetdb.cwi.nl/xml/
XOO7 Benchmark http://www.comp.nus.edu.sg/~ebh/XOO7.html
XMach-1 http://dbs.uni-leipzig.de/en/projekte/XML/XmlBenchmarking.html
The Michigan Benchmark http://www.eecs.umich.edu/db/mbench/
XBench http://se.uwaterloo.ca/~ddbms/projects/xbench/
XPathMark http://users.dimi.uniud.it/~massimo.franceschet/xpathmark/
MemBeR: XQuery Micro-Benchmark Repository http://ilps.science.uva.nl/Resources/MemBeR/index.html
TPoX http://tpox.sourceforge.net/

Data generators:
ToXgene http://www.alphaworks.ibm.com/tech/toxgene
A. Aboulnaga, J. F. Naughton, and C. Zhang. Generating Synthetic Complex-Structured XML Data. In WebDB'01: Proc. of the 4th Int. Workshop on the Web and Databases, pages 79-84, Washington, DC, USA, 2001.
L. Afanasiev, I. Manolescu, and P. Michiels. MemBeR XML Generator. http://ilps.science.uva.nl/Resources/MemBeR/member-generator.html
P. Azalov and F. Zlatarova. SDG - A System for Synthetic Data Generation. In ITCC'03: Proc of the Int. Conf. on Information Technology: Computers and Communications, pages 69-75, Washington, DC, USA, 2003. IEEE Computer Society.

Mlynkova, I. - Toman, K. - Pokorný, J.: Statistical Analysis of Real XML Data Collections. Technical report 2006/5. Charles University, Prague, Czech Republic, June 2006, 43 pages. http://www.ksi.mff.cuni.cz/~mlynkova/doc/tr2006-5.pdf

Maroš Vranec: XML Benchmarking http://www.ksi.mff.cuni.cz/~mlynkova/dp/Vranec.pdf

Mlynkova, I.: XML Benchmarking: Limitations and Opportunities. Technical report 2008/1. Charles University, Prague, Czech Republic, January 2008, 23 pages. http://www.ksi.mff.cuni.cz/~mlynkova/doc/tr2008-1.pdf
 
Charles University | Information system of Charles University | http://www.cuni.cz/UKEN-329.html