Thesis (Selection of subject)Thesis (Selection of subject)(version: 368)
Thesis details
   Login via CAS
Analysis of Real-World Data and Their Exploitation
Thesis title in Czech: Analýzy reálných dat a jejich využití
Thesis title in English: Analysis of Real-World Data and Their Exploitation
Academic year of topic announcement: 2010/2011
Thesis type: dissertation
Thesis language: angličtina
Department: Department of Software Engineering (32-KSI)
Supervisor: doc. RNDr. Irena Holubová, Ph.D.
Author: hidden - assigned and confirmed by the Study Dept.
Date of registration: 09.12.2010
Date of assignment: 09.12.2010
Date and time of defence: 23.09.2013 13:00
Date of electronic submission:20.06.2013
Date of submission of printed version:20.06.2013
Date of proceeded defence: 23.09.2013
Opponents: prof. Ing. Michal Krátký, Ph.D.
  prof. Martine Collard
 
 
Guidelines
The typical optimization strategy of many data processing techniques is exploitation of the knowledge of constructs typically user in real-world applications. However, such approach requires a repeatable, updatable and detailed analysis of a representative data set, i.e. a robust and extensible tool that enables to process the data. With such an application a number of related problems arise, such as automatic crawling of the data, number of errors in real-world data, efficient performance of analyses over a huge data volume, user-friendly processing and interpretation of the results as well as exploitation of the results in current applications.
References
I. Mlynkova, K. Toman, and J. Pokorny. Statistical Analysis of Real XML Data Collections. Technical report 2006/5, Charles University, June 2006. http://kocour.ms.mff.cuni.cz/~mlynkova/doc/tr2006-5.pdf.

M. Klettke, L. Schneider, and A. Heuer. Metrics for XML Document Collections. In XMLDM Workshop, pages 162-176, Prague, Czech Republic, 2002.

B. Choi. What are real DTDs like? In WebDB'02, Proceedings of the 5th International Workshop on the Web and Databases, pages 43-48, Madison, Wisconsin, USA, 2002. ACM Press.

A. Sahuguet. Everything You Ever Wanted to Know About DTDs, ButWere Afraid to Ask (Extended Abstract). In Selected papers from the 3rd International Workshop WebDB 2000 on The World Wide Web and Databases, pages 171-183, London, UK, 2001. Springer-Verlag.

L. Mignet, D. Barbosa, and P. Veltri. The XML Web: a First Study. In WWW '03, Proceedings of the 12th international conference on World Wide Web, Volume 2, pages 500-510, New York, NY, USA, 2003. ACM Press.
 
Charles University | Information system of Charles University | http://www.cuni.cz/UKEN-329.html