Thesis (Selection of subject)
Thesis details
Generation of Synthetic RDF Documents
Thesis title in Czech:
Thesis title in English: Generation of Synthetic RDF Documents
Academic year of topic announcement: 2011/2012
Thesis type: diploma thesis
Thesis language: English
Department: Department of Software Engineering (32-KSI)
Supervisor: RNDr. Martin Svoboda, Ph.D.
Author:
Guidelines
RDF triples representing graph data seem to be an appropriate way to enrich the contemporary Web of Documents and move it towards a Web of Data that is more suitable for processing by programs. To enable efficient processing, and especially querying, of such data, we need to find a compromise between storage, indexing structures and query evaluation algorithms. However, the development of new approaches suffers from a lack of easily accessible data that could be used for experiments, evaluation and benchmarking. Moreover, given suitable testing data, we can use the resulting observations to improve these methods.
The aim of this thesis is to research the possibilities and limitations of techniques for the automatic generation of synthetic RDF data graphs. It is necessary to analyze existing theoretical and commercial approaches and discuss their advantages and disadvantages. The core of the work should be a proposal of the author's own approach that builds on the results of this analysis. Special attention should be paid to a highly parameterizable model of allowed constructs with respect to RDFS schemata or OWL ontologies. The implementation of the approach will be supported by experimental results.
References
Manola, F., Miller, E.: RDF Primer (2004), http://www.w3.org/TR/rdf-primer/
Prud'hommeaux, E., Seaborne, A.: SPARQL Query Language for RDF (2008), http://www.w3.org/TR/rdf-sparql-query/
Brickley, D., Guha, R.V.: RDF Vocabulary Description Language 1.0: RDF Schema (2004), http://www.w3.org/TR/rdf-schema/
McGuinness, D.L., Harmelen, F.v.: OWL Web Ontology Language: Overview (2004), http://www.w3.org/TR/owl-features/
Bizer, C., Heath, T., Berners-Lee, T.: Linked Data – The Story so far. International Journal on Semantic Web and Information Systems 5(3), 1–22 (2009)
Billion Triple Challenge 2011, http://challenge.semanticweb.org/
Berlin SPARQL Benchmark (BSBM), http://www4.wiwiss.fu-berlin.de/bizer/berlinsparqlbenchmark/
Aboulnaga, A., Naughton, J. F., Zhang, C.: Generating Synthetic Complex-Structured XML Data. In WebDB'01: Proc. of the 4th Int. Workshop on the Web and Databases, pages 79-84, Washington, DC, USA (2001)
 