Thesis (Selection of subject) (version: 368)
Thesis details
Linked Data Integration
Thesis title in Czech: Integrace Linked Data
Thesis title in English: Linked Data Integration
Key words: Linked Data, datová integrace, datová kvalita, datové konflikty
English key words: Linked Data, data integration, data quality, conflict resolution, data fusion
Academic year of topic announcement: 2012/2013
Thesis type: diploma thesis
Thesis language: English
Department: Department of Software Engineering (32-KSI)
Supervisor: RNDr. Tomáš Knap, Ph.D.
Author: hidden - assigned and confirmed by the Study Dept.
Date of registration: 09.04.2013
Date of assignment: 12.04.2013
Confirmed by Study dept. on: 17.04.2013
Date and time of defence: 09.09.2013 00:00
Date of electronic submission: 31.07.2013
Date of submission of printed version: 01.08.2013
Date defence took place: 09.09.2013
Opponents: RNDr. Jakub Klímek, Ph.D.
Advisors: doc. Mgr. Martin Nečaský, Ph.D.
Guidelines
One of the most important benefits of Linked Data [4] is the possibility to integrate data from multiple sources. This poses new challenges in data fusion [3], quality assessment [5], and provenance tracking [6].

The topic of the thesis is a data fusion (conflict resolution) component that enables users to integrate and filter RDF data, estimate its quality, and obtain provenance information. The proposed data fusion component may be executed at query time according to policies given by data consumers [2,5] or offline (in batch mode). Data fusion will be implemented as a standalone component applicable to data represented in RDF and will also be deployed as part of the ODCleanStore project [1]. The thesis will also provide an experimental evaluation of the proposed data fusion component and compare the proposed technique with existing data fusion techniques in relational databases.
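To make the idea of policy-driven conflict resolution more concrete, the following is a minimal sketch in plain Python. It is an illustration only, not the ODCleanStore API or the algorithm developed in the thesis. The names `Quad`, `FusedValue`, `fuse`, the policy labels `"ALL"` and `"BEST"`, the example data, and the rule that a value's quality equals the best score among its source graphs are all assumptions made for this example.

```python
from collections import defaultdict
from typing import NamedTuple


class Quad(NamedTuple):
    subject: str
    predicate: str
    obj: str
    graph: str  # named graph (source) the triple came from


class FusedValue(NamedTuple):
    obj: str
    quality: float  # estimated quality of this value (assumed scoring rule)
    sources: tuple  # named graphs that assert the value (provenance)


def fuse(quads, source_quality, policies, default_policy="ALL"):
    """Resolve conflicts among values sharing a (subject, predicate) pair.

    Hypothetical policies: "ALL" keeps every value, "BEST" keeps only the
    value with the highest estimated quality.
    """
    # Group values by (subject, predicate) and remember which graphs assert them.
    grouped = defaultdict(lambda: defaultdict(set))
    for q in quads:
        grouped[(q.subject, q.predicate)][q.obj].add(q.graph)

    result = {}
    for key, values in grouped.items():
        # Assumption: a value is as trustworthy as its best source graph.
        candidates = [
            FusedValue(
                obj,
                max(source_quality.get(g, 0.0) for g in graphs),
                tuple(sorted(graphs)),
            )
            for obj, graphs in values.items()
        ]
        policy = policies.get(key[1], default_policy)
        if policy == "BEST":
            candidates = [max(candidates, key=lambda c: c.quality)]
        # Return values ordered from highest to lowest estimated quality.
        result[key] = sorted(candidates, key=lambda c: c.quality, reverse=True)
    return result


# Made-up example data: two sources disagree about a population and a label.
quads = [
    Quad("ex:Prague", "ex:population", "1250000", "graph:A"),
    Quad("ex:Prague", "ex:population", "1300000", "graph:B"),
    Quad("ex:Prague", "ex:label", "Prague", "graph:A"),
    Quad("ex:Prague", "ex:label", "Praha", "graph:B"),
]
source_quality = {"graph:A": 0.9, "graph:B": 0.6}
policies = {"ex:population": "BEST"}  # consumer-supplied, per predicate

fused = fuse(quads, source_quality, policies)
for (subject, predicate), values in fused.items():
    for value in values:
        print(subject, predicate, value.obj, value.quality, value.sources)
```

In this sketch the consumer chooses a policy per predicate, so a single-valued property such as a population can be reduced to one value while labels from all sources are kept. Each fused value carries both a quality estimate and the list of source graphs, mirroring the quality and provenance outputs described above. The same function could be called either per query or over a whole dataset in batch.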
References
[1] ODCleanStore project, http://sourceforge.net/p/odcleanstore. Available Online.

[2] C. Bizer and R. Oldakowski. Using Context- and Content-based Trust Policies on the Semantic Web. In Proceedings of the 13th International World Wide Web conference on Alternate track papers & posters, WWW Alt. '04, pages 228-229, New York, NY, USA, 2004. ACM.

[3] P. N. Mendes, H. Mühleisen, and C. Bizer. Sieve: Linked Data Quality Assessment and Fusion. In Proceedings of the 2012 Joint EDBT/ICDT Workshops, pages 116-123, Berlin, Germany, March 2012. ACM.

[4] C. Bizer, T. Heath, and T. Berners-Lee. Linked Data - The Story So Far. International Journal on Semantic Web and Information Systems, 5(3):1-22, 2009.

[5] C. Bizer. Quality-Driven Information Filtering in the Context of Web-Based Information Systems. Dissertation, 2007. http://wifo5-03.informatik.uni-mannheim.de/bizer/pub/DisertationChrisBizer.pdf, Retrieved 07/03/2013.

[6] A. Freitas, T. Knap, S. O'Riain, and E. Curry. W3P: Building an OPM based provenance model for the Web. Future Generation Comp. Syst., 27(6):766-774, 2011, ISSN: 0167-739X.
 
Charles University | Information system of Charles University | http://www.cuni.cz/UKEN-329.html