Linked Data Integration
Thesis title in Czech: | Integrade Linked Data |
---|---|
Thesis title in English: | Linked Data Integration |
Key words: | Linked Data, datová integrace, datová kvalita, datové konflikty |
English key words: | Linked Data, data integration, data quality, conflict resolution, data fusion |
Academic year of topic announcement: | 2012/2013 |
Thesis type: | diploma thesis |
Thesis language: | angličtina |
Department: | Department of Software Engineering (32-KSI) |
Supervisor: | RNDr. Tomáš Knap, Ph.D. |
Author: | hidden - assigned and confirmed by the Study Dept. |
Date of registration: | 09.04.2013 |
Date of assignment: | 12.04.2013 |
Confirmed by Study dept. on: | 17.04.2013 |
Date and time of defence: | 09.09.2013 00:00 |
Date of electronic submission: | 31.07.2013 |
Date of submission of printed version: | 01.08.2013 |
Date of proceeded defence: | 09.09.2013 |
Opponents: | RNDr. Jakub Klímek, Ph.D. |
Advisors: | doc. Mgr. Martin Nečaský, Ph.D. |
Guidelines |
One of the most important benefits of Linked Data [4] is the possibility to integrate data from multiple sources. This poses new challenges in data fusion [3], quality assessment [5], and provenance tracking [6].
The topic of the thesis is a data fusion (conflict resolution) component that would enable users to integrate and filter RDF data, estimate their quality and obtain provenance information. Data fusion component proposed may be executed at query time according to policies given by data consumers [2,5] or offline (batch mode). Data fusion will be implemented as a standalone component applicable to data represented in RDF and also deployed as part of the ODCleanStore project [1]. The thesis will also provide experimental evaluation of the proposed data fusion component and provide comparison of the proposed data fusion technique with existing data fusion techniques in relational databases. |
References |
[1] ODCleanStore project, http://sourceforge.net/p/odcleanstore. Available Online.
[2] C. Bizer and R. Oldakowski. Using Context- and Content-based Trust Policies on the Semantic Web. In Proceedings of the 13th International World Wide Web conference on Alternate track papers & posters, WWW Alt. '04, pages 228-229, New York, NY, USA, 2004. ACM. [3] P. N. Mendes, H. Mühleisen, and C. Bizer. Sieve: Linked Data Quality Assessment and Fusion. In Proceedings of the 2012 Joint EDBT/ICDT Workshops, pages 116-123, Berlin, Germany, March 2012. ACM. [4] C. Bizer, T. Heath, and T. Berners-Lee. Linked Data - The Story So Far. International Journal on Semantic Web and Information Systems, 5(3):1-22, 2009. [5] C. Bizer. Quality-Driven Information Filtering in the Context of Web-Based Information Systems. Dissertation, 2007. http://wifo5-03.informatik.uni-mannheim.de/bizer/pub/DisertationChrisBizer.pdf, Retrieved 07/03/2013. [6] A. Freitas, T. Knap, S. O'Riain, and E. Curry. W3P: Building an OPM based provenance model for the Web. Future Generation Comp. Syst., 27(6):766-774, 2011, ISSN: 0167-739X. |