Thesis (Selection of subject)
Thesis details
Semi-automated data to ontology mapping
Thesis title in Czech: Semi-automatické mapování dat do ontologií
Thesis title in English: Semi-automated data to ontology mapping
Key words: propojená data|datový katalog
English key words: linked data|data catalog
Academic year of topic announcement: 2022/2023
Thesis type: diploma thesis
Thesis language: English
Department: Department of Software Engineering (32-KSI)
Supervisor: Mgr. Petr Škoda, Ph.D.
Author: hidden - assigned and confirmed by the Study Dept.
Date of registration: 23.06.2023
Date of assignment: 27.06.2023
Confirmed by Study dept. on: 18.07.2023
Date and time of defence: 11.06.2024 09:00
Date of electronic submission: 02.05.2024
Date of submission of printed version: 02.05.2024
Date of defence held: 11.06.2024
Opponents: doc. Mgr. Martin Nečaský, Ph.D.
Guidelines
When a person wants to publish their data as linked data, it is important to find suitable terms from existing vocabularies to represent the data so that the published data are interoperable with the existing linked data ecosystem and easy to consume. However, it is quite challenging to find such vocabularies and their terms even for linked data experts.

Therefore, the goal of the thesis is to remedy this situation by proposing a method for finding suitable vocabularies and their terms based on the data. The general idea of the method is to create a schema of the input data, search for vocabularies and terms that fit the schema, and transform the schema, repeating these steps iteratively. The intermediate results are presented to the user, who can interact with them manually or through semi-automatic recommendations for search or transformation. Once the user is satisfied, the output data can be generated from the final result.
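
One pass of the loop described above could look roughly like the following sketch. It is only an illustration under stated assumptions, not the method the thesis proposes: the schema is reduced to a list of field names, candidate terms are ranked by plain string similarity between a field name and a term label, and the names TERMS, infer_schema, suggest_terms and map_schema are hypothetical. The `choose` callback stands in for the user's decision.

```python
from difflib import SequenceMatcher

# Hypothetical in-memory term list: (term IRI, human-readable label).
TERMS = [
    ("http://xmlns.com/foaf/0.1/name", "name"),
    ("http://purl.org/dc/terms/title", "title"),
    ("http://purl.org/dc/terms/issued", "date issued"),
    ("http://www.w3.org/ns/dcat#keyword", "keyword"),
]


def infer_schema(records):
    """Collect the field names found in the input records, in first-seen order."""
    schema = []
    for record in records:
        for field in record:
            if field not in schema:
                schema.append(field)
    return schema


def normalize(text):
    """Lower-case a name and treat underscores and hyphens as spaces."""
    return text.replace("_", " ").replace("-", " ").lower().strip()


def suggest_terms(field, terms=TERMS, limit=3):
    """Rank term IRIs by string similarity between the field name and the label."""
    scored = [
        (SequenceMatcher(None, normalize(field), normalize(label)).ratio(), iri)
        for iri, label in terms
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [iri for score, iri in scored[:limit] if score > 0]


def map_schema(records, choose):
    """Propose candidates for every field and let `choose` pick one (or None)."""
    mapping = {}
    for field in infer_schema(records):
        mapping[field] = choose(field, suggest_terms(field))
    return mapping
```

Calling `map_schema(records, choose)` with a `choose` function that shows the candidates to the user and returns their pick gives one iteration; a later iteration could rename or restructure fields and run the search again.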

The thesis should include a proof-of-concept application with an implementation of the aforementioned method. Apart from the method, the application ought to contain a catalog whose purpose is to provide means for adding and retrieving vocabularies and their related data so they can be used by the method.
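
As a rough illustration of what such a catalog has to offer, the sketch below keeps vocabularies in memory and supports adding them, retrieving them by IRI, and looking them up by keyword. The class and method names (Vocabulary, VocabularyCatalog, add, get, find) are hypothetical and chosen for this example only; an actual catalog would likely persist its data and expose richer metadata.

```python
from dataclasses import dataclass, field


@dataclass
class Vocabulary:
    """A vocabulary record; the shape is an assumption made for this sketch."""
    iri: str
    title: str
    keywords: set = field(default_factory=set)
    terms: dict = field(default_factory=dict)  # term IRI -> label


class VocabularyCatalog:
    """Minimal in-memory catalog keyed by vocabulary IRI."""

    def __init__(self):
        self._by_iri = {}

    def add(self, vocabulary):
        """Store the vocabulary, replacing any earlier entry with the same IRI."""
        self._by_iri[vocabulary.iri] = vocabulary

    def get(self, iri):
        """Return the vocabulary with this IRI, or None if it is not stored."""
        return self._by_iri.get(iri)

    def find(self, keyword):
        """Return vocabularies tagged with the keyword (case-insensitive)."""
        wanted = keyword.lower()
        return [
            v for v in self._by_iri.values()
            if wanted in {k.lower() for k in v.keywords}
        ]
```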
References
* https://www.w3.org/TR/rdf11-concepts/
* https://www.w3.org/TR/rdf12-schema/
* https://www.w3.org/TR/owl2-primer/
* https://www.w3.org/TR/vocab-dcat-3/
 
Charles University | Information system of Charles University | http://www.cuni.cz/UKEN-329.html