Thesis (Selection of subject)Thesis (Selection of subject)(version: 368)
Thesis details
   Login via CAS
Efektivní reprezentace instanční kategorie
Thesis title in Czech: Efektivní reprezentace instanční kategorie
Thesis title in English: Efficient Representation of Instance Category
Academic year of topic announcement: 2023/2024
Thesis type: diploma thesis
Thesis language:
Department: Department of Software Engineering (32-KSI)
Supervisor: RNDr. Martin Svoboda, Ph.D.
Author: hidden - assigned and confirmed by the Study Dept.
Date of registration: 30.08.2023
Date of assignment: 31.08.2023
Confirmed by Study dept. on: 17.01.2024
Guidelines
NoSQL database management systems allow us to work with data represented in various logical models, ranging from key/value over document to graph models, to enumerate at least some of them. However, many real-world information systems need to use several such models simultaneously to respect the nature of the data as well as to deal with the different expressive power of the involved and often proprietary query languages. Multi-model database systems in particular seem to be helpful in these situations. Unfortunately, their users still need to become acquainted with the specifics of each model and language.
In order to gain the ability to model the data in a truly unified way and at the conceptual layer, a new mechanism based on category theory has recently been proposed. The intended structure of the data is first described using the notion of a schema category, so that the actual data can then be captured via an instance category. While the latter construct serves as a reasonable basis for describing the intended semantics of the data adhering to a given schema category, its motivation is primarily theoretical and cannot be directly exploited for the implementation. Or at least if nothing better than a naive solution would be expected.
The aim of this thesis is to propose a suitable data structure and related mechanisms allowing for the representation of the data conforming to a given schema category within the system memory. In other words, the proposed representation should serve as an efficient implementation of the corresponding instance category, respecting the context and basic principles of the envisioned unified database system. It is expected that the C/C++ programming language will be used. An analysis of the instance category model and the required language constructs and mechanisms will be performed, as the use of standard containers or dynamic memory allocation may not be sufficient. The prototype implementation will also be evaluated experimentally.
References
Martin Svoboda, Pavel Čontoš, Irena Holubová: Categorical Management of Multi-Model Data. 25th International Database Engineering & Applications Symposium, IDEAS 2021. Montreal, Canada, July 2021. DOI: 10.1145/3472163.3472166
Martin Svoboda, Pavel Čontoš, Irena Holubová: Categorical Modeling of Multi-model Data: One Model to Rule Them All. 10th International Conference on Model and Data Engineering, MEDI 2021. Tallinn, Estonia, June 2021. Online. DOI: 10.1007/978-3-030-78428-7_15
Irena Holubová, Martin Svoboda, Jiaheng Lu: Unified Management of Multi-Model Data. 38th International Conference on Conceptual Modeling, ER 2019. Salvador, Bahia, Brazil, November 2019. DOI: 10.1007/978-3-030-33223-5_36
 
Charles University | Information system of Charles University | http://www.cuni.cz/UKEN-329.html