Thesis (Selection of subject) (version: 368)
Thesis details
Scalable and precise analysis of data lineage in complex software systems
Thesis title in Czech: Škálovatelná a přesná analýza původu dat ve složitých softwarových systémech
Thesis title in English: Scalable and precise analysis of data lineage in complex software systems
English key words: data lineage, data flow, static analysis, data processing frameworks
Academic year of topic announcement: 2021/2022
Thesis type: dissertation
Thesis language:
Department: Department of Distributed and Dependable Systems (32-KDSS)
Supervisor: doc. RNDr. Pavel Parízek, Ph.D.
Author:
Guidelines
Large information systems are typically implemented using a combination of database queries and code in object-oriented programming languages.
An important property of such systems is data lineage, that is, the flow of data loaded from a database or another source through the program code.
The main goal of this project is to develop scalable and precise analyses of data lineage in complex software systems.
The proposed techniques will support classic SQL databases, modern data processing frameworks, and multithreaded programs.
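To make the notion of data lineage concrete, here is a minimal Python sketch of how lineage could be propagated through a toy straight-line program. It is an illustration only, not one of the techniques the project will develop. The `Assign` statement type, the `compute_lineage` function, and the column names such as `orders.price` are all hypothetical. The sketch ignores branches, loops, aliasing, and threads.

```python
from dataclasses import dataclass
from typing import Dict, List, Set, Tuple


@dataclass(frozen=True)
class Assign:
    """A toy statement `target = f(operands...)` (hypothetical representation)."""
    target: str
    operands: Tuple[str, ...]


def compute_lineage(sources: Dict[str, str], program: List[Assign]) -> Dict[str, Set[str]]:
    """Map each variable to the set of data sources it may derive from.

    `sources` maps a variable to the origin it is loaded from,
    for example a (hypothetical) database column name.
    """
    lineage: Dict[str, Set[str]] = {var: {origin} for var, origin in sources.items()}
    for stmt in program:
        derived: Set[str] = set()
        for operand in stmt.operands:
            # Operands with no recorded lineage (e.g. constants) contribute nothing.
            derived |= lineage.get(operand, set())
        lineage[stmt.target] = derived
    return lineage


# Variables loaded from hypothetical database columns.
sources = {"price": "orders.price", "qty": "orders.quantity", "rate": "tax.rate"}
program = [
    Assign("subtotal", ("price", "qty")),
    Assign("tax", ("subtotal", "rate")),
    Assign("label", ()),  # built from a constant, so it has no lineage
]
lineage = compute_lineage(sources, program)
print(sorted(lineage["tax"]))  # ['orders.price', 'orders.quantity', 'tax.rate']
```

The result for `tax` includes `orders.price` and `orders.quantity` even though `tax` never reads them directly, because lineage is propagated transitively through `subtotal`.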
References
F. Nielson, H.R. Nielson, and C. Hankin. Principles of Program Analysis. Springer, 1999
A.V. Aho, M.S. Lam, R. Sethi, and J.D. Ullman. Compilers: Principles, Techniques, and Tools. Addison-Wesley, 2nd edition, 2006
D. Clarke, J. Noble, and T. Wrigstad. Aliasing in Object-Oriented Programming: Types, Analysis and Verification. LNCS 7850, Springer, 2013
 