Thesis (Selection of subject)Thesis (Selection of subject)(version: 368)
Thesis details
   Login via CAS
High-performance inverted index database
Thesis title in Czech: Vysoce výkonná databáze invertovaných indexů
Thesis title in English: High-performance inverted index database
Key words: invertované indexy, databáze, vysoce výkonný software
English key words: inverted index, database, high-performance software
Academic year of topic announcement: 2018/2019
Thesis type: Bachelor's thesis
Thesis language: angličtina
Department: Department of Software Engineering (32-KSI)
Supervisor: RNDr. Miroslav Kratochvíl, Ph.D.
Author: hidden - assigned and confirmed by the Study Dept.
Date of registration: 25.08.2018
Date of assignment: 28.08.2018
Confirmed by Study dept. on: 03.12.2018
Date and time of defence: 05.09.2019 09:00
Date of electronic submission:18.07.2019
Date of submission of printed version:19.07.2019
Date of proceeded defence: 05.09.2019
Opponents: Mgr. Ladislav Peška, Ph.D.
 
 
 
Guidelines
The aim of the thesis is to describe and implement a lightweight high-performance implementation of a reliable skip-list-based inverted index database, similar to the backends of available search engines (e.g. Solr, Lucene or Xapian). The performance will be derived from using a system-level programming environment that allows implementation of various performance improvements based on data transfer efficiency, cache-oblivious behavior, or parallelism. Resulting software will be directly applicable as a simpler and faster replacement of the aforementioned search engines in several non-textual uses.
References
Boldi, P., & Vigna, S. (2005, November). Compressed perfect embedded skip lists for quick inverted-index lookups. In International Symposium on String Processing and Information Retrieval (pp. 25-28). Springer, Berlin, Heidelberg.

Frigo, M., Leiserson, C. E., Prokop, H., & Ramachandran, S. (1999). Cache-oblivious algorithms. In Foundations of Computer Science, 1999. 40th Annual Symposium on (pp. 285-297). IEEE.

Białecki, A., Muir, R., & Ingersoll, G. (2012, August). Apache lucene 4. In SIGIR 2012 workshop on open source information retrieval (p. 17).
 
Charles University | Information system of Charles University | http://www.cuni.cz/UKEN-329.html