High-performance inverted index database
Thesis title in Czech: | Vysoce výkonná databáze invertovaných indexů |
---|---|
Thesis title in English: | High-performance inverted index database |
Key words: | invertované indexy, databáze, vysoce výkonný software |
English key words: | inverted index, database, high-performance software |
Academic year of topic announcement: | 2018/2019 |
Thesis type: | Bachelor's thesis |
Thesis language: | angličtina |
Department: | Department of Software Engineering (32-KSI) |
Supervisor: | RNDr. Miroslav Kratochvíl, Ph.D. |
Author: | hidden - assigned and confirmed by the Study Dept. |
Date of registration: | 25.08.2018 |
Date of assignment: | 28.08.2018 |
Confirmed by Study dept. on: | 03.12.2018 |
Date and time of defence: | 05.09.2019 09:00 |
Date of electronic submission: | 18.07.2019 |
Date of submission of printed version: | 19.07.2019 |
Date of proceeded defence: | 05.09.2019 |
Opponents: | Mgr. Ladislav Peška, Ph.D. |
Guidelines |
The aim of the thesis is to describe and implement a lightweight high-performance implementation of a reliable skip-list-based inverted index database, similar to the backends of available search engines (e.g. Solr, Lucene or Xapian). The performance will be derived from using a system-level programming environment that allows implementation of various performance improvements based on data transfer efficiency, cache-oblivious behavior, or parallelism. Resulting software will be directly applicable as a simpler and faster replacement of the aforementioned search engines in several non-textual uses. |
References |
Boldi, P., & Vigna, S. (2005, November). Compressed perfect embedded skip lists for quick inverted-index lookups. In International Symposium on String Processing and Information Retrieval (pp. 25-28). Springer, Berlin, Heidelberg.
Frigo, M., Leiserson, C. E., Prokop, H., & Ramachandran, S. (1999). Cache-oblivious algorithms. In Foundations of Computer Science, 1999. 40th Annual Symposium on (pp. 285-297). IEEE. Białecki, A., Muir, R., & Ingersoll, G. (2012, August). Apache lucene 4. In SIGIR 2012 workshop on open source information retrieval (p. 17). |