Thesis (Selection of subject) (version: 368)
Thesis details
Using OCR to index PDF documents with predefined structure
Thesis title in Czech: Indexace PDF dokumentu s pevnou strukturou pomocou OCR
Thesis title in English: Using OCR to index PDF documents with predefined structure
Academic year of topic announcement: 2008/2009
Thesis type: Bachelor's thesis
Thesis language:
Department: Department of Software Engineering (32-KSI)
Supervisor: RNDr. Jozef Mišutka, Ph.D.
Author:
Guidelines
The student will design and implement an extension of an existing search engine for PDF documents that have been converted to text using OCR. The search engine must be able to display and search specially structured documents, e.g. bachelor's and master's theses. Part of the work is importing a portion of the existing bachelor's and master's theses.
References
[1] The PDF Reference, http://partners.adobe.com/public/developer/en/pdf/PDFReference.pdf
[2] A review of free optical character recognition software, http://groundstate.ca/ocr
[3] Egothor - Java search engine, http://www.egothor.org/
Preliminary scope of work
The aim of the thesis is to make bachelor's and master's theses supplied in PDF format accessible, with the ability to search within them.
Preliminary scope of work in English
The aim of this work is to provide access to bachelor's and master's theses and the ability to search them.
 
Charles University | Information system of Charles University | http://www.cuni.cz/UKEN-329.html