Témata prací (Výběr práce)

Váš prohlížeč nepodporuje JavaScript nebo je jeho podpora vypnutá. Některé funkce nemusejí být dostupné.

Získávání komplexní informace z textových dokumentů

Název práce v češtině:	Získávání komplexní informace z textových dokumentů
Název v anglickém jazyce:	Document-level information extraction
Klíčová slova:	extrakce informací\|zpracování přirozeného jazyka\|hluboké učení
Klíčová slova anglicky:	information extraction\|natural language processing\|deep learning
Akademický rok vypsání:	2023/2024
Typ práce:	disertační práce
Jazyk práce:
Ústav:	Ústav formální a aplikované lingvistiky (32-UFAL)
Vedoucí / školitel:	doc. RNDr. Pavel Pecina, Ph.D.
Řešitel:	skrytý - zadáno a potvrzeno stud. odd.
Datum přihlášení:	19.09.2023
Datum zadání:	19.09.2023
Datum potvrzení stud. oddělením:	21.09.2023

Zásady pro vypracování

Information extraction is the task of automatically extracting structured information from unstructured data, usually textual documents. The basic sub-tasks are mainly solved on the sentence level (e.g. named entity recognition, extraction of relations between the entities, and linking the entities to an ontology). More complex information is extracted on document level and includes, for instance, template filling which attempts to fill a fixed set of fields from an entire document. The thesis will explore document-level information extraction using deep-learning based models in multilingual and domain-specific settings.

Seznam odborné literatury

Goodfellow, I., Y. Bengio, and A. Courville 2016. Deep learning. Cambridge, MA, USA: MIT press.

Du, Xinya, Alexander M. Rush, and Claire Cardie. "Template filling with generative transformers." Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2021.