Témata prací (Výběr práce)Témata prací (Výběr práce)(verze: 368)
Detail práce
   Přihlásit přes CAS
Semantic disambiguation using Distributional Semantics
Název práce v češtině: Semantic disambiguation using Distributional Semantics
Název v anglickém jazyce: Semantic disambiguation using Distributional Semantics
Klíčová slova: -
Klíčová slova anglicky: WORD SENSE DISAMBIGUATION, VECTOR SPACE MODEL, PRAGUE DEPENDENCY TREEBANK
Akademický rok vypsání: 2010/2011
Typ práce: diplomová práce
Jazyk práce: angličtina
Ústav: Ústav formální a aplikované lingvistiky (32-UFAL)
Vedoucí / školitel: RNDr. Jiří Hana, Ph.D.
Řešitel: skrytý - zadáno a potvrzeno stud. odd.
Datum přihlášení: 10.12.2010
Datum zadání: 14.01.2011
Datum a čas obhajoby: 10.05.2012 00:00
Datum odevzdání elektronické podoby:13.04.2012
Datum odevzdání tištěné podoby:13.04.2012
Datum proběhlé obhajoby: 10.05.2012
Oponenti: doc. Mgr. Barbora Vidová Hladká, Ph.D.
 
 
 
Zásady pro vypracování
The goal of this thesis is to employ the combination of Distributional Semantics as used in Natural Language Programming (e.g. Schütze 1998) and of the traditional propositional semantics, as suggested for example by E. Hovy (2010), in a task of automatic categorization (for example, lemma disambiguation on the Prague Dependency Treebank).
E. Hovy's semantics combines traditional propositional semantics based on symbolic logic and statistical word distribution information of Distributional Semantics as used in Natural Language Programming (e.g. Schütze 1998). The core resource is a single lexico-semantic lexicon where concepts are organized as tensors encoding strenght of relations
to other concepts. Using these strenghts of relations, appropriateness of terms given a particular context can be determined, and used for a variety of tasks, including term disambiguation. Distributional Semantics has a strong cognitive plausibility, as shown for example by its ability to predict human brain activity associated with the meanings of nouns (Mitchell et al 2008).
The result of this thesis should be a system performing automatic categorization using Hovy's semantics, for example, a system for lexical disambiguation tested on the Prague Dependency Treebank. Lexical disambiguation is a process of determining the correct meaning of a word based on its context (e.g. determining whether 'bank' refers to an institution or to a river bank).
Seznam odborné literatury
Hovy, Eduard (2010): Distributional Semantics and the Lexicon, Keynote speech at COLLING 2010.

Landauer, Thomas K. and Dumais, Susan T. (1997). A solution to Plato's problem: The latent semantic analysis theory of acquisition, induction and representation of knowledge. Psychological Review, 104(2), 211-240.

Mitchell, Tom M.; Shinkareva, Svetlana V.; Carlson, Andrew; Chang, Kai-Min; Malave, Vicente L.; Mason, Robert A.; Just, Marcel Adam (2008). Predicting human brain activity
associated with the meanings of nouns. Science, 320, 1191-1195.

Schütze, Hinrich (1998). Automatic word sense discrimination. Computational Linguistics, 24(1), 97-123.

Stefan Evert, Alessandro Lenci: Distributional Semantic Models - A course at ESSLLI 2009, Bordeaux, July 27-31 2009.

Lin, Dekang (1998). Automatic retrieval and clustering of similar words. In Proceedings of the 17th International Conference on Computational Linguistics (COLING-ACL 1998), pages 768-774, Montreal, Canada.
 
Univerzita Karlova | Informační systém UK