An online collaborative platform for the development of empirical grammars
Thesis title in Czech: | On-line platforma pro spolupráci na vývoji empirických gramatik |
---|---|
Title in English: | An online collaborative platform for the development of empirical grammars |
Keywords in Czech: | gramatika; spolupráce na vývoji; webová platforma; HPSG |
Keywords in English: | grammar; collaborative development; web-based platform; HPSG |
Academic year of topic announcement: | 2014/2015 |
Thesis type: | diploma thesis |
Thesis language: | English |
Department: | Institute of Formal and Applied Linguistics (32-UFAL) |
Supervisor: | Ing. Alexandr Rosen, Ph.D. |
Author: | Mgr. Antonio Fernando Garcia Sevilla - assigned and confirmed by the Study Department |
Date of registration: | 09.03.2015 |
Date of assignment: | 14.03.2015 |
Date of confirmation by the Study Department: | 15.07.2015 |
Date and time of defence: | 03.02.2016 09:00 |
Date of electronic submission: | 21.01.2016 |
Date of printed submission: | 04.12.2015 |
Date of completed defence: | 03.02.2016 |
Reviewers: | RNDr. Jiří Hana, Ph.D. |
Guidelines |
The thesis is based on the assumption that developing a formal grammar and integrating it with data-driven methods and a testing environment becomes significantly easier if grammar developers can use a single platform that unifies available resources and results and allows for collaborative development. More specifically, the thesis has two goals:
(1) To develop a web-based, collaborative, user-friendly platform for the development of HPSG grammars, where users can build, test and share their grammars, data and results. A typical scenario would include collaborative development within a large project.
(2) To build and implement an HPSG grammar of Spanish with non-trivial coverage, using existing resources as the starting point and the proposed platform as the grammar writing and testing environment.
The platform should allow the rule-based components to be extended with data-driven modules that make the system more robust and adaptive, including applications such as dynamic lexicon induction and constraint application weighting. |
List of recommended literature |
Abeillé, A., Borsley, R. D., and Espinal, M.-T. (2006). The syntax of comparative correlatives in French and Spanish. In Müller, S., editor, The Proceedings of the 13th International Conference on Head-Driven Phrase Structure Grammar, pages 6–26, Stanford. CSLI Publications.
Bildhauer, F. (2007). Representing Information Structure in an HPSG Grammar of Spanish. PhD thesis, Universität Bremen.
Bildhauer, F. (2008). Clitic left dislocation and focus projection in Spanish. In Müller, S., editor, Proceedings of the 15th International Conference on Head-Driven Phrase Structure Grammar, National Institute of Information and Communications Technology, Keihanna, pages 346–357, Stanford, CA. CSLI Publications.
Baldwin, T., Bender, E. M., Flickinger, D., Kim, A., and Oepen, S. (2004). Road-testing the English Resource Grammar over the British National Corpus. In Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004), pages 2047–2050.
Brew, C. (1995). Stochastic HPSG. In Proceedings of the 7th Conference of the European Chapter of the Association for Computational Linguistics, Dublin, Ireland, March 28–31. University College, pages 83–89.
Copestake, A. and Flickinger, D. (2000). An open-source grammar development environment and broad-coverage English grammar using HPSG. In Proceedings of the Second Conference on Language Resources and Evaluation (LREC-2000), Athens, Greece.
Crysmann, B., Frank, A., Kiefer, B., Krieger, H.-U., Müller, S., Neumann, G., Piskorski, J., Schäfer, U., Siegel, M., Uszkoreit, H., and Xu, F. (2002). An integrated architecture for shallow and deep processing. In Proceedings of ACL-2002, 40th Anniversary Meeting, Philadelphia, USA. Association for Computational Linguistics.
Gilcub, M. M. and Marimon, M. (2002). Integrating shallow linguistic processing into a unification-based Spanish grammar. In Proceedings of COLING-2002.
Marimon, M., Bel, N., Espeja, S., and Seghezzi, N. (2007). The Spanish Resource Grammar: Pre-processing strategy and lexical acquisition. In Proceedings of the Workshop on Deep Linguistic Processing, DeepLP ’07, pages 105–111, Stroudsburg, PA, USA. Association for Computational Linguistics.
Marimon, M. (2010). The Spanish Resource Grammar. In Calzolari, N., Choukri, K., Maegaard, B., Mariani, J., Odijk, J., Piperidis, S., Rosner, M., and Tapias, D., editors, Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC’10), Valletta, Malta. European Language Resources Association (ELRA).
Meza, I. and Pineda, L. (2002). The Spanish auxiliary verb system in HPSG. In Proceedings of CICLing-2002. Springer-Verlag.
Meza, I. and Pineda, L. (2005). Syntax-driven bindings of Spanish clitic pronoun. Procesamiento del Lenguaje Natural, 35.
Pineda, L. and Meza, I. (2000). Una gramática básica del español en HPSG. Technical report, Universidad Nacional Autónoma de México.
Smith, T. C. and Cleary, J. G. (1997). Probabilistic unification grammars. In Australasian Natural Language Processing Workshop, pages 25–32. Macquarie University.
Torruella, M. C. and Antonín, A. M. (2002). Design principles for a Spanish treebank. In 1st Workshop on Treebanks and Linguistic Theories (TLT), Sozopol, Bulgaria. |