Thesis (Selection of subject)Thesis (Selection of subject)(version: 390)
Thesis details
   Login via CAS
Effects of Semantic Network Structure of English on Word Processing
Thesis title in Czech: Efekty struktury sémantické sítě angličtiny na zpracování slov
Thesis title in English: Effects of Semantic Network Structure of English on Word Processing
Key words: sémantická síť|zpracování slov|slovní vektory|strojové učení|psycholingvistika|network science
English key words: semantic network|word processing|word vectors|machine learning|psycholinguistics|network science
Academic year of topic announcement: 2023/2024
Thesis type: diploma thesis
Thesis language: angličtina
Department: Department of the English Language and ELT Methodology (21-UAJD)
Supervisor: doc. Dr. phil. Eva Maria Luef, Mag. phil.
Author: hidden - assigned and confirmed by the Study Dept.
Date of registration: 10.10.2023
Date of assignment: 10.10.2023
Administrator's approval: approved
Confirmed by Study dept. on: 12.08.2024
Date and time of defence: 09.09.2024 08:30
Date of electronic submission:12.08.2024
Date of proceeded defence: 09.09.2024
Submitted/finalized: committed by student and finalized
Opponents: Mgr. Kateřina Vašků, Ph.D.
 
 
 
Guidelines
This work analyses semantic network of English using the tools of social network analysis (SNA). Assuming small-world characteristics of this network, the focus will be on the comparison of words from lexical islands, i.e., smaller interconnected clusters within the network and the giant component, i.e., the biggest interconnected chunk within the network. It will build upon previous research on phonological networks that showed that words from lexical islands are being processed more quickly and efficiently during psycholinguistic experiments. The aim of this work is to test if similar effect occurs in semantic network. The semantic network in this work is constructed by first using a machine learning method based on word vector embeddings (e.g., word2vec, BERT, etc.) on a corpus of text that capture the meaning of a word as set of vectors in vector space. The embeddings can be visualized as data points in space where each data point represents one word and the distance between them represents the semantic similarity as calculated from the corpus by the algorithm. To construct an interconnected semantic network, a maximal distance between two data points (i.e., words) will be established, which produces a list of edges between the words of the network.
Práce bude vypracována v anglickém jazyce.
References
Chan, Kit Ying, and Michael S. Vitevitch. ‘The Influence of the Phonological Neighborhood Clustering Coefficient on Spoken Word Recognition.’ _Journal of Experimental Psychology: Human Perception and Performance_ 35, no. 6 (2009): 1934–49. (https://doi.org/10.1037/a0016902).
Gavin, Michael, Collin Jennings, Lauren Kersey, and Brad Pasanek. ‘Chapter 21 Spaces of Meaning: Conceptual History, Vector Semantics, and Close Reading’. In Debates in the Digital Humanities 2019, edited by Matthew K. Gold and Lauren F. Klein. University of Minnesota Press, 2019. (https://doi.org/10.5749/j.ctvg251hk).
Keuleers, Emmanuel, and David A. Balota. ‘Megastudies, Crowdsourcing, and Large Datasets in Psycholinguistics: An Overview of Recent Developments’. Quarterly Journal of Experimental Psychology 68, no. 8 (1 August 2015): 1457–68. (https://doi.org/10.1080/17470218.2015.1051065).
Mikolov, Tomas, Ilya Sutskever, Kai Chen, Greg Corrado, and Jeffrey Dean. ‘Distributed Representations of Words and Phrases and Their Compositionality’. arXiv, 16 October 2013. (https://doi.org/10.48550/arXiv.1310.4546).
Siew, Cynthia S. Q., and Michael S. Vitevitch. ‘Spoken Word Recognition and Serial Recall of Words from Components in the Phonological Network.’ Journal of Experimental Psychology: Learning, Memory, and Cognition 42, no. 3 (2016): 394–410. (https://doi.org/10.1037/xlm0000139).
Siew, Cynthia S. Q., and Michael S. Vitevitch. ‘An Investigation of Network Growth Principles in the Phonological Language Network.’ Journal of Experimental Psychology: General 149, no. 12 (December 2020): 2376–94. (https://doi.org/10.1037/xge0000876).
Steyvers, Mark, and Joshua B. Tenenbaum. ‘The Large-Scale Structure of Semantic Networks: Statistical Analyses and a Model of Semantic Growth’. Cognitive Science 29, no. 1 (2 January 2005): 41–78. (https://doi.org/10.1207/s15516709cog2901_3).
Sun, Kun, and Xiaofei Lu. ‘Assessing Lexical Psychological Properties in Second Language Production: A Dynamic Semantic Similarity Approach’. _Frontiers in Psychology_ 12 (2021). (https://www.frontiersin.org/articles/10.3389/fpsyg.2021.672243).
 
Charles University | Information system of Charles University | http://www.cuni.cz/UKEN-329.html