{"title":"Complex Words as Shortest Paths in the Network of Lexical Knowledge","authors":"Sergei Monakhov, Holger Diessel","doi":"10.1111/cogs.70005","DOIUrl":null,"url":null,"abstract":"<p>Lexical models diverge on the question of how to represent complex words. Under the morpheme-based approach, each morpheme is treated as a separate unit, while under the word-based approach, morphological structure is derived from complex words. In this paper, we propose a new computational model of morphology that is based on graph theory and is intended to elaborate the word-based network approach. Specifically, we use a key concept of network science, the notion of shortest path, to investigate how complex words are learned, stored, and processed. The notion of shortest path refers to the task of finding the shortest or most optimal path connecting two non-adjacent nodes in a network. Building on this notion, the current study shows (i) that new complex words can be segmented into morphemes through the shortest path analysis; (ii) that attested English words tend to represent the shortest paths in the morphological network; and (iii) that novel (unattested) words receive higher acceptability ratings in experiments when they are formed along established optimal paths. The model's performance is tested in two experiments with human participants as well as against the behavioral data from the English Lexicon Project. We interpret our empirical results from the perspective of a usage-based model of grammar and argue that network science provides a powerful tool for analyzing language structure.</p>","PeriodicalId":48349,"journal":{"name":"Cognitive Science","volume":"48 11","pages":""},"PeriodicalIF":2.3000,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/cogs.70005","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cognitive Science","FirstCategoryId":"102","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/cogs.70005","RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PSYCHOLOGY, EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
Lexical models diverge on the question of how to represent complex words. Under the morpheme-based approach, each morpheme is treated as a separate unit, while under the word-based approach, morphological structure is derived from complex words. In this paper, we propose a new computational model of morphology that is based on graph theory and is intended to elaborate the word-based network approach. Specifically, we use a key concept of network science, the notion of shortest path, to investigate how complex words are learned, stored, and processed. The notion of shortest path refers to the task of finding the shortest or most optimal path connecting two non-adjacent nodes in a network. Building on this notion, the current study shows (i) that new complex words can be segmented into morphemes through the shortest path analysis; (ii) that attested English words tend to represent the shortest paths in the morphological network; and (iii) that novel (unattested) words receive higher acceptability ratings in experiments when they are formed along established optimal paths. The model's performance is tested in two experiments with human participants as well as against the behavioral data from the English Lexicon Project. We interpret our empirical results from the perspective of a usage-based model of grammar and argue that network science provides a powerful tool for analyzing language structure.
期刊介绍:
Cognitive Science publishes articles in all areas of cognitive science, covering such topics as knowledge representation, inference, memory processes, learning, problem solving, planning, perception, natural language understanding, connectionism, brain theory, motor control, intentional systems, and other areas of interdisciplinary concern. Highest priority is given to research reports that are specifically written for a multidisciplinary audience. The audience is primarily researchers in cognitive science and its associated fields, including anthropologists, education researchers, psychologists, philosophers, linguists, computer scientists, neuroscientists, and roboticists.