{"title":"复杂词是词汇知识网络中的最短路径","authors":"Sergei Monakhov, Holger Diessel","doi":"10.1111/cogs.70005","DOIUrl":null,"url":null,"abstract":"<p>Lexical models diverge on the question of how to represent complex words. Under the morpheme-based approach, each morpheme is treated as a separate unit, while under the word-based approach, morphological structure is derived from complex words. In this paper, we propose a new computational model of morphology that is based on graph theory and is intended to elaborate the word-based network approach. Specifically, we use a key concept of network science, the notion of shortest path, to investigate how complex words are learned, stored, and processed. The notion of shortest path refers to the task of finding the shortest or most optimal path connecting two non-adjacent nodes in a network. Building on this notion, the current study shows (i) that new complex words can be segmented into morphemes through the shortest path analysis; (ii) that attested English words tend to represent the shortest paths in the morphological network; and (iii) that novel (unattested) words receive higher acceptability ratings in experiments when they are formed along established optimal paths. The model's performance is tested in two experiments with human participants as well as against the behavioral data from the English Lexicon Project. We interpret our empirical results from the perspective of a usage-based model of grammar and argue that network science provides a powerful tool for analyzing language structure.</p>","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/cogs.70005","citationCount":"0","resultStr":"{\"title\":\"Complex Words as Shortest Paths in the Network of Lexical Knowledge\",\"authors\":\"Sergei Monakhov, Holger Diessel\",\"doi\":\"10.1111/cogs.70005\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Lexical models diverge on the question of how to represent complex words. Under the morpheme-based approach, each morpheme is treated as a separate unit, while under the word-based approach, morphological structure is derived from complex words. In this paper, we propose a new computational model of morphology that is based on graph theory and is intended to elaborate the word-based network approach. Specifically, we use a key concept of network science, the notion of shortest path, to investigate how complex words are learned, stored, and processed. The notion of shortest path refers to the task of finding the shortest or most optimal path connecting two non-adjacent nodes in a network. Building on this notion, the current study shows (i) that new complex words can be segmented into morphemes through the shortest path analysis; (ii) that attested English words tend to represent the shortest paths in the morphological network; and (iii) that novel (unattested) words receive higher acceptability ratings in experiments when they are formed along established optimal paths. The model's performance is tested in two experiments with human participants as well as against the behavioral data from the English Lexicon Project. We interpret our empirical results from the perspective of a usage-based model of grammar and argue that network science provides a powerful tool for analyzing language structure.</p>\",\"PeriodicalId\":2,\"journal\":{\"name\":\"ACS Applied Bio Materials\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2024-10-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1111/cogs.70005\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACS Applied Bio Materials\",\"FirstCategoryId\":\"102\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/cogs.70005\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MATERIALS SCIENCE, BIOMATERIALS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"102","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/cogs.70005","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}
Complex Words as Shortest Paths in the Network of Lexical Knowledge
Lexical models diverge on the question of how to represent complex words. Under the morpheme-based approach, each morpheme is treated as a separate unit, while under the word-based approach, morphological structure is derived from complex words. In this paper, we propose a new computational model of morphology that is based on graph theory and is intended to elaborate the word-based network approach. Specifically, we use a key concept of network science, the notion of shortest path, to investigate how complex words are learned, stored, and processed. The notion of shortest path refers to the task of finding the shortest or most optimal path connecting two non-adjacent nodes in a network. Building on this notion, the current study shows (i) that new complex words can be segmented into morphemes through the shortest path analysis; (ii) that attested English words tend to represent the shortest paths in the morphological network; and (iii) that novel (unattested) words receive higher acceptability ratings in experiments when they are formed along established optimal paths. The model's performance is tested in two experiments with human participants as well as against the behavioral data from the English Lexicon Project. We interpret our empirical results from the perspective of a usage-based model of grammar and argue that network science provides a powerful tool for analyzing language structure.