{"title":"Verb Classification Across Languages","authors":"Olga Majewska, A. Korhonen","doi":"10.1146/annurev-linguistics-030521-043632","DOIUrl":null,"url":null,"abstract":"Recent developments in language modeling have enabled large text encoders to derive a wealth of linguistic information from raw text corpora without supervision. Their success across natural language processing (NLP) tasks has called into question the role of man-made computational resources, such as verb lexicons, in supporting modern NLP. Still, probing analyses have concurrently exposed the limitations of the knowledge possessed by the large neural architectures, revealing them to be clever task solvers rather than self-taught linguists. Can human-designed lexical resources still help fill their knowledge gaps? Focusing on verb classification, we discuss approaches to generating verb classes multilingually and weigh the relative benefits of undertaking expensive lexicographic work and outsourcing the task to untrained native speakers. Then, we consider the evidence for the utility of augmenting pretrained language models with external verb knowledge and ponder the ways in which human expertise can continue to benefit multilingual NLP. Expected final online publication date for the Annual Review of Linguistics, Volume 9 is January 2023. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.","PeriodicalId":45803,"journal":{"name":"Annual Review of Linguistics","volume":null,"pages":null},"PeriodicalIF":3.0000,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annual Review of Linguistics","FirstCategoryId":"98","ListUrlMain":"https://doi.org/10.1146/annurev-linguistics-030521-043632","RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}
引用次数: 0
Abstract
Recent developments in language modeling have enabled large text encoders to derive a wealth of linguistic information from raw text corpora without supervision. Their success across natural language processing (NLP) tasks has called into question the role of man-made computational resources, such as verb lexicons, in supporting modern NLP. Still, probing analyses have concurrently exposed the limitations of the knowledge possessed by the large neural architectures, revealing them to be clever task solvers rather than self-taught linguists. Can human-designed lexical resources still help fill their knowledge gaps? Focusing on verb classification, we discuss approaches to generating verb classes multilingually and weigh the relative benefits of undertaking expensive lexicographic work and outsourcing the task to untrained native speakers. Then, we consider the evidence for the utility of augmenting pretrained language models with external verb knowledge and ponder the ways in which human expertise can continue to benefit multilingual NLP. Expected final online publication date for the Annual Review of Linguistics, Volume 9 is January 2023. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
期刊介绍:
The Annual Review of Linguistics, in publication since 2015, covers significant developments in the field of linguistics, including phonetics, phonology, morphology, syntax, semantics, pragmatics, and their interfaces. Reviews synthesize advances in linguistic theory, sociolinguistics, psycholinguistics, neurolinguistics, language change, biology and evolution of language, typology, as well as applications of linguistics in many domains.