{"title":"Supervised prediction of production patterns using machine learning algorithms","authors":"Jungyeon Kim","doi":"10.1515/lingvan-2024-0034","DOIUrl":null,"url":null,"abstract":"When an English word ending in a stop is adapted to Korean, a vowel is variably inserted after the final stop: some words always take the epenthetic vowel, and some never do, while some vary between these alternatives. Although there are different linguistic factors that possibly affect this insertion, it is not easy to determine which pattern will be chosen if a new word comes into the borrowing language. This study conducted classification data analyses of production patterns based on machine learning algorithms including support vector machines and random forests. These two classifiers show similar results where vowel tenseness is the best predictor among all the possible predictors. This indicates that vowel tenseness is most influential in classifying the patterns (no vowel insertion, optional vowel insertion, or vowel insertion). Results suggest that while vowel tenseness remains significant, other factors such as stop voicing and stop place also hold some importance, albeit to a lesser degree. The contribution of this study is that it provides insight into the factors that regulate vowel insertion, and these findings support the need for a behavioral experiment to see if the current results can make right predictions with respect to the behavior of nonce items.","PeriodicalId":55960,"journal":{"name":"Linguistics Vanguard","volume":"25 1","pages":""},"PeriodicalIF":1.1000,"publicationDate":"2024-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Linguistics Vanguard","FirstCategoryId":"98","ListUrlMain":"https://doi.org/10.1515/lingvan-2024-0034","RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}
引用次数: 0
Abstract
When an English word ending in a stop is adapted to Korean, a vowel is variably inserted after the final stop: some words always take the epenthetic vowel, and some never do, while some vary between these alternatives. Although there are different linguistic factors that possibly affect this insertion, it is not easy to determine which pattern will be chosen if a new word comes into the borrowing language. This study conducted classification data analyses of production patterns based on machine learning algorithms including support vector machines and random forests. These two classifiers show similar results where vowel tenseness is the best predictor among all the possible predictors. This indicates that vowel tenseness is most influential in classifying the patterns (no vowel insertion, optional vowel insertion, or vowel insertion). Results suggest that while vowel tenseness remains significant, other factors such as stop voicing and stop place also hold some importance, albeit to a lesser degree. The contribution of this study is that it provides insight into the factors that regulate vowel insertion, and these findings support the need for a behavioral experiment to see if the current results can make right predictions with respect to the behavior of nonce items.
期刊介绍:
Linguistics Vanguard is a new channel for high quality articles and innovative approaches in all major fields of linguistics. This multimodal journal is published solely online and provides an accessible platform supporting both traditional and new kinds of publications. Linguistics Vanguard seeks to publish concise and up-to-date reports on the state of the art in linguistics as well as cutting-edge research papers. With its topical breadth of coverage and anticipated quick rate of production, it is one of the leading platforms for scientific exchange in linguistics. Its broad theoretical range, international scope, and diversity of article formats engage students and scholars alike. All topics within linguistics are welcome. The journal especially encourages submissions taking advantage of its new multimodal platform designed to integrate interactive content, including audio and video, images, maps, software code, raw data, and any other media that enhances the traditional written word. The novel platform and concise article format allows for rapid turnaround of submissions. Full peer review assures quality and enables authors to receive appropriate credit for their work. The journal publishes general submissions as well as special collections. Ideas for special collections may be submitted to the editors for consideration.