自动分析照顾者的输入和孩子的生产

IF 0.4 0 LANGUAGE & LINGUISTICS
Gyu-Ho Shin
{"title":"自动分析照顾者的输入和孩子的生产","authors":"Gyu-Ho Shin","doi":"10.1075/kl.20002.shi","DOIUrl":null,"url":null,"abstract":"\n The present study explores the applicability of Natural Language Processing (NLP) techniques to investigate child\n corpora in Korean. We employ caregiver input and child production data in the CHILDES database, currently the largest and\n open-access Korean child corpus data, and apply NLP techniques to the data in two ways: automatic Part-of-Speech tagging by\n adapting a machine learning algorithm, and (semi-)automatic extraction of constructional patterns expressing a transitive event\n (active transitive and suffixal passive). As the first empirical report on NLP-assisted analysis of Korean child corpora, this\n study is expected to reveal its advantages and drawbacks, thereby opening the window to furthering corpus-mediated research on\n child language development in Korean. Implications of this study’s findings will also contribute to research practice regarding\n developmental studies on Korean through child corpora, ensuring the reproducibility of procedures and results, which is often\n lacking in previous corpus-based research on child language development in Korean.","PeriodicalId":29725,"journal":{"name":"Korean Linguistics","volume":" ","pages":""},"PeriodicalIF":0.4000,"publicationDate":"2022-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Automatic analysis of caregiver input and child production\",\"authors\":\"Gyu-Ho Shin\",\"doi\":\"10.1075/kl.20002.shi\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n The present study explores the applicability of Natural Language Processing (NLP) techniques to investigate child\\n corpora in Korean. We employ caregiver input and child production data in the CHILDES database, currently the largest and\\n open-access Korean child corpus data, and apply NLP techniques to the data in two ways: automatic Part-of-Speech tagging by\\n adapting a machine learning algorithm, and (semi-)automatic extraction of constructional patterns expressing a transitive event\\n (active transitive and suffixal passive). As the first empirical report on NLP-assisted analysis of Korean child corpora, this\\n study is expected to reveal its advantages and drawbacks, thereby opening the window to furthering corpus-mediated research on\\n child language development in Korean. Implications of this study’s findings will also contribute to research practice regarding\\n developmental studies on Korean through child corpora, ensuring the reproducibility of procedures and results, which is often\\n lacking in previous corpus-based research on child language development in Korean.\",\"PeriodicalId\":29725,\"journal\":{\"name\":\"Korean Linguistics\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.4000,\"publicationDate\":\"2022-09-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Korean Linguistics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1075/kl.20002.shi\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"0\",\"JCRName\":\"LANGUAGE & LINGUISTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Korean Linguistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1075/kl.20002.shi","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}
引用次数: 2

摘要

本研究探讨自然语言处理(NLP)技术在韩语儿童语料库研究中的适用性。我们在CHILDES数据库(目前最大的开放访问韩语儿童语料库数据)中使用照顾者输入和儿童生产数据,并以两种方式对数据应用NLP技术:通过采用机器学习算法自动标记词性,以及(半)自动提取表达及物事件的结构模式(主动及物和后缀被动)。本研究是首个使用nlp辅助分析韩语儿童语料库的实证报告,希望能够揭示其优势和不足,从而为进一步开展语料库介导的韩语儿童语言发展研究打开一扇窗。本研究结果的启示也将有助于通过儿童语料库进行韩语发展研究的研究实践,确保程序和结果的可重复性,这在以往基于语料库的韩语儿童语言发展研究中经常缺乏。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Automatic analysis of caregiver input and child production
The present study explores the applicability of Natural Language Processing (NLP) techniques to investigate child corpora in Korean. We employ caregiver input and child production data in the CHILDES database, currently the largest and open-access Korean child corpus data, and apply NLP techniques to the data in two ways: automatic Part-of-Speech tagging by adapting a machine learning algorithm, and (semi-)automatic extraction of constructional patterns expressing a transitive event (active transitive and suffixal passive). As the first empirical report on NLP-assisted analysis of Korean child corpora, this study is expected to reveal its advantages and drawbacks, thereby opening the window to furthering corpus-mediated research on child language development in Korean. Implications of this study’s findings will also contribute to research practice regarding developmental studies on Korean through child corpora, ensuring the reproducibility of procedures and results, which is often lacking in previous corpus-based research on child language development in Korean.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
0.30
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信