基于AntConc的ESP双语平行语料库构建:应用与评价

Xi Lu, Kai Jiang
{"title":"基于AntConc的ESP双语平行语料库构建:应用与评价","authors":"Xi Lu, Kai Jiang","doi":"10.25236/FER.2021.040910","DOIUrl":null,"url":null,"abstract":"AntConc is a free and green corpus tool developed by Japanese scholar Laurence Anthony featured by three main functions: concordance, wordlist and keywords. The author first describes the principles and process of constructing a web-based bilingual parallel corpus for the purpose of translation studies and ESP research based on AntConc. Constructing principles include strict linguistic standards, balance of corpus, and appropriate size. Based on the principles, language data is accumulated, processed and entered. After that, language processing software CLAWS part-of-speech tagger and Wmatrix are respectively used for text marking, annotating, high-frequency vocabulary extracting and corpus distribution balancing. In the end, texts, paragraphs and sentences are aligned by the corpus tool ParaConc. After the construction, the author uses the retrieval software Wordsmith 4.0 and statistical software SPSS 11.5 to prove the feasibility and effectiveness of the corpus with a six-month experiment that covers 16 translators, 25 teachers and 285 students.","PeriodicalId":314716,"journal":{"name":"Frontiers in Educational Research","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Constructing an ESP Bilingual Parallel Corpus Based on AntConc: Application and Assessment\",\"authors\":\"Xi Lu, Kai Jiang\",\"doi\":\"10.25236/FER.2021.040910\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"AntConc is a free and green corpus tool developed by Japanese scholar Laurence Anthony featured by three main functions: concordance, wordlist and keywords. The author first describes the principles and process of constructing a web-based bilingual parallel corpus for the purpose of translation studies and ESP research based on AntConc. Constructing principles include strict linguistic standards, balance of corpus, and appropriate size. Based on the principles, language data is accumulated, processed and entered. After that, language processing software CLAWS part-of-speech tagger and Wmatrix are respectively used for text marking, annotating, high-frequency vocabulary extracting and corpus distribution balancing. In the end, texts, paragraphs and sentences are aligned by the corpus tool ParaConc. After the construction, the author uses the retrieval software Wordsmith 4.0 and statistical software SPSS 11.5 to prove the feasibility and effectiveness of the corpus with a six-month experiment that covers 16 translators, 25 teachers and 285 students.\",\"PeriodicalId\":314716,\"journal\":{\"name\":\"Frontiers in Educational Research\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in Educational Research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.25236/FER.2021.040910\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Educational Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.25236/FER.2021.040910","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

AntConc是由日本学者Laurence Anthony开发的一个免费的绿色语料库工具,其主要功能有三个:词表、词表和关键词。本文首先介绍了基于AntConc构建基于网络的双语平行语料库的原理和过程,以用于翻译研究和ESP研究。构建原则包括严格的语言标准、平衡的语料库和适当的规模。根据这些原则,对语言数据进行积累、处理和录入。之后分别使用语言处理软件CLAWS词性标注器和Wmatrix进行文本标注、标注、高频词汇提取和语料库分布平衡。最后,通过语料库工具ParaConc对文本、段落和句子进行对齐。构建完成后,作者利用检索软件Wordsmith 4.0和统计软件SPSS 11.5对语料库进行了为期6个月的实验,实验对象包括16名翻译、25名教师和285名学生,验证了语料库的可行性和有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Constructing an ESP Bilingual Parallel Corpus Based on AntConc: Application and Assessment
AntConc is a free and green corpus tool developed by Japanese scholar Laurence Anthony featured by three main functions: concordance, wordlist and keywords. The author first describes the principles and process of constructing a web-based bilingual parallel corpus for the purpose of translation studies and ESP research based on AntConc. Constructing principles include strict linguistic standards, balance of corpus, and appropriate size. Based on the principles, language data is accumulated, processed and entered. After that, language processing software CLAWS part-of-speech tagger and Wmatrix are respectively used for text marking, annotating, high-frequency vocabulary extracting and corpus distribution balancing. In the end, texts, paragraphs and sentences are aligned by the corpus tool ParaConc. After the construction, the author uses the retrieval software Wordsmith 4.0 and statistical software SPSS 11.5 to prove the feasibility and effectiveness of the corpus with a six-month experiment that covers 16 translators, 25 teachers and 285 students.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信