FreeTxt: A corpus-based bilingual free-text survey and questionnaire data analysis toolkit

{"title":"FreeTxt: A corpus-based bilingual free-text survey and questionnaire data analysis toolkit","authors":"","doi":"10.1016/j.acorp.2024.100103","DOIUrl":null,"url":null,"abstract":"<div><p>Qualitative free-text responses (e.g. from questionnaires and surveys) pose a challenge to many companies and institutions which lack the expertise to analyse such data with ease. While a range of sophisticated tools for the analysis of text <em>do</em> exist, these are often expensive, difficult to use and/or inaccessible to non-expert users. These tools also lack support for the analysis of English <em>and</em> Welsh text, which can be a particular challenge in the bilingual context of Wales. This paper details the key functionalities of the first corpus-based ‘FreeTxt’ toolkit which has been designed to support the systematic analysis and visualisation of free-text data, as a direct response to these two key needs. This paper demonstrates how, by working in partnership, software engineers, natural language processing (NLP) experts and corpus linguists can collaborate with end-users and beneficiaries to provide effective solutions to real world problems. Through the development of FreeTxt (<span><span>www.freetxt.app</span><svg><path></path></svg></span>), we aimed to empower end-users to <em>direct</em> and lead their own analyses of both small-scale and more extensive datasets to maximise the reach and potential impact generated. The approaches reported here, and the bilingual toolkit developed, can be replicated and extended for use in other language contexts and across a range of public and professional sectors. FreeTxt is now available for the analysis of Welsh and/or English, for use by <em>anyone</em> in <em>any sector</em> in Wales and beyond.</p></div>","PeriodicalId":72254,"journal":{"name":"Applied Corpus Linguistics","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666799124000200/pdfft?md5=65f8a01d41b4150af967f22d4f542b8f&pid=1-s2.0-S2666799124000200-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Corpus Linguistics","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666799124000200","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Qualitative free-text responses (e.g. from questionnaires and surveys) pose a challenge to many companies and institutions which lack the expertise to analyse such data with ease. While a range of sophisticated tools for the analysis of text do exist, these are often expensive, difficult to use and/or inaccessible to non-expert users. These tools also lack support for the analysis of English and Welsh text, which can be a particular challenge in the bilingual context of Wales. This paper details the key functionalities of the first corpus-based ‘FreeTxt’ toolkit which has been designed to support the systematic analysis and visualisation of free-text data, as a direct response to these two key needs. This paper demonstrates how, by working in partnership, software engineers, natural language processing (NLP) experts and corpus linguists can collaborate with end-users and beneficiaries to provide effective solutions to real world problems. Through the development of FreeTxt (www.freetxt.app), we aimed to empower end-users to direct and lead their own analyses of both small-scale and more extensive datasets to maximise the reach and potential impact generated. The approaches reported here, and the bilingual toolkit developed, can be replicated and extended for use in other language contexts and across a range of public and professional sectors. FreeTxt is now available for the analysis of Welsh and/or English, for use by anyone in any sector in Wales and beyond.

FreeTxt:基于语料库的双语自由文本调查和问卷数据分析工具包
定性的自由文本回复(如来自问卷和调查的回复)给许多公司和机构带来了挑战,因为它们缺乏轻松分析此类数据的专业知识。虽然目前确实存在一系列复杂的文本分析工具,但这些工具往往价格昂贵、难以使用和/或非专家用户无法使用。这些工具还缺乏对英语和威尔士语文本分析的支持,这在威尔士的双语环境中是一个特殊的挑战。本文详细介绍了首个基于语料库的 "FreeTxt "工具包的主要功能,该工具包旨在支持自由文本数据的系统分析和可视化,是对这两个关键需求的直接回应。本文展示了软件工程师、自然语言处理(NLP)专家和语料库语言学家如何通过合作,与最终用户和受益者共同为现实问题提供有效的解决方案。通过开发 FreeTxt (www.freetxt.app),我们旨在授权最终用户指导和领导他们自己对小规模和更大规模数据集的分析,以最大限度地扩大影响范围和潜在影响。本文所报告的方法和开发的双语工具包可在其他语言环境和一系列公共与专业部门中复制和扩展使用。FreeTxt 现在可用于威尔士语和/或英语的分析,供威尔士及其他地区任何部门的任何人使用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Applied Corpus Linguistics
Applied Corpus Linguistics Linguistics and Language
CiteScore
1.30
自引率
0.00%
发文量
0
审稿时长
70 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信