FreeTxt: A corpus-based bilingual free-text survey and questionnaire data analysis toolkit

IF 2.1

Applied Corpus Linguistics Pub Date : 2024-08-23 DOI:10.1016/j.acorp.2024.100103

Dawn Knight , Nouran Khallaf , Paul Rayson , Mahmoud El-Haj , Ignatius Ezeani , Steve Morris

{"title":"FreeTxt: A corpus-based bilingual free-text survey and questionnaire data analysis toolkit","authors":"Dawn Knight , Nouran Khallaf , Paul Rayson , Mahmoud El-Haj , Ignatius Ezeani , Steve Morris","doi":"10.1016/j.acorp.2024.100103","DOIUrl":null,"url":null,"abstract":"<div><p>Qualitative free-text responses (e.g. from questionnaires and surveys) pose a challenge to many companies and institutions which lack the expertise to analyse such data with ease. While a range of sophisticated tools for the analysis of text <em>do</em> exist, these are often expensive, difficult to use and/or inaccessible to non-expert users. These tools also lack support for the analysis of English <em>and</em> Welsh text, which can be a particular challenge in the bilingual context of Wales. This paper details the key functionalities of the first corpus-based ‘FreeTxt’ toolkit which has been designed to support the systematic analysis and visualisation of free-text data, as a direct response to these two key needs. This paper demonstrates how, by working in partnership, software engineers, natural language processing (NLP) experts and corpus linguists can collaborate with end-users and beneficiaries to provide effective solutions to real world problems. Through the development of FreeTxt (<span><span>www.freetxt.app</span><svg><path></path></svg></span>), we aimed to empower end-users to <em>direct</em> and lead their own analyses of both small-scale and more extensive datasets to maximise the reach and potential impact generated. The approaches reported here, and the bilingual toolkit developed, can be replicated and extended for use in other language contexts and across a range of public and professional sectors. FreeTxt is now available for the analysis of Welsh and/or English, for use by <em>anyone</em> in <em>any sector</em> in Wales and beyond.</p></div>","PeriodicalId":72254,"journal":{"name":"Applied Corpus Linguistics","volume":"4 3","pages":"Article 100103"},"PeriodicalIF":2.1000,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666799124000200/pdfft?md5=65f8a01d41b4150af967f22d4f542b8f&pid=1-s2.0-S2666799124000200-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Corpus Linguistics","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666799124000200","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Qualitative free-text responses (e.g. from questionnaires and surveys) pose a challenge to many companies and institutions which lack the expertise to analyse such data with ease. While a range of sophisticated tools for the analysis of text do exist, these are often expensive, difficult to use and/or inaccessible to non-expert users. These tools also lack support for the analysis of English and Welsh text, which can be a particular challenge in the bilingual context of Wales. This paper details the key functionalities of the first corpus-based ‘FreeTxt’ toolkit which has been designed to support the systematic analysis and visualisation of free-text data, as a direct response to these two key needs. This paper demonstrates how, by working in partnership, software engineers, natural language processing (NLP) experts and corpus linguists can collaborate with end-users and beneficiaries to provide effective solutions to real world problems. Through the development of FreeTxt (www.freetxt.app), we aimed to empower end-users to direct and lead their own analyses of both small-scale and more extensive datasets to maximise the reach and potential impact generated. The approaches reported here, and the bilingual toolkit developed, can be replicated and extended for use in other language contexts and across a range of public and professional sectors. FreeTxt is now available for the analysis of Welsh and/or English, for use by anyone in any sector in Wales and beyond.

查看原文本刊更多论文

FreeTxt：基于语料库的双语自由文本调查和问卷数据分析工具包

定性的自由文本回复（如来自问卷和调查的回复）给许多公司和机构带来了挑战，因为它们缺乏轻松分析此类数据的专业知识。虽然目前确实存在一系列复杂的文本分析工具，但这些工具往往价格昂贵、难以使用和/或非专家用户无法使用。这些工具还缺乏对英语和威尔士语文本分析的支持，这在威尔士的双语环境中是一个特殊的挑战。本文详细介绍了首个基于语料库的 "FreeTxt "工具包的主要功能，该工具包旨在支持自由文本数据的系统分析和可视化，是对这两个关键需求的直接回应。本文展示了软件工程师、自然语言处理（NLP）专家和语料库语言学家如何通过合作，与最终用户和受益者共同为现实问题提供有效的解决方案。通过开发 FreeTxt (www.freetxt.app)，我们旨在授权最终用户指导和领导他们自己对小规模和更大规模数据集的分析，以最大限度地扩大影响范围和潜在影响。本文所报告的方法和开发的双语工具包可在其他语言环境和一系列公共与专业部门中复制和扩展使用。FreeTxt 现在可用于威尔士语和/或英语的分析，供威尔士及其他地区任何部门的任何人使用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊