First steps towards improving official statistics data accessibility in Mexico: Query expansion with neural networks and ad-hoc space vectors

Q3 Decision Sciences
Alejandro Pimentel, Oswaldo Díaz, E. Villaseñor, Jose-Luis Marquez
{"title":"First steps towards improving official statistics data accessibility in Mexico: Query expansion with neural networks and ad-hoc space vectors","authors":"Alejandro Pimentel, Oswaldo Díaz, E. Villaseñor, Jose-Luis Marquez","doi":"10.3233/sji-230014","DOIUrl":null,"url":null,"abstract":"Mexico’s National Institute of Statistics and Geography (INEGI) has announced plans to improve its information search service, with the aim of increasing the accessibility of official statistical data. The upgraded search engine will include a new component that offers more sophisticated search capabilities. These include the ability to conduct intelligent searches that do not require an exact match of the search text, as well as the expansion of searches using related ad-hoc terms. Additionally, the new component will provide feedback through the most appropriate relations. To achieve this, the system will utilize neural network-based distributional word representation systems to identify relationships between related terms. The vector spaces and representation will be manipulated to keep connections within the most relevant vocabulary for the institute’s type of searches. The usability testing department at the institute conducted blind pilot tests to compare the quality reported by users with and without the new enhancements. Although the evaluation survey showed significant improvements in the search engine’s performance, the tool presented is just the first step towards a system that allows continuous interaction and feedback with users to improve the quality of the responses presented. This strategy is not currently implemented by the institute, making this an immediate and easy-to-replicate approach for obtaining useful interactions with users.","PeriodicalId":55877,"journal":{"name":"Statistical Journal of the IAOS","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistical Journal of the IAOS","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3233/sji-230014","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Decision Sciences","Score":null,"Total":0}
引用次数: 0

Abstract

Mexico’s National Institute of Statistics and Geography (INEGI) has announced plans to improve its information search service, with the aim of increasing the accessibility of official statistical data. The upgraded search engine will include a new component that offers more sophisticated search capabilities. These include the ability to conduct intelligent searches that do not require an exact match of the search text, as well as the expansion of searches using related ad-hoc terms. Additionally, the new component will provide feedback through the most appropriate relations. To achieve this, the system will utilize neural network-based distributional word representation systems to identify relationships between related terms. The vector spaces and representation will be manipulated to keep connections within the most relevant vocabulary for the institute’s type of searches. The usability testing department at the institute conducted blind pilot tests to compare the quality reported by users with and without the new enhancements. Although the evaluation survey showed significant improvements in the search engine’s performance, the tool presented is just the first step towards a system that allows continuous interaction and feedback with users to improve the quality of the responses presented. This strategy is not currently implemented by the institute, making this an immediate and easy-to-replicate approach for obtaining useful interactions with users.
改善墨西哥官方统计数据可及性的第一步:用神经网络和特设空间向量扩展查询
墨西哥国家统计和地理研究所(INEGI)宣布了改善其信息搜索服务的计划,目的是增加官方统计数据的可访问性。升级后的搜索引擎将包含一个新组件,提供更复杂的搜索功能。其中包括执行不需要精确匹配搜索文本的智能搜索的能力,以及使用相关的特别术语扩展搜索的能力。此外,新组件将通过最合适的关系提供反馈。为了实现这一点,该系统将利用基于神经网络的分布式单词表示系统来识别相关术语之间的关系。将对向量空间和表示进行处理,以保持与研究所搜索类型最相关的词汇内的连接。该研究所的可用性测试部门进行了盲测试点测试,以比较用户报告的有和没有新增强功能的质量。尽管评估调查显示搜索引擎的性能有了显著的改善,但所提供的工具只是迈向一个系统的第一步,该系统允许与用户进行持续的互动和反馈,以提高所提供的响应的质量。该研究所目前还没有实施这一策略,因此这是一种即时且易于复制的方法,可以与用户进行有用的交互。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Statistical Journal of the IAOS
Statistical Journal of the IAOS Economics, Econometrics and Finance-Economics and Econometrics
CiteScore
1.30
自引率
0.00%
发文量
116
期刊介绍: This is the flagship journal of the International Association for Official Statistics and is expected to be widely circulated and subscribed to by individuals and institutions in all parts of the world. The main aim of the Journal is to support the IAOS mission by publishing articles to promote the understanding and advancement of official statistics and to foster the development of effective and efficient official statistical services on a global basis. Papers are expected to be of wide interest to readers. Such papers may or may not contain strictly original material. All papers are refereed.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信