信息体系结构在自然语言处理中的应用:一项建议信息科学对人工神经网络训练和学习的数据预处理的贡献

IF 0.3 Q4 INFORMATION SCIENCE & LIBRARY SCIENCE
George Júnior, C. Duque
{"title":"信息体系结构在自然语言处理中的应用:一项建议信息科学对人工神经网络训练和学习的数据预处理的贡献","authors":"George Júnior, C. Duque","doi":"10.20396/rdbci.v21i00.8671396/30919","DOIUrl":null,"url":null,"abstract":"Introduction:Natural Language Processing through artificial neural networks has gaps that can be addressed by Information Science through Information Architecture. Objective:To present Information Science contributions on Knowledge Organization applied to artificial neural networks training methods, positioning it as an active body of knowledge in artificial intelligence problems. Methodology:A three-leveled analysis path (metaphysical, scientific, and technological) is adopted to guide and ground the study. On metaphysical level, current development stage of natural language processing techniques is verified and analyzed. On scientific findings, a five-step procedure is proposed which aims to design, analyze, and prepare information spaces for artificial neural networks training and learning methods, fulfilling gaps identified by authors focused on Computer Science implementations. On technological implementation, the five-step procedure is applied to 3 datasets formed by texts from 16 scientific knowledge areas, as an evaluation basis. Results:Results obtained through pre-processed data and raw data where compared, showing great potential in developing a structuredmethod of Multimodal Information Architecture that provide instruments able to organize data used as test and learning samples in artificial neural networks. Conclusion:This method could place Information Science as a producer of data pre-processing solutions, replacing its current role as consumer of prefabricated solutions made by Computer Science.","PeriodicalId":36988,"journal":{"name":"Revista Digital de Biblioteconomia e Ciencia da Informacao","volume":"58 1","pages":""},"PeriodicalIF":0.3000,"publicationDate":"2022-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Information architecture applied on natural language processing: a proposalInformation Science contributions on data pre-processing for training and learning of artificial neural networks\",\"authors\":\"George Júnior, C. Duque\",\"doi\":\"10.20396/rdbci.v21i00.8671396/30919\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Introduction:Natural Language Processing through artificial neural networks has gaps that can be addressed by Information Science through Information Architecture. Objective:To present Information Science contributions on Knowledge Organization applied to artificial neural networks training methods, positioning it as an active body of knowledge in artificial intelligence problems. Methodology:A three-leveled analysis path (metaphysical, scientific, and technological) is adopted to guide and ground the study. On metaphysical level, current development stage of natural language processing techniques is verified and analyzed. On scientific findings, a five-step procedure is proposed which aims to design, analyze, and prepare information spaces for artificial neural networks training and learning methods, fulfilling gaps identified by authors focused on Computer Science implementations. On technological implementation, the five-step procedure is applied to 3 datasets formed by texts from 16 scientific knowledge areas, as an evaluation basis. Results:Results obtained through pre-processed data and raw data where compared, showing great potential in developing a structuredmethod of Multimodal Information Architecture that provide instruments able to organize data used as test and learning samples in artificial neural networks. Conclusion:This method could place Information Science as a producer of data pre-processing solutions, replacing its current role as consumer of prefabricated solutions made by Computer Science.\",\"PeriodicalId\":36988,\"journal\":{\"name\":\"Revista Digital de Biblioteconomia e Ciencia da Informacao\",\"volume\":\"58 1\",\"pages\":\"\"},\"PeriodicalIF\":0.3000,\"publicationDate\":\"2022-12-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Revista Digital de Biblioteconomia e Ciencia da Informacao\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.20396/rdbci.v21i00.8671396/30919\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"INFORMATION SCIENCE & LIBRARY SCIENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Revista Digital de Biblioteconomia e Ciencia da Informacao","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.20396/rdbci.v21i00.8671396/30919","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"INFORMATION SCIENCE & LIBRARY SCIENCE","Score":null,"Total":0}
引用次数: 0

摘要

引言:通过人工神经网络进行的自然语言处理存在空白,信息科学可以通过信息架构来解决这些空白。目的:介绍知识组织应用于人工神经网络训练方法的信息科学贡献,将其定位为人工智能问题的活跃知识体。方法论:采用形而上学、科学和技术三个层次的分析路径来指导和支撑研究。在形而上学的层面上,对自然语言处理技术目前的发展阶段进行了验证和分析。在科学发现方面,提出了一个五步程序,旨在设计、分析和准备人工神经网络训练和学习方法的信息空间,以填补由专注于计算机科学实现的作者确定的空白。在技术实施方面,将五步程序应用于由16个科学知识领域的文本组成的3个数据集,作为评估依据。结果:通过比较预处理数据和原始数据获得的结果,显示出开发多模态信息架构结构化方法的巨大潜力,该方法提供了能够组织用作人工神经网络测试和学习样本的数据的工具。结论:该方法可以将信息科学作为数据预处理解决方案的生产者,取代其目前作为计算机科学预制解决方案的消费者的角色。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Information architecture applied on natural language processing: a proposalInformation Science contributions on data pre-processing for training and learning of artificial neural networks
Introduction:Natural Language Processing through artificial neural networks has gaps that can be addressed by Information Science through Information Architecture. Objective:To present Information Science contributions on Knowledge Organization applied to artificial neural networks training methods, positioning it as an active body of knowledge in artificial intelligence problems. Methodology:A three-leveled analysis path (metaphysical, scientific, and technological) is adopted to guide and ground the study. On metaphysical level, current development stage of natural language processing techniques is verified and analyzed. On scientific findings, a five-step procedure is proposed which aims to design, analyze, and prepare information spaces for artificial neural networks training and learning methods, fulfilling gaps identified by authors focused on Computer Science implementations. On technological implementation, the five-step procedure is applied to 3 datasets formed by texts from 16 scientific knowledge areas, as an evaluation basis. Results:Results obtained through pre-processed data and raw data where compared, showing great potential in developing a structuredmethod of Multimodal Information Architecture that provide instruments able to organize data used as test and learning samples in artificial neural networks. Conclusion:This method could place Information Science as a producer of data pre-processing solutions, replacing its current role as consumer of prefabricated solutions made by Computer Science.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Revista Digital de Biblioteconomia e Ciencia da Informacao
Revista Digital de Biblioteconomia e Ciencia da Informacao Social Sciences-Library and Information Sciences
CiteScore
0.90
自引率
0.00%
发文量
24
审稿时长
24 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信