{"title":"信息体系结构在自然语言处理中的应用:一项建议信息科学对人工神经网络训练和学习的数据预处理的贡献","authors":"George Júnior, C. Duque","doi":"10.20396/rdbci.v21i00.8671396/30919","DOIUrl":null,"url":null,"abstract":"Introduction:Natural Language Processing through artificial neural networks has gaps that can be addressed by Information Science through Information Architecture. Objective:To present Information Science contributions on Knowledge Organization applied to artificial neural networks training methods, positioning it as an active body of knowledge in artificial intelligence problems. Methodology:A three-leveled analysis path (metaphysical, scientific, and technological) is adopted to guide and ground the study. On metaphysical level, current development stage of natural language processing techniques is verified and analyzed. On scientific findings, a five-step procedure is proposed which aims to design, analyze, and prepare information spaces for artificial neural networks training and learning methods, fulfilling gaps identified by authors focused on Computer Science implementations. On technological implementation, the five-step procedure is applied to 3 datasets formed by texts from 16 scientific knowledge areas, as an evaluation basis. Results:Results obtained through pre-processed data and raw data where compared, showing great potential in developing a structuredmethod of Multimodal Information Architecture that provide instruments able to organize data used as test and learning samples in artificial neural networks. Conclusion:This method could place Information Science as a producer of data pre-processing solutions, replacing its current role as consumer of prefabricated solutions made by Computer Science.","PeriodicalId":36988,"journal":{"name":"Revista Digital de Biblioteconomia e Ciencia da Informacao","volume":"58 1","pages":""},"PeriodicalIF":0.3000,"publicationDate":"2022-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Information architecture applied on natural language processing: a proposalInformation Science contributions on data pre-processing for training and learning of artificial neural networks\",\"authors\":\"George Júnior, C. Duque\",\"doi\":\"10.20396/rdbci.v21i00.8671396/30919\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Introduction:Natural Language Processing through artificial neural networks has gaps that can be addressed by Information Science through Information Architecture. Objective:To present Information Science contributions on Knowledge Organization applied to artificial neural networks training methods, positioning it as an active body of knowledge in artificial intelligence problems. Methodology:A three-leveled analysis path (metaphysical, scientific, and technological) is adopted to guide and ground the study. On metaphysical level, current development stage of natural language processing techniques is verified and analyzed. On scientific findings, a five-step procedure is proposed which aims to design, analyze, and prepare information spaces for artificial neural networks training and learning methods, fulfilling gaps identified by authors focused on Computer Science implementations. On technological implementation, the five-step procedure is applied to 3 datasets formed by texts from 16 scientific knowledge areas, as an evaluation basis. Results:Results obtained through pre-processed data and raw data where compared, showing great potential in developing a structuredmethod of Multimodal Information Architecture that provide instruments able to organize data used as test and learning samples in artificial neural networks. Conclusion:This method could place Information Science as a producer of data pre-processing solutions, replacing its current role as consumer of prefabricated solutions made by Computer Science.\",\"PeriodicalId\":36988,\"journal\":{\"name\":\"Revista Digital de Biblioteconomia e Ciencia da Informacao\",\"volume\":\"58 1\",\"pages\":\"\"},\"PeriodicalIF\":0.3000,\"publicationDate\":\"2022-12-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Revista Digital de Biblioteconomia e Ciencia da Informacao\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.20396/rdbci.v21i00.8671396/30919\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"INFORMATION SCIENCE & LIBRARY SCIENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Revista Digital de Biblioteconomia e Ciencia da Informacao","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.20396/rdbci.v21i00.8671396/30919","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"INFORMATION SCIENCE & LIBRARY SCIENCE","Score":null,"Total":0}
Information architecture applied on natural language processing: a proposalInformation Science contributions on data pre-processing for training and learning of artificial neural networks
Introduction:Natural Language Processing through artificial neural networks has gaps that can be addressed by Information Science through Information Architecture. Objective:To present Information Science contributions on Knowledge Organization applied to artificial neural networks training methods, positioning it as an active body of knowledge in artificial intelligence problems. Methodology:A three-leveled analysis path (metaphysical, scientific, and technological) is adopted to guide and ground the study. On metaphysical level, current development stage of natural language processing techniques is verified and analyzed. On scientific findings, a five-step procedure is proposed which aims to design, analyze, and prepare information spaces for artificial neural networks training and learning methods, fulfilling gaps identified by authors focused on Computer Science implementations. On technological implementation, the five-step procedure is applied to 3 datasets formed by texts from 16 scientific knowledge areas, as an evaluation basis. Results:Results obtained through pre-processed data and raw data where compared, showing great potential in developing a structuredmethod of Multimodal Information Architecture that provide instruments able to organize data used as test and learning samples in artificial neural networks. Conclusion:This method could place Information Science as a producer of data pre-processing solutions, replacing its current role as consumer of prefabricated solutions made by Computer Science.