Vinicius dos Santos, P. R. Silva, Erica Ferreira, K. Felizardo, W. Watanabe, Arnaldo Cândido Júnior, G. V. Meinerz, S. Aluísio, N. Vijaykumar
{"title":"Using Open Information Extraction to Extract Relations: An Extended Systematic Mapping","authors":"Vinicius dos Santos, P. R. Silva, Erica Ferreira, K. Felizardo, W. Watanabe, Arnaldo Cândido Júnior, G. V. Meinerz, S. Aluísio, N. Vijaykumar","doi":"10.1109/CLEI53233.2021.9639968","DOIUrl":null,"url":null,"abstract":"Context: For thousands of years humans have been using natural language to register their knowledge on important information to enable its access to future generations. With internet, a large amount of textual data is produced and shared on a daily basis. So, scientists started to research techniques for efficiently process knowledge stored in textual format. In this context, Natural Language Processing (NLP) became a popular area studying linguistic phenomena and using computational methods to process texts in natural language. In particular, Open Information Extraction (Open IE) was proposed to gather information from plain text. Despite the advances in this area, it is still necessary to map details about how these approaches were proposed to support the community while creating more efficient Open IE systems. Objective: In this paper, we identify, in the literature, the main characteristics of proposed Open IE approaches. Method: First, we extended the search performed in a systematic mapping previously published by using backward snowballing and a manual search. Next, we updated the electronic database search including ACL Anthology. Finally, 159 studies proposing Open IE approaches were considered for data extraction. Results: Data analysis showed a significant increase in the number of studies published about Open IE in the last years. In addition, we provide important details about how these techniques were proposed (e.g., data sets used and output evaluation techniques). Results indicate that researchers started to adopt neural networks to perform Open IE instead of using conventional supervised learning techniques. Conclusion: Recent advances in Artificial Intelligence and neural networks techniques allowed scientists to have a new perspective on how to perform efficient textual data management. Therefore, Open IE approaches gained much attention as they can help in many contexts, especially in knowledge management tasks.","PeriodicalId":6803,"journal":{"name":"2021 XLVII Latin American Computing Conference (CLEI)","volume":"128 1","pages":"1-10"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 XLVII Latin American Computing Conference (CLEI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CLEI53233.2021.9639968","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Context: For thousands of years humans have been using natural language to register their knowledge on important information to enable its access to future generations. With internet, a large amount of textual data is produced and shared on a daily basis. So, scientists started to research techniques for efficiently process knowledge stored in textual format. In this context, Natural Language Processing (NLP) became a popular area studying linguistic phenomena and using computational methods to process texts in natural language. In particular, Open Information Extraction (Open IE) was proposed to gather information from plain text. Despite the advances in this area, it is still necessary to map details about how these approaches were proposed to support the community while creating more efficient Open IE systems. Objective: In this paper, we identify, in the literature, the main characteristics of proposed Open IE approaches. Method: First, we extended the search performed in a systematic mapping previously published by using backward snowballing and a manual search. Next, we updated the electronic database search including ACL Anthology. Finally, 159 studies proposing Open IE approaches were considered for data extraction. Results: Data analysis showed a significant increase in the number of studies published about Open IE in the last years. In addition, we provide important details about how these techniques were proposed (e.g., data sets used and output evaluation techniques). Results indicate that researchers started to adopt neural networks to perform Open IE instead of using conventional supervised learning techniques. Conclusion: Recent advances in Artificial Intelligence and neural networks techniques allowed scientists to have a new perspective on how to perform efficient textual data management. Therefore, Open IE approaches gained much attention as they can help in many contexts, especially in knowledge management tasks.