{"title":"CollabAS2: Enhancing Arabic Answer Sentence Selection Using Transformer-Based Collaborative Models","authors":"Asma Aouichat, Ahmed Guessoum","doi":"10.1007/s13369-024-09345-3","DOIUrl":null,"url":null,"abstract":"<p>Accurately identifying pertinent text segments as answers to questions is crucial for optimizing question-answering systems, underscoring the pivotal role of precision in Answer Sentence Selection (AS2) modules. This study introduces an innovative AS2 module design leveraging the AraBERT transformer to encode inputs-one for the question and one for the candidate answer-with the goal of enhancing comprehension of both inputs. Each encoded input is subsequently processed in parallel by a collaborative layer employing two distinct deep learning models: a bidirectional long short-term memory (BiLSTM) and a convolutional neural network (CNN). This collaborative approach forms the AraBERT.Collab-BiLSTM/CNN model. Additionally, extensions to the study include AraBERT.Collab-BiLSTM/AVG, incorporating a BiLSTM and AVG collaboration layer, as well as the use of the AraELECTRA pre-trained model, yielding the AraELECTRA.Collab-BiLSTM/CNN and AraELECTRA.Collab-BiLSTM/AVG configurations. Furthermore, the study investigates Arabic word embedding models as alternatives to pre-trained models, resulting in the WordEmb.Collab-BiLSTM/CNN and WordEmb.Collab-BiLSTM/AVG models. Experimental results on our BARAQA (Big-ARAbic-Question-Answering) dataset and the SemEval Arabic Question-Answering corpus demonstrate that the AraELECTRA.Collab-BiLSTM/CNN model achieves high accuracies of 84.64% and 45.93%, respectively. Moreover, the WordEmb.Collab-BiLSTM/AVG model significantly enhances accuracy to 91.61% and 81.23% on the respective datasets, showcasing the effectiveness of our collaborative techniques. Our proposed architecture represents a substantial improvement over previous models, emphasizing the importance of advanced techniques and collaborative strategies in handling complex language structures and diverse text dependencies. Additionally, the study underscores the performance of Arabic transformer-based encoding and suggests further exploration of transformers and collaborative strategies to bolster AS2 performance.</p>","PeriodicalId":8109,"journal":{"name":"Arabian Journal for Science and Engineering","volume":"36 1","pages":""},"PeriodicalIF":2.9000,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Arabian Journal for Science and Engineering","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1007/s13369-024-09345-3","RegionNum":4,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Multidisciplinary","Score":null,"Total":0}
引用次数: 0
Abstract
Accurately identifying pertinent text segments as answers to questions is crucial for optimizing question-answering systems, underscoring the pivotal role of precision in Answer Sentence Selection (AS2) modules. This study introduces an innovative AS2 module design leveraging the AraBERT transformer to encode inputs-one for the question and one for the candidate answer-with the goal of enhancing comprehension of both inputs. Each encoded input is subsequently processed in parallel by a collaborative layer employing two distinct deep learning models: a bidirectional long short-term memory (BiLSTM) and a convolutional neural network (CNN). This collaborative approach forms the AraBERT.Collab-BiLSTM/CNN model. Additionally, extensions to the study include AraBERT.Collab-BiLSTM/AVG, incorporating a BiLSTM and AVG collaboration layer, as well as the use of the AraELECTRA pre-trained model, yielding the AraELECTRA.Collab-BiLSTM/CNN and AraELECTRA.Collab-BiLSTM/AVG configurations. Furthermore, the study investigates Arabic word embedding models as alternatives to pre-trained models, resulting in the WordEmb.Collab-BiLSTM/CNN and WordEmb.Collab-BiLSTM/AVG models. Experimental results on our BARAQA (Big-ARAbic-Question-Answering) dataset and the SemEval Arabic Question-Answering corpus demonstrate that the AraELECTRA.Collab-BiLSTM/CNN model achieves high accuracies of 84.64% and 45.93%, respectively. Moreover, the WordEmb.Collab-BiLSTM/AVG model significantly enhances accuracy to 91.61% and 81.23% on the respective datasets, showcasing the effectiveness of our collaborative techniques. Our proposed architecture represents a substantial improvement over previous models, emphasizing the importance of advanced techniques and collaborative strategies in handling complex language structures and diverse text dependencies. Additionally, the study underscores the performance of Arabic transformer-based encoding and suggests further exploration of transformers and collaborative strategies to bolster AS2 performance.
期刊介绍:
King Fahd University of Petroleum & Minerals (KFUPM) partnered with Springer to publish the Arabian Journal for Science and Engineering (AJSE).
AJSE, which has been published by KFUPM since 1975, is a recognized national, regional and international journal that provides a great opportunity for the dissemination of research advances from the Kingdom of Saudi Arabia, MENA and the world.