{"title":"基于深度学习方法的答案选择任务研究","authors":"Na Wang, Ruoyan Chen, Kunming Du","doi":"10.1109/AEMCSE55572.2022.00100","DOIUrl":null,"url":null,"abstract":"Answer selection task is an important task in question answering systems. In this work, we propose several deep learning methods to address answer selection task. Current answer selection tasks use LSTM networks to learn the contextual information of query and candidate answer sequences, but the LSTM network suffers from the problem of gradient instability and fail to extract local information. Aiming to solve these problems, we first introduce fusion layer with residual ideas to alleviate gradient instability. Then we further introduce CNN networks to capture local n-gram information. In addition, we introduce one-way and two-way attention mechanism respectively, in order to capture the interaction between query and candidate answer, and further improve model performance. Experimental results of two public datasets InsuranceQA and WikiQA show that our methods outperform baseline methods, which conclude the effectiveness of our methods proposed.","PeriodicalId":309096,"journal":{"name":"2022 5th International Conference on Advanced Electronic Materials, Computers and Software Engineering (AEMCSE)","volume":"55 4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Study of Answer Selection Task Based on Deep Learning Methods\",\"authors\":\"Na Wang, Ruoyan Chen, Kunming Du\",\"doi\":\"10.1109/AEMCSE55572.2022.00100\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Answer selection task is an important task in question answering systems. In this work, we propose several deep learning methods to address answer selection task. Current answer selection tasks use LSTM networks to learn the contextual information of query and candidate answer sequences, but the LSTM network suffers from the problem of gradient instability and fail to extract local information. Aiming to solve these problems, we first introduce fusion layer with residual ideas to alleviate gradient instability. Then we further introduce CNN networks to capture local n-gram information. In addition, we introduce one-way and two-way attention mechanism respectively, in order to capture the interaction between query and candidate answer, and further improve model performance. Experimental results of two public datasets InsuranceQA and WikiQA show that our methods outperform baseline methods, which conclude the effectiveness of our methods proposed.\",\"PeriodicalId\":309096,\"journal\":{\"name\":\"2022 5th International Conference on Advanced Electronic Materials, Computers and Software Engineering (AEMCSE)\",\"volume\":\"55 4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 5th International Conference on Advanced Electronic Materials, Computers and Software Engineering (AEMCSE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AEMCSE55572.2022.00100\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 5th International Conference on Advanced Electronic Materials, Computers and Software Engineering (AEMCSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AEMCSE55572.2022.00100","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Study of Answer Selection Task Based on Deep Learning Methods
Answer selection task is an important task in question answering systems. In this work, we propose several deep learning methods to address answer selection task. Current answer selection tasks use LSTM networks to learn the contextual information of query and candidate answer sequences, but the LSTM network suffers from the problem of gradient instability and fail to extract local information. Aiming to solve these problems, we first introduce fusion layer with residual ideas to alleviate gradient instability. Then we further introduce CNN networks to capture local n-gram information. In addition, we introduce one-way and two-way attention mechanism respectively, in order to capture the interaction between query and candidate answer, and further improve model performance. Experimental results of two public datasets InsuranceQA and WikiQA show that our methods outperform baseline methods, which conclude the effectiveness of our methods proposed.