{"title":"用CNN增强BERT进行选择题回答","authors":"Shishir Roy, Nayeem Ehtesham, Md Saiful Islam, Marium-E-Jannat","doi":"10.1109/ICCIT54785.2021.9689877","DOIUrl":null,"url":null,"abstract":"Multiple Choice Question (MCQ) answering is a strenuous task intended to determine the right answer from a set of given options. It demands a deep semantic understanding of the question, answer, and knowledge. In this article, we present a Convolutional Neural Networks (CNN) based model extended with Bidirectional Encoder Representations from Transformers (BERT) to answer complex multiple-choice questions. Given an article and a MCQ, the model selects the correct option by ranking each question-option tuple. The proposed CNN based model uses question-option tuple as input to perform better than Long Short-Term Memory(LSTM) based baselines by 22.7 for the Textbook Question Answering (TQA) [1] and SciQ [2] datasets.","PeriodicalId":166450,"journal":{"name":"2021 24th International Conference on Computer and Information Technology (ICCIT)","volume":"58 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Augmenting BERT with CNN for Multiple Choice Question Answering\",\"authors\":\"Shishir Roy, Nayeem Ehtesham, Md Saiful Islam, Marium-E-Jannat\",\"doi\":\"10.1109/ICCIT54785.2021.9689877\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Multiple Choice Question (MCQ) answering is a strenuous task intended to determine the right answer from a set of given options. It demands a deep semantic understanding of the question, answer, and knowledge. In this article, we present a Convolutional Neural Networks (CNN) based model extended with Bidirectional Encoder Representations from Transformers (BERT) to answer complex multiple-choice questions. Given an article and a MCQ, the model selects the correct option by ranking each question-option tuple. The proposed CNN based model uses question-option tuple as input to perform better than Long Short-Term Memory(LSTM) based baselines by 22.7 for the Textbook Question Answering (TQA) [1] and SciQ [2] datasets.\",\"PeriodicalId\":166450,\"journal\":{\"name\":\"2021 24th International Conference on Computer and Information Technology (ICCIT)\",\"volume\":\"58 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 24th International Conference on Computer and Information Technology (ICCIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCIT54785.2021.9689877\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 24th International Conference on Computer and Information Technology (ICCIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCIT54785.2021.9689877","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Augmenting BERT with CNN for Multiple Choice Question Answering
Multiple Choice Question (MCQ) answering is a strenuous task intended to determine the right answer from a set of given options. It demands a deep semantic understanding of the question, answer, and knowledge. In this article, we present a Convolutional Neural Networks (CNN) based model extended with Bidirectional Encoder Representations from Transformers (BERT) to answer complex multiple-choice questions. Given an article and a MCQ, the model selects the correct option by ranking each question-option tuple. The proposed CNN based model uses question-option tuple as input to perform better than Long Short-Term Memory(LSTM) based baselines by 22.7 for the Textbook Question Answering (TQA) [1] and SciQ [2] datasets.