{"title":"Academic Aggregated Search Approach Based on BERT Language Model","authors":"Sanae Achsas, E. Nfaoui","doi":"10.1109/IRASET52964.2022.9737888","DOIUrl":null,"url":null,"abstract":"Academic Search concerns the indexing and retrieval of information objects like tutorials, blog posts, papers, books, etc. in the domain of academic research. When looking for a specific information, many researchers can't find all the information needed on the topic for which they are searching in the same academic search engine, which pushes them to search in each one separately. Therefore, we propose to exploit the task of aggregating search results from various sources in an academic domain. Exploiting the advantage of such systems may improve retrieval performance of academic search results by selecting the most appropriate vertical results according to the user's information need, and presenting them in a single page. Our proposed approach consists of exploiting BERT contextual embeddings in unsupervised fashion and taking advantage of their rich semantic features to improve the quality of the retrieved results. To the best of our knowledge, this is the first time when aggregated search paradigm is investigated in an in academic environment using BERT embeddings. Experimental results show that by only using BERT algorithm as a core component in our system, has given excellent results for our task and helped us achieved our objective in this work. In addition, a comparison with other state-of-the-art language models such as ELMo, USE and XLNet was made. The findings showed that the best performance was obtained using the BERT language model, which proved that using BERT model in our academic aggregated search system is a good choice.","PeriodicalId":377115,"journal":{"name":"2022 2nd International Conference on Innovative Research in Applied Science, Engineering and Technology (IRASET)","volume":"55 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 2nd International Conference on Innovative Research in Applied Science, Engineering and Technology (IRASET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IRASET52964.2022.9737888","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Academic Search concerns the indexing and retrieval of information objects like tutorials, blog posts, papers, books, etc. in the domain of academic research. When looking for a specific information, many researchers can't find all the information needed on the topic for which they are searching in the same academic search engine, which pushes them to search in each one separately. Therefore, we propose to exploit the task of aggregating search results from various sources in an academic domain. Exploiting the advantage of such systems may improve retrieval performance of academic search results by selecting the most appropriate vertical results according to the user's information need, and presenting them in a single page. Our proposed approach consists of exploiting BERT contextual embeddings in unsupervised fashion and taking advantage of their rich semantic features to improve the quality of the retrieved results. To the best of our knowledge, this is the first time when aggregated search paradigm is investigated in an in academic environment using BERT embeddings. Experimental results show that by only using BERT algorithm as a core component in our system, has given excellent results for our task and helped us achieved our objective in this work. In addition, a comparison with other state-of-the-art language models such as ELMo, USE and XLNet was made. The findings showed that the best performance was obtained using the BERT language model, which proved that using BERT model in our academic aggregated search system is a good choice.