{"title":"基于拥抱脸预训练模型的COVID-19科研主题自动文本摘要","authors":"Sakdipat Ontoum, Jonathan H. Chan","doi":"10.1109/RI2C56397.2022.9910274","DOIUrl":null,"url":null,"abstract":"Automated text summarizing helps the scientific and medical sectors by identifying and extracting relevant information from articles. Automatic text summarization is a way of compressing text documents so that users may find important and useful information in the original text in reduced time. We will first review some new works in the field of summarization that uses deep learning approaches, and then we will explain the application to COVID-19 related research papers. The ease with which a reader can grasp written text is referred to as the readability test. The substance of text determines its readability in natural language processing. We constructed word clouds using the abstracts’ most commonly used text. By looking at those three measurements, we can determine the performance measures of ROUGE-1, ROUGE-2, ROUGE-L, ROUGE-L-SUM. Our findings indicated that Distilbart-mnli-12-6 and GPT2-large outperform than others considered.","PeriodicalId":403083,"journal":{"name":"2022 Research, Invention, and Innovation Congress: Innovative Electricals and Electronics (RI2C)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Automatic Text Summarization of COVID-19 Scientific Research Topics Using Pre-trained Models from Hugging Face\",\"authors\":\"Sakdipat Ontoum, Jonathan H. Chan\",\"doi\":\"10.1109/RI2C56397.2022.9910274\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Automated text summarizing helps the scientific and medical sectors by identifying and extracting relevant information from articles. Automatic text summarization is a way of compressing text documents so that users may find important and useful information in the original text in reduced time. We will first review some new works in the field of summarization that uses deep learning approaches, and then we will explain the application to COVID-19 related research papers. The ease with which a reader can grasp written text is referred to as the readability test. The substance of text determines its readability in natural language processing. We constructed word clouds using the abstracts’ most commonly used text. By looking at those three measurements, we can determine the performance measures of ROUGE-1, ROUGE-2, ROUGE-L, ROUGE-L-SUM. Our findings indicated that Distilbart-mnli-12-6 and GPT2-large outperform than others considered.\",\"PeriodicalId\":403083,\"journal\":{\"name\":\"2022 Research, Invention, and Innovation Congress: Innovative Electricals and Electronics (RI2C)\",\"volume\":\"24 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-08-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 Research, Invention, and Innovation Congress: Innovative Electricals and Electronics (RI2C)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/RI2C56397.2022.9910274\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 Research, Invention, and Innovation Congress: Innovative Electricals and Electronics (RI2C)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RI2C56397.2022.9910274","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Automatic Text Summarization of COVID-19 Scientific Research Topics Using Pre-trained Models from Hugging Face
Automated text summarizing helps the scientific and medical sectors by identifying and extracting relevant information from articles. Automatic text summarization is a way of compressing text documents so that users may find important and useful information in the original text in reduced time. We will first review some new works in the field of summarization that uses deep learning approaches, and then we will explain the application to COVID-19 related research papers. The ease with which a reader can grasp written text is referred to as the readability test. The substance of text determines its readability in natural language processing. We constructed word clouds using the abstracts’ most commonly used text. By looking at those three measurements, we can determine the performance measures of ROUGE-1, ROUGE-2, ROUGE-L, ROUGE-L-SUM. Our findings indicated that Distilbart-mnli-12-6 and GPT2-large outperform than others considered.