Chandrika Prasad, Jagdish S. Kallimani, Divakar Harekal, N. Sharma
{"title":"基于Seq2Seq技术的自动文本摘要模型","authors":"Chandrika Prasad, Jagdish S. Kallimani, Divakar Harekal, N. Sharma","doi":"10.1109/I-SMAC49090.2020.9243572","DOIUrl":null,"url":null,"abstract":"Increasing acquisition of digitization over the information storing and processing in our daily lives has increased the demand of digitization in multiple facets including in investigation processes as well. In fact, for crimes involving computer systems requires the adoption of best practices for the process of evidence extraction from acquired devices from the crime scenes. Over the past years, summarization has become a topic of research. Various techniques of Natural Language Processing (NLP) enabling researchers to generate efficient results for a wide spectrum of documents. In the proposed work Seq2Seq Architecture with RNN is used to perform summarization tasks for documents. The nature of the summary is abstractive and allows the generation of internal meaning by the model itself. With refinement and continual work, this model becomes a strong foundation to perform summarization on longer and legal documents. The results are efficient summary generation and ROUGE scores in the range of 0.6 - 0.7.","PeriodicalId":432766,"journal":{"name":"2020 Fourth International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Automatic Text Summarization Model using Seq2Seq Technique\",\"authors\":\"Chandrika Prasad, Jagdish S. Kallimani, Divakar Harekal, N. Sharma\",\"doi\":\"10.1109/I-SMAC49090.2020.9243572\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Increasing acquisition of digitization over the information storing and processing in our daily lives has increased the demand of digitization in multiple facets including in investigation processes as well. In fact, for crimes involving computer systems requires the adoption of best practices for the process of evidence extraction from acquired devices from the crime scenes. Over the past years, summarization has become a topic of research. Various techniques of Natural Language Processing (NLP) enabling researchers to generate efficient results for a wide spectrum of documents. In the proposed work Seq2Seq Architecture with RNN is used to perform summarization tasks for documents. The nature of the summary is abstractive and allows the generation of internal meaning by the model itself. With refinement and continual work, this model becomes a strong foundation to perform summarization on longer and legal documents. The results are efficient summary generation and ROUGE scores in the range of 0.6 - 0.7.\",\"PeriodicalId\":432766,\"journal\":{\"name\":\"2020 Fourth International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC)\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-10-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 Fourth International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/I-SMAC49090.2020.9243572\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 Fourth International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/I-SMAC49090.2020.9243572","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Automatic Text Summarization Model using Seq2Seq Technique
Increasing acquisition of digitization over the information storing and processing in our daily lives has increased the demand of digitization in multiple facets including in investigation processes as well. In fact, for crimes involving computer systems requires the adoption of best practices for the process of evidence extraction from acquired devices from the crime scenes. Over the past years, summarization has become a topic of research. Various techniques of Natural Language Processing (NLP) enabling researchers to generate efficient results for a wide spectrum of documents. In the proposed work Seq2Seq Architecture with RNN is used to perform summarization tasks for documents. The nature of the summary is abstractive and allows the generation of internal meaning by the model itself. With refinement and continual work, this model becomes a strong foundation to perform summarization on longer and legal documents. The results are efficient summary generation and ROUGE scores in the range of 0.6 - 0.7.