Amita Arora, Akanksha Diwedy, Manjeet Singh, N. Chauhan
{"title":"文本摘要的机器学习方法","authors":"Amita Arora, Akanksha Diwedy, Manjeet Singh, N. Chauhan","doi":"10.14257/ijdta.2017.10.8.08","DOIUrl":null,"url":null,"abstract":"With the abundance of interminable text documents, providing summaries can help in retrieval of relevant information very quickly. The technique is to extract those sentences from the document that contain important information. This paper presents the results of our research on extractive summarization with a method based on Support Vector Machines (SVMs). The SVMs are trained using DUC-2002 dataset and the importance of sentences is judged on the basis of salient features. To evaluate the performance of our system, comparisons are conducted with two existing methods. ROUGE scores are used to compare the system generated summaries with the human generated summaries, and the experimental results show that our system's performance achieved high metrics.","PeriodicalId":13926,"journal":{"name":"International journal of database theory and application","volume":"5 1","pages":"83-90"},"PeriodicalIF":0.0000,"publicationDate":"2017-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Machine Learning Approach for Text Summarization\",\"authors\":\"Amita Arora, Akanksha Diwedy, Manjeet Singh, N. Chauhan\",\"doi\":\"10.14257/ijdta.2017.10.8.08\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the abundance of interminable text documents, providing summaries can help in retrieval of relevant information very quickly. The technique is to extract those sentences from the document that contain important information. This paper presents the results of our research on extractive summarization with a method based on Support Vector Machines (SVMs). The SVMs are trained using DUC-2002 dataset and the importance of sentences is judged on the basis of salient features. To evaluate the performance of our system, comparisons are conducted with two existing methods. ROUGE scores are used to compare the system generated summaries with the human generated summaries, and the experimental results show that our system's performance achieved high metrics.\",\"PeriodicalId\":13926,\"journal\":{\"name\":\"International journal of database theory and application\",\"volume\":\"5 1\",\"pages\":\"83-90\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-08-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International journal of database theory and application\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.14257/ijdta.2017.10.8.08\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of database theory and application","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.14257/ijdta.2017.10.8.08","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
With the abundance of interminable text documents, providing summaries can help in retrieval of relevant information very quickly. The technique is to extract those sentences from the document that contain important information. This paper presents the results of our research on extractive summarization with a method based on Support Vector Machines (SVMs). The SVMs are trained using DUC-2002 dataset and the importance of sentences is judged on the basis of salient features. To evaluate the performance of our system, comparisons are conducted with two existing methods. ROUGE scores are used to compare the system generated summaries with the human generated summaries, and the experimental results show that our system's performance achieved high metrics.