使用TextRank算法的英语到印地语跨语言文本摘要器

IF 0.3

International Journal of Next-Generation Computing Pub Date : 2023-02-15 DOI:10.47164/ijngc.v14i1.1025

S. Rawat, Kavita B. Kalambe, Sagarika Jaywant, Lakshita Werulkar, Mukul Barbate, Tarrun Jaiswalt

{"title":"使用TextRank算法的英语到印地语跨语言文本摘要器","authors":"S. Rawat, Kavita B. Kalambe, Sagarika Jaywant, Lakshita Werulkar, Mukul Barbate, Tarrun Jaiswalt","doi":"10.47164/ijngc.v14i1.1025","DOIUrl":null,"url":null,"abstract":"Cross-Lingual Summarizer develops a gist of the extract written in English in the National Language of India Hindi. This helps non-anglophonic people to understand what the text says in Hindi. The extractive method of summarization is being used in this paper for summarizing the article. The summary generated in English is then translated into Hindi and made available for Hindi Readers. The Hindi readers get the heart of the article they want to read. Due to the Internet’s explosive growth, access to a vast amount of information is now efficient but getting harder and harder. An approach to text extraction summarization that captures the aboutness of the text document was discussed in this paper. One of the many uses for natural language processing (NLP) that significantly affects our daily lives is text summarization. Who has the time to read through complete articles, documents, or books to determine whether they are helpful with the expansion of digital media and the profusion of articles published? The technique was created using TextRank, which was determined using the idea of PageRank established for each page on a website. The presented approach builds a graph with sentences as nodes and the weight of the edge connecting two sentences as its nodes. Modified inverse sentence-cosine frequency similarity gives different words in a sentence different weights. The success of the procedure is demonstrated by the performance evaluation that supported the summary technique.","PeriodicalId":42021,"journal":{"name":"International Journal of Next-Generation Computing","volume":"64 1","pages":""},"PeriodicalIF":0.3000,"publicationDate":"2023-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"English to Hindi Cross-Lingual Text Summarizer using TextRank Algorithm\",\"authors\":\"S. Rawat, Kavita B. Kalambe, Sagarika Jaywant, Lakshita Werulkar, Mukul Barbate, Tarrun Jaiswalt\",\"doi\":\"10.47164/ijngc.v14i1.1025\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Cross-Lingual Summarizer develops a gist of the extract written in English in the National Language of India Hindi. This helps non-anglophonic people to understand what the text says in Hindi. The extractive method of summarization is being used in this paper for summarizing the article. The summary generated in English is then translated into Hindi and made available for Hindi Readers. The Hindi readers get the heart of the article they want to read. Due to the Internet’s explosive growth, access to a vast amount of information is now efficient but getting harder and harder. An approach to text extraction summarization that captures the aboutness of the text document was discussed in this paper. One of the many uses for natural language processing (NLP) that significantly affects our daily lives is text summarization. Who has the time to read through complete articles, documents, or books to determine whether they are helpful with the expansion of digital media and the profusion of articles published? The technique was created using TextRank, which was determined using the idea of PageRank established for each page on a website. The presented approach builds a graph with sentences as nodes and the weight of the edge connecting two sentences as its nodes. Modified inverse sentence-cosine frequency similarity gives different words in a sentence different weights. The success of the procedure is demonstrated by the performance evaluation that supported the summary technique.\",\"PeriodicalId\":42021,\"journal\":{\"name\":\"International Journal of Next-Generation Computing\",\"volume\":\"64 1\",\"pages\":\"\"},\"PeriodicalIF\":0.3000,\"publicationDate\":\"2023-02-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Next-Generation Computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.47164/ijngc.v14i1.1025\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Next-Generation Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.47164/ijngc.v14i1.1025","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

跨语言总结器开发的摘录的要点写在印度的国家语言印地语的英语。这有助于非英语国家的人理解印度语的文本内容。本文采用摘要提取法对文章进行总结。用英语生成的摘要然后被翻译成印地语，并提供给印地语读者。印度语读者能读到他们想读的文章的核心。由于互联网的爆炸式增长，获取大量的信息现在是高效的，但越来越难。本文讨论了一种捕获文本文档的相关度的文本提取摘要方法。自然语言处理(NLP)的众多用途之一是文本摘要，它对我们的日常生活产生了重大影响。谁有时间通读完整的文章、文件或书籍，以确定它们是否有助于数字媒体的扩张和大量发表的文章?该技术是使用TextRank创建的，它是使用为网站上的每个页面建立PageRank的想法确定的。该方法构建了一个以句子为节点的图，以连接两个句子的边的权重为节点。修正逆句-余弦频率相似度赋予句子中不同的词不同的权值。支持摘要技术的性能评估证明了该过程的成功。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

English to Hindi Cross-Lingual Text Summarizer using TextRank Algorithm

Cross-Lingual Summarizer develops a gist of the extract written in English in the National Language of India Hindi. This helps non-anglophonic people to understand what the text says in Hindi. The extractive method of summarization is being used in this paper for summarizing the article. The summary generated in English is then translated into Hindi and made available for Hindi Readers. The Hindi readers get the heart of the article they want to read. Due to the Internet’s explosive growth, access to a vast amount of information is now efficient but getting harder and harder. An approach to text extraction summarization that captures the aboutness of the text document was discussed in this paper. One of the many uses for natural language processing (NLP) that significantly affects our daily lives is text summarization. Who has the time to read through complete articles, documents, or books to determine whether they are helpful with the expansion of digital media and the profusion of articles published? The technique was created using TextRank, which was determined using the idea of PageRank established for each page on a website. The presented approach builds a graph with sentences as nodes and the weight of the edge connecting two sentences as its nodes. Modified inverse sentence-cosine frequency similarity gives different words in a sentence different weights. The success of the procedure is demonstrated by the performance evaluation that supported the summary technique.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Journal of Next-Generation Computing COMPUTER SCIENCE, THEORY & METHODS-

自引率

66.70%

发文量