使用TextRank算法的英语到印地语跨语言文本摘要器

IF 0.3
S. Rawat, Kavita B. Kalambe, Sagarika Jaywant, Lakshita Werulkar, Mukul Barbate, Tarrun Jaiswalt
{"title":"使用TextRank算法的英语到印地语跨语言文本摘要器","authors":"S. Rawat, Kavita B. Kalambe, Sagarika Jaywant, Lakshita Werulkar, Mukul Barbate, Tarrun Jaiswalt","doi":"10.47164/ijngc.v14i1.1025","DOIUrl":null,"url":null,"abstract":"Cross-Lingual Summarizer develops a gist of the extract written in English in the National Language of India Hindi. This helps non-anglophonic people to understand what the text says in Hindi. The extractive method of summarization is being used in this paper for summarizing the article. The summary generated in English is then translated into Hindi and made available for Hindi Readers. The Hindi readers get the heart of the article they want to read. Due to the Internet’s explosive growth, access to a vast amount of information is now efficient but getting harder and harder. An approach to text extraction summarization that captures the aboutness of the text document was discussed in this paper. One of the many uses for natural language processing (NLP) that significantly affects our daily lives is text summarization. Who has the time to read through complete articles, documents, or books to determine whether they are helpful with the expansion of digital media and the profusion of articles published? The technique was created using TextRank, which was determined using the idea of PageRank established for each page on a website. The presented approach builds a graph with sentences as nodes and the weight of the edge connecting two sentences as its nodes. Modified inverse sentence-cosine frequency similarity gives different words in a sentence different weights. The success of the procedure is demonstrated by the performance evaluation that supported the summary technique.","PeriodicalId":42021,"journal":{"name":"International Journal of Next-Generation Computing","volume":"64 1","pages":""},"PeriodicalIF":0.3000,"publicationDate":"2023-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"English to Hindi Cross-Lingual Text Summarizer using TextRank Algorithm\",\"authors\":\"S. Rawat, Kavita B. Kalambe, Sagarika Jaywant, Lakshita Werulkar, Mukul Barbate, Tarrun Jaiswalt\",\"doi\":\"10.47164/ijngc.v14i1.1025\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Cross-Lingual Summarizer develops a gist of the extract written in English in the National Language of India Hindi. This helps non-anglophonic people to understand what the text says in Hindi. The extractive method of summarization is being used in this paper for summarizing the article. The summary generated in English is then translated into Hindi and made available for Hindi Readers. The Hindi readers get the heart of the article they want to read. Due to the Internet’s explosive growth, access to a vast amount of information is now efficient but getting harder and harder. An approach to text extraction summarization that captures the aboutness of the text document was discussed in this paper. One of the many uses for natural language processing (NLP) that significantly affects our daily lives is text summarization. Who has the time to read through complete articles, documents, or books to determine whether they are helpful with the expansion of digital media and the profusion of articles published? The technique was created using TextRank, which was determined using the idea of PageRank established for each page on a website. The presented approach builds a graph with sentences as nodes and the weight of the edge connecting two sentences as its nodes. Modified inverse sentence-cosine frequency similarity gives different words in a sentence different weights. The success of the procedure is demonstrated by the performance evaluation that supported the summary technique.\",\"PeriodicalId\":42021,\"journal\":{\"name\":\"International Journal of Next-Generation Computing\",\"volume\":\"64 1\",\"pages\":\"\"},\"PeriodicalIF\":0.3000,\"publicationDate\":\"2023-02-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Next-Generation Computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.47164/ijngc.v14i1.1025\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Next-Generation Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.47164/ijngc.v14i1.1025","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

跨语言总结器开发的摘录的要点写在印度的国家语言印地语的英语。这有助于非英语国家的人理解印度语的文本内容。本文采用摘要提取法对文章进行总结。用英语生成的摘要然后被翻译成印地语,并提供给印地语读者。印度语读者能读到他们想读的文章的核心。由于互联网的爆炸式增长,获取大量的信息现在是高效的,但越来越难。本文讨论了一种捕获文本文档的相关度的文本提取摘要方法。自然语言处理(NLP)的众多用途之一是文本摘要,它对我们的日常生活产生了重大影响。谁有时间通读完整的文章、文件或书籍,以确定它们是否有助于数字媒体的扩张和大量发表的文章?该技术是使用TextRank创建的,它是使用为网站上的每个页面建立PageRank的想法确定的。该方法构建了一个以句子为节点的图,以连接两个句子的边的权重为节点。修正逆句-余弦频率相似度赋予句子中不同的词不同的权值。支持摘要技术的性能评估证明了该过程的成功。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
English to Hindi Cross-Lingual Text Summarizer using TextRank Algorithm
Cross-Lingual Summarizer develops a gist of the extract written in English in the National Language of India Hindi. This helps non-anglophonic people to understand what the text says in Hindi. The extractive method of summarization is being used in this paper for summarizing the article. The summary generated in English is then translated into Hindi and made available for Hindi Readers. The Hindi readers get the heart of the article they want to read. Due to the Internet’s explosive growth, access to a vast amount of information is now efficient but getting harder and harder. An approach to text extraction summarization that captures the aboutness of the text document was discussed in this paper. One of the many uses for natural language processing (NLP) that significantly affects our daily lives is text summarization. Who has the time to read through complete articles, documents, or books to determine whether they are helpful with the expansion of digital media and the profusion of articles published? The technique was created using TextRank, which was determined using the idea of PageRank established for each page on a website. The presented approach builds a graph with sentences as nodes and the weight of the edge connecting two sentences as its nodes. Modified inverse sentence-cosine frequency similarity gives different words in a sentence different weights. The success of the procedure is demonstrated by the performance evaluation that supported the summary technique.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
International Journal of Next-Generation Computing
International Journal of Next-Generation Computing COMPUTER SCIENCE, THEORY & METHODS-
自引率
66.70%
发文量
60
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信