Analysis of Trends in Online Romanian News Using Semantic Models

A. Simion, M. Dascalu, Stefan Trausan-Matu
{"title":"Analysis of Trends in Online Romanian News Using Semantic Models","authors":"A. Simion, M. Dascalu, Stefan Trausan-Matu","doi":"10.1109/CSCS.2019.00075","DOIUrl":null,"url":null,"abstract":"Online news are currently the most frequently accessed source of information. Moreover, online media is bound to grow even further in significance and popularity due to the increasing usage of smart devices. As such, it becomes of great interest to study and analyze various trends in online media. Considering the sheer number of articles that are published every day, a manual approach to generate meaningful statistics for large-scale media networks is unfeasible. In this paper, we aim to analyze some of the most visited online Romanian news websites using various Natural Language Processing techniques. Our analysis has two main highlights: the extraction of trending topics and concepts from the news, together with the objective of ranking publications in accordance to their relative influence within our generated network. Unlike other systems which rely on assigning influence according to direct links or citations, we propose a novel ranking method based on intertextuality links identified using document embeddings.","PeriodicalId":352411,"journal":{"name":"2019 22nd International Conference on Control Systems and Computer Science (CSCS)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 22nd International Conference on Control Systems and Computer Science (CSCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSCS.2019.00075","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Online news are currently the most frequently accessed source of information. Moreover, online media is bound to grow even further in significance and popularity due to the increasing usage of smart devices. As such, it becomes of great interest to study and analyze various trends in online media. Considering the sheer number of articles that are published every day, a manual approach to generate meaningful statistics for large-scale media networks is unfeasible. In this paper, we aim to analyze some of the most visited online Romanian news websites using various Natural Language Processing techniques. Our analysis has two main highlights: the extraction of trending topics and concepts from the news, together with the objective of ranking publications in accordance to their relative influence within our generated network. Unlike other systems which rely on assigning influence according to direct links or citations, we propose a novel ranking method based on intertextuality links identified using document embeddings.
用语义模型分析罗马尼亚在线新闻的趋势
网络新闻是目前最常用的信息来源。此外,由于智能设备的日益普及,网络媒体的重要性和受欢迎程度必将进一步提高。因此,研究和分析网络媒体的各种趋势就变得非常有趣。考虑到每天发布的大量文章,为大型媒体网络生成有意义的统计数据的人工方法是不可行的。在本文中,我们的目标是使用各种自然语言处理技术分析一些访问量最大的在线罗马尼亚新闻网站。我们的分析有两个主要亮点:从新闻中提取热门话题和概念,以及根据出版物在我们生成的网络中的相对影响力对其进行排名。与其他依赖于根据直接链接或引用分配影响力的系统不同,我们提出了一种基于使用文档嵌入识别互文链接的新型排名方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信