ACM Transactions on the Web最新文献

筛选
英文 中文
B-TTDb: A Database of Turkish Tweets for Predicting the Top One Hundred Emojis B-TTDb:用于预测百大表情符号的土耳其推文数据库
IF 2.6 4区 计算机科学
ACM Transactions on the Web Pub Date : 2024-07-24 DOI: 10.1145/3681783
Y. Bi̇ti̇ri̇m
{"title":"B-TTDb: A Database of Turkish Tweets for Predicting the Top One Hundred Emojis","authors":"Y. Bi̇ti̇ri̇m","doi":"10.1145/3681783","DOIUrl":"https://doi.org/10.1145/3681783","url":null,"abstract":"Emoji prediction is an important research task that focuses on finding the most appropriate emoji(s) quickly and effortlessly for a specific text. Now that Turkish is on the list of the top 20 most spoken languages in the world and there are a considerable number of Turkish-speaking social media users, studying emoji prediction in Turkish holds significant value. In this study, a Turkish tweets database, named Bitirim's Turkish Tweets Database (B-TTDb), was constructed for academic and industrial studies based on the prediction of the top 100 emojis. B-TTDb consists of four datasets. The first dataset includes raw tweets, the second dataset is the organized version of the first dataset, the third dataset is the pre-processed version of the second dataset, and the last one is the organized version of the third dataset. The last one is the final version and it is named Bitirim's Dataset (B-D). It includes a total of 158,201 unique tweets belonging to the top 100 emoji classes. For database validation, experiments were conducted on B-D with popular machine learning algorithms for the top 10, 20, 50, and 100 emojis. This study could be considered as the first study that contributes to the literature by the first validated large database of Turkish tweets that includes such a large number of emojis. In addition, B-TTDb could be a basis as well as motivation for various further studies.","PeriodicalId":50940,"journal":{"name":"ACM Transactions on the Web","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141807431","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
INCEPT: A Framework for Duplicate Posts Classification with Combined Text Representations INCEPT:利用组合文本表示法进行重复帖子分类的框架
IF 2.6 4区 计算机科学
ACM Transactions on the Web Pub Date : 2024-07-15 DOI: 10.1145/3677322
Erjon Skenderi, Jukka Huhtamäki, Salla-Maaria Laaksonen, Kostas Stefanidis
{"title":"INCEPT: A Framework for Duplicate Posts Classification with Combined Text Representations","authors":"Erjon Skenderi, Jukka Huhtamäki, Salla-Maaria Laaksonen, Kostas Stefanidis","doi":"10.1145/3677322","DOIUrl":"https://doi.org/10.1145/3677322","url":null,"abstract":"Dealing with many of the problems related to the quality of textual content online involves identifying similar content. Algorithmic solutions for duplicate content classification typically rely on text vector representation, which maps textual information into a set of features. Ideally, this representation would capture all aspects of the underlying text, including length, word frequencies, syntax, and semantics. While recent advancements in text representation have led to improved performance, a comprehensive approach that explicitly incorporates all text features has not yet been proposed. In this study, we present the INCEPT framework that utilizes multiple representation methods to detect duplicate text pairs, taking advantage of their individual strengths. The core of our approach involves using a stacking ensemble of pairwise vector distance measurements that are computed from multiple text representation methods. A stacking classifier then utilizes these distance scores as input and learns to identify duplicate posts. We assess the proposed framework’s effectiveness in identifying duplicate posts in an online Question and Answer platform. By combining several text representation methods, INCEPT performs well in the duplicate posts classification task. Our experiments demonstrate that specific framework configurations outperform the accuracy scores obtained from individual text representation methods. Therefore, we also infer that no single text representation method can independently capture a text’s features.","PeriodicalId":50940,"journal":{"name":"ACM Transactions on the Web","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2024-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141647670","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DCDIMB: Dynamic Community-based Diversified Influence Maximization using Bridge Nodes DCDIMB:利用桥节点实现基于社区的动态多元化影响力最大化
IF 3.5 4区 计算机科学
ACM Transactions on the Web Pub Date : 2024-05-11 DOI: 10.1145/3664618
Sunil Meena, SHASHANK SINGH, Kuldeep Singh
{"title":"DCDIMB: Dynamic Community-based Diversified Influence Maximization using Bridge Nodes","authors":"Sunil Meena, SHASHANK SINGH, Kuldeep Singh","doi":"10.1145/3664618","DOIUrl":"https://doi.org/10.1145/3664618","url":null,"abstract":"<p>Influence maximization (IM) is the fundamental study of social network analysis. The IM problem finds the top <i>k</i> nodes that have maximum influence in the network. Most of the studies in IM focus on maximizing the number of activated nodes in the static social network. But in real life, social networks are dynamic in nature. This work addresses the diversification of activated nodes in the dynamic social network. This work proposes an objective function that maximizes the number of communities by utilizing bridge nodes. We also propose a diffusion model that considers the role of inactive nodes in influencing a node. We prove the submodularity, and monotonicity of the objective function under the proposed diffusion model. This work analyzes the impact of different ratios of bridge nodes in the seed set on real-world and synthetic datasets. Further, we prove the NP-Hardness of the objective function under the proposed diffusion model. The experiments are conducted on various real-world and synthetic datasets with known and unknown community information. The proposed work experimentally shows that the objective function gives the maximum number of communities considering bridge nodes compared to the benchmark algorithms.</p>","PeriodicalId":50940,"journal":{"name":"ACM Transactions on the Web","volume":null,"pages":null},"PeriodicalIF":3.5,"publicationDate":"2024-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140925933","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Know their Customers: An Empirical Study of Online Account Enumeration Attacks 了解他们的客户:在线账户枚举攻击实证研究
IF 3.5 4区 计算机科学
ACM Transactions on the Web Pub Date : 2024-05-07 DOI: 10.1145/3664201
Maël Maceiras, Kavous Salehzadeh Niksirat, Gaël Bernard, Benoit Garbinato, Mauro Cherubini, Mathias Humbert, Kévin Huguenin
{"title":"Know their Customers: An Empirical Study of Online Account Enumeration Attacks","authors":"Maël Maceiras, Kavous Salehzadeh Niksirat, Gaël Bernard, Benoit Garbinato, Mauro Cherubini, Mathias Humbert, Kévin Huguenin","doi":"10.1145/3664201","DOIUrl":"https://doi.org/10.1145/3664201","url":null,"abstract":"<p>Internet users possess accounts on dozens of online services where they are often identified by one of their e-mail addresses. They often use the same address on multiple services and for communicating with their contacts. In this paper, we investigate attacks that enable an adversary (e.g., company, friend) to determine (stealthily or not) whether an individual, identified by their e-mail address, has an account on certain services (i.e., an <i>account enumeration attack</i>). Such attacks on <i>account privacy</i> have serious implications as information about one’s accounts can be used to (1) profile them and (2) improve the effectiveness of phishing. We take a multifaceted approach and study these attacks through a combination of experiments (63 services), surveys (318 respondents), and focus groups (13 participants). We demonstrate the high vulnerability of popular services (93.7%) and the concerns of users about their account privacy, as well as their increased susceptibility to phishing e-mails that impersonate services on which they have an account. We also provide findings on the challenges in implementing countermeasures for service providers and on users’ ideas for enhancing their account privacy. Finally, our interaction with national data protection authorities led to the inclusion of recommendations in their developers’ guide.</p>","PeriodicalId":50940,"journal":{"name":"ACM Transactions on the Web","volume":null,"pages":null},"PeriodicalIF":3.5,"publicationDate":"2024-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140885847","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning Dynamic Multimodal Network Slot Concepts from the Web for Forecasting Environmental, Social and Governance Ratings 从网络中学习动态多模态网络插槽概念,用于预测环境、社会和治理评级
IF 3.5 4区 计算机科学
ACM Transactions on the Web Pub Date : 2024-05-03 DOI: 10.1145/3663674
Gary Ang, Ee-Peng Lim
{"title":"Learning Dynamic Multimodal Network Slot Concepts from the Web for Forecasting Environmental, Social and Governance Ratings","authors":"Gary Ang, Ee-Peng Lim","doi":"10.1145/3663674","DOIUrl":"https://doi.org/10.1145/3663674","url":null,"abstract":"<p>Dynamic multimodal networks are networks with node attributes from different modalities where the attributes and network relationships evolve across time, i.e. both networks and multimodal attributes are dynamic. For example, dynamic relationship networks between companies that evolve across time due to changes in business strategies and alliances, which are associated with dynamic company attributes from multiple modalities such as textual online news, categorical events, and numerical financial-related data. Such information can be useful in predictive tasks involving companies. Environmental, social and governance (ESG) ratings of companies are important for assessing the sustainability risks of companies. The process of generating ESG ratings by expert analysts is however laborious and time-intensive. We thus explore the use of dynamic multimodal networks extracted from the web for forecasting ESG ratings. Learning such dynamic multimodal networks from the web for forecasting ESG ratings is however challenging due to its heterogeneity, and the low signal-to-noise ratios and non-stationary distributions of web information. Human analysts cope with such issues by learning concepts from past experience through relational thinking, and scanning for such concepts when analyzing new information about a company. In this paper, we propose the Dynamic Multimodal Slot Concept Attention-based Network (DynScan) model. DynScan utilizes slot attention mechanisms together with slot concept alignment and disentanglement loss functions to learn latent slot concepts from dynamic multimodal networks to improve performance on ESG rating forecasting tasks. DynScan is evaluated on forecasting tasks on six data sets, comprising three ESG ratings across two sets of companies. Our experiments show that DynScan outperforms other state-of-the-art models on these forecasting tasks. We also visualize the slot concepts learnt by DynScan on five synthetic datasets and three real-world datasets and observe distinct and meaningful slot concepts being learnt by DynScan across both synthetic and real-world datasets.</p>","PeriodicalId":50940,"journal":{"name":"ACM Transactions on the Web","volume":null,"pages":null},"PeriodicalIF":3.5,"publicationDate":"2024-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140830824","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MuLX-QA: Classifying Multi-Labels and Extracting Rationale Spans in Social Media Posts MuLX-QA:对社交媒体帖子中的多标签进行分类并提取理由跨度
IF 3.5 4区 计算机科学
ACM Transactions on the Web Pub Date : 2024-03-21 DOI: 10.1145/3653303
Soham Poddar, Rajdeep Mukherjee, Azlaan Mustafa Samad, Niloy Ganguly, Saptarshi Ghosh
{"title":"MuLX-QA: Classifying Multi-Labels and Extracting Rationale Spans in Social Media Posts","authors":"Soham Poddar, Rajdeep Mukherjee, Azlaan Mustafa Samad, Niloy Ganguly, Saptarshi Ghosh","doi":"10.1145/3653303","DOIUrl":"https://doi.org/10.1145/3653303","url":null,"abstract":"<p>While social media platforms play an important role in our daily lives in obtaining the latest news and trends from across the globe, they are known to be prone to widespread proliferation of harmful information in different forms leading to misconceptions among the masses. Accordingly, several prior works have attempted to tag social media posts with labels/classes reflecting their veracity, sentiments, hate content, etc. However, in order to have a convincing impact, it is important to additionally extract the post snippets on which the labelling decision is based. We call such a post snippet as the ‘rationale’. These rationales significantly improve human trust and debuggability of the predictions, especially when detecting misinformation or stigmas from social media posts. These rationale spans or snippets are also helpful in post-classification social analysis, such as for finding out the target communities in hate-speech, or for understanding the arguments or concerns against the intake of vaccines. Also it is observed that a post may express multiple notions of misinformation, hate, sentiment, etc. Thus, the task of determining (one or multiple) labels for a given piece of text, along with the <i>text snippets explaining the rationale behind each of the identified labels</i> is a challenging <i>multi-label, multi-rationale</i> classification task, which is still nascent in the literature. </p><p>While <i>transformer</i>-based encoder-decoder generative models such as BART and T5 are well-suited for the task, in this work we show how a relatively simpler <b>encoder-only</b> discriminative question-answering (QA) model can be effectively trained using <b>simple template-based questions</b> to accomplish the task. We thus propose <b>MuLX-QA</b> and demonstrate its utility in producing (label, rationale span) pairs in two different settings: <i>multi-class</i> (on the <i>HateXplain</i> dataset related to hate speech on social media), and <i>multi-label</i> (on the <i>CAVES</i> dataset related to COVID-19 anti-vaccine concerns). <b>MuLX-QA outperforms heavier generative models</b> in both settings. We also demonstrate the relative advantage of our proposed model MuLX-QA over strong baselines when trained with limited data. We perform several ablation studies, and experiments to better understand the effect of training MuLX-QA with different question prompts, and draw interesting inferences. Additionally, we show that MuLX-QA is effective on social media posts in resource-poor non-English languages as well. Finally, we perform a qualitative analysis of our model predictions and compare them with those of our strongest baseline.</p>","PeriodicalId":50940,"journal":{"name":"ACM Transactions on the Web","volume":null,"pages":null},"PeriodicalIF":3.5,"publicationDate":"2024-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140200422","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Heterogeneous Graph Neural Network with Personalized and Adaptive Diversity for News Recommendation 用于新闻推荐的具有个性化和自适应多样性的异构图神经网络
IF 3.5 4区 计算机科学
ACM Transactions on the Web Pub Date : 2024-03-08 DOI: 10.1145/3649886
Guangping Zhang, Dongsheng Li, Hansu Gu, Tun Lu, Ning Gu
{"title":"Heterogeneous Graph Neural Network with Personalized and Adaptive Diversity for News Recommendation","authors":"Guangping Zhang, Dongsheng Li, Hansu Gu, Tun Lu, Ning Gu","doi":"10.1145/3649886","DOIUrl":"https://doi.org/10.1145/3649886","url":null,"abstract":"<p>The emergence of online media has facilitated the dissemination of news, but has also introduced the problem of information overload. To address this issue, providing users with accurate and diverse news recommendations has become increasingly important. News possesses rich and heterogeneous content, and the factors that attract users to news reading are varied. Consequently, accurate news recommendation requires modeling of both the heterogeneous content of news and the heterogeneous user-news relationships. Furthermore, users’ news consumption is highly dynamic, which is reflected in the differences in topic concentration among different users and in the real-time changes in user interests. To this end, we propose a Heterogeneous Graph Neural Network with Personalized and Adaptive Diversity for News Recommendation (DivHGNN). DivHGNN first represents the heterogeneous content of news and the heterogeneous user-news relationships as an attributed heterogeneous graph. Then, through a heterogeneous node content adapter, it models the heterogeneous node attributes into aligned and fused node representations. With the proposed attributed heterogeneous graph neural network, DivHGNN integrates the heterogeneous relationships to enhance node representation for accurate news recommendations. We also discuss relation pruning, model deployment, and cold-start issues to further improve model efficiency. In terms of diversity, DivHGNN simultaneously models the variance of nodes through variational representation learning for providing personalized diversity. Additionally, a time-continuous exponentially decaying distribution cache is proposed to model the temporal dynamics of user real-time interests for providing adaptive diversity. Extensive experiments on real-world news datasets demonstrate the effectiveness of the proposed method.</p>","PeriodicalId":50940,"journal":{"name":"ACM Transactions on the Web","volume":null,"pages":null},"PeriodicalIF":3.5,"publicationDate":"2024-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140073483","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fuzzy Influence Maximization in Social Networks 社交网络中的模糊影响力最大化
IF 3.5 4区 计算机科学
ACM Transactions on the Web Pub Date : 2024-03-01 DOI: 10.1145/3650179
Ahmad Zareie, Rizos Sakellariou
{"title":"Fuzzy Influence Maximization in Social Networks","authors":"Ahmad Zareie, Rizos Sakellariou","doi":"10.1145/3650179","DOIUrl":"https://doi.org/10.1145/3650179","url":null,"abstract":"<p>Influence maximization is a fundamental problem in social network analysis. This problem refers to the identification of a set of influential users as initial spreaders to maximize the spread of a message in a network. When such a message is spread, some users may be influenced by it. A common assumption of existing work is that the impact of a message is essentially binary: a user is either influenced (activated) or not influenced (non-activated). However, how strongly a user is influenced by a message may play an important role in this user’s attempt to influence subsequent users and spread the message further; existing methods may fail to model accurately the spreading process and identify influential users. In this paper, we propose a novel approach to model a social network as a fuzzy graph where a fuzzy variable is used to represent the extent to which a user is influenced by a message (user’s activation level). By extending a diffusion model to simulate the spreading process in such a fuzzy graph we conceptually formulate the fuzzy influence maximization problem for which three methods are proposed to identify influential users. Experimental results demonstrate the accuracy of the proposed methods in determining influential users in social networks.</p>","PeriodicalId":50940,"journal":{"name":"ACM Transactions on the Web","volume":null,"pages":null},"PeriodicalIF":3.5,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140003813","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Envisioning Information Access Systems: What Makes for Good Tools and a Healthy Web? 设想信息获取系统:什么是好的工具和健康的网络?
IF 3.5 4区 计算机科学
ACM Transactions on the Web Pub Date : 2024-02-26 DOI: 10.1145/3649468
Chirag Shah, Emily M. Bender
{"title":"Envisioning Information Access Systems: What Makes for Good Tools and a Healthy Web?","authors":"Chirag Shah, Emily M. Bender","doi":"10.1145/3649468","DOIUrl":"https://doi.org/10.1145/3649468","url":null,"abstract":"<p>We observe a recent trend towards applying large language models (LLMs) in search and positioning them as effective information access systems. While the interfaces may look appealing and the apparent breadth of applicability is exciting, we are concerned that the field is rushing ahead with a technology without sufficient study of the uses it is meant to serve, how it would be used, and what its use would mean. We argue that it is important to reassert the central research focus of the field of information retrieval, because information access is not merely an application to be solved by the so-called ‘AI’ techniques du jour. Rather, it is a key human activity, with impacts on both individuals and society. As information scientists, we should be asking what do people and society want and need from information access systems and how do we design and build systems to meet those needs? With that goal, in this conceptual paper we investigate fundamental questions concerning information access from user and societal viewpoints. We revisit foundational work related to information behavior, information seeking, information retrieval, information filtering, and information access to resurface what we know about these fundamental questions and what may be missing. We then provide our conceptual framing about how we could fill this gap, focusing on methods as well as experimental and evaluation frameworks. We consider the Web as an information ecosystem and explore the ways in which synthetic media, produced by LLMs and otherwise, endangers that ecosystem. The primary goal of this conceptual paper is to shed light on what we still do not know about the potential impacts of LLM-based information access systems, how to advance our understanding of user behaviors, and where the next generations of students, scholars, and developers could fruitfully invest their energies.</p>","PeriodicalId":50940,"journal":{"name":"ACM Transactions on the Web","volume":null,"pages":null},"PeriodicalIF":3.5,"publicationDate":"2024-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139968433","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exif2Vec: A Framework to Ascertain Untrustworthy Crowdsourced Images Using Metadata Exif2Vec:利用元数据确定不可信的众包图像的框架
IF 3.5 4区 计算机科学
ACM Transactions on the Web Pub Date : 2024-02-13 DOI: 10.1145/3645094
Muhammad Umair, Athman Bouguettaya, Abdallah Lakhdari, Mourad Ouzzani, Yuyun Liu
{"title":"Exif2Vec: A Framework to Ascertain Untrustworthy Crowdsourced Images Using Metadata","authors":"Muhammad Umair, Athman Bouguettaya, Abdallah Lakhdari, Mourad Ouzzani, Yuyun Liu","doi":"10.1145/3645094","DOIUrl":"https://doi.org/10.1145/3645094","url":null,"abstract":"<p>In the context of social media, the integrity of images is often dubious. To tackle this challenge, we introduce <i>Exif2Vec</i>, a novel framework specifically designed to discover modifications in social media images. The proposed framework leverages an image’s metadata to discover changes in an image. We use a service-oriented approach that considers <i>discovery of changes in images</i> as a <i>service</i>. A novel word-embedding based approach is proposed to discover semantic inconsistencies in an image metadata that are reflective of the changes in an image. These inconsistencies are used to measure the severity of changes. The novelty of the approach resides in that it does not require the use of images to determine the underlying changes. We use a pretrained Word2Vec model to conduct experiments. The model is validated on two different fact-checked image datasets, i.e., images related to general context and a context specific image dataset. Notably, our findings showcase the remarkable efficacy of our approach, yielding results of up to 80% accuracy. This underscores the potential of our framework.</p>","PeriodicalId":50940,"journal":{"name":"ACM Transactions on the Web","volume":null,"pages":null},"PeriodicalIF":3.5,"publicationDate":"2024-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139770919","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信