改进的方法来帮助无监督的基于证据的事实核查在线健康新闻

J. Data Intell. Pub Date : 1900-01-01 DOI:10.26421/jdi3.4-5

Pritam Deka, Anna Jurek-Loughrey, P Deepak

{"title":"改进的方法来帮助无监督的基于证据的事实核查在线健康新闻","authors":"Pritam Deka, Anna Jurek-Loughrey, P Deepak","doi":"10.26421/jdi3.4-5","DOIUrl":null,"url":null,"abstract":"False information in the domain of online health related articles is of great concern, which has been witnessed abundantly in the current pandemic situation of Covid-19. Recent advancements in the field of Machine Learning and Natural Language Processing can be leveraged to aid people in distinguishing false information from the truth in the domain of online health articles. Whilst there has been substantial progress in this space over the years, research in this area has mainly focused on the sphere of political news. Health fake news is markedly different from fake news in the political context as health information should be evaluated against the most recent and reliable medical resources such as scholarly repositories. However, one of the challenges with such an approach is the retrieval of the pertinent resources. In this work, we formulate two techniques for the retrieval of the most relevant authoritative and reliable medical content from scholarly repositories which can be used to assess veracity of an online health article. The first technique is an unsupervised method of generating queries from claims which are extracted from an online health article. We propose a three-step approach for it and illustrate that our method is able to generate effective queries which can be used for retrieval of information from medical knowledge databases. The second method involves a filtering approach for extracting the most relevant information for the claims. We show how this can be achieved with the help of state of the art transformer models and illustrate it's effectiveness over other methods.","PeriodicalId":232625,"journal":{"name":"J. Data Intell.","volume":"153 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Improved Methods to Aid Unsupervised Evidence-Based Fact Checking for Online Heath News\",\"authors\":\"Pritam Deka, Anna Jurek-Loughrey, P Deepak\",\"doi\":\"10.26421/jdi3.4-5\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"False information in the domain of online health related articles is of great concern, which has been witnessed abundantly in the current pandemic situation of Covid-19. Recent advancements in the field of Machine Learning and Natural Language Processing can be leveraged to aid people in distinguishing false information from the truth in the domain of online health articles. Whilst there has been substantial progress in this space over the years, research in this area has mainly focused on the sphere of political news. Health fake news is markedly different from fake news in the political context as health information should be evaluated against the most recent and reliable medical resources such as scholarly repositories. However, one of the challenges with such an approach is the retrieval of the pertinent resources. In this work, we formulate two techniques for the retrieval of the most relevant authoritative and reliable medical content from scholarly repositories which can be used to assess veracity of an online health article. The first technique is an unsupervised method of generating queries from claims which are extracted from an online health article. We propose a three-step approach for it and illustrate that our method is able to generate effective queries which can be used for retrieval of information from medical knowledge databases. The second method involves a filtering approach for extracting the most relevant information for the claims. We show how this can be achieved with the help of state of the art transformer models and illustrate it's effectiveness over other methods.\",\"PeriodicalId\":232625,\"journal\":{\"name\":\"J. Data Intell.\",\"volume\":\"153 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"J. Data Intell.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.26421/jdi3.4-5\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"J. Data Intell.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.26421/jdi3.4-5","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

摘要

网络健康相关文章中的虚假信息令人担忧，这在当前新冠肺炎疫情中得到了充分证明。机器学习和自然语言处理领域的最新进展可以用来帮助人们在在线健康文章领域区分虚假信息和真实信息。虽然多年来在这一领域取得了实质性进展，但这一领域的研究主要集中在政治新闻领域。健康假新闻与政治背景下的假新闻明显不同，因为健康信息应根据最新和可靠的医疗资源(如学术资料库)进行评估。然而，这种方法的挑战之一是检索相关资源。在这项工作中，我们制定了两种技术，用于从学术知识库中检索最相关的权威和可靠的医学内容，这些内容可用于评估在线健康文章的准确性。第一种技术是一种无监督的方法，从在线健康文章中提取的索赔中生成查询。我们提出了一个三步的方法，并说明我们的方法能够生成有效的查询，可以用于从医学知识库中检索信息。第二种方法涉及用于提取索赔的最相关信息的过滤方法。我们展示了如何在最先进的变压器模型的帮助下实现这一点，并说明了它比其他方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Improved Methods to Aid Unsupervised Evidence-Based Fact Checking for Online Heath News

False information in the domain of online health related articles is of great concern, which has been witnessed abundantly in the current pandemic situation of Covid-19. Recent advancements in the field of Machine Learning and Natural Language Processing can be leveraged to aid people in distinguishing false information from the truth in the domain of online health articles. Whilst there has been substantial progress in this space over the years, research in this area has mainly focused on the sphere of political news. Health fake news is markedly different from fake news in the political context as health information should be evaluated against the most recent and reliable medical resources such as scholarly repositories. However, one of the challenges with such an approach is the retrieval of the pertinent resources. In this work, we formulate two techniques for the retrieval of the most relevant authoritative and reliable medical content from scholarly repositories which can be used to assess veracity of an online health article. The first technique is an unsupervised method of generating queries from claims which are extracted from an online health article. We propose a three-step approach for it and illustrate that our method is able to generate effective queries which can be used for retrieval of information from medical knowledge databases. The second method involves a filtering approach for extracting the most relevant information for the claims. We show how this can be achieved with the help of state of the art transformer models and illustrate it's effectiveness over other methods.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

J. Data Intell.

自引率

0.00%

发文量