Assessing the Reliability and Validity of Principles for Health-Related Information on Social Media (PRHISM) for Evaluating Breast Cancer Treatment Videos on YouTube: Instrument Validation Study.
{"title":"Assessing the Reliability and Validity of Principles for Health-Related Information on Social Media (PRHISM) for Evaluating Breast Cancer Treatment Videos on YouTube: Instrument Validation Study.","authors":"Hiroki Kusama, Yoshimitsu Takahashi, Shunichiro Orihara, Kayo Adachi, Yumiko Ishizuka, Ryoko Semba, Hidetaka Shima, Yoshiya Horimoto, Hiroshi Kaise, Masataka Taguri, Sho Inoue, Takeo Nakayama, Takashi Ishikawa","doi":"10.2196/66416","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>There is breast cancer-related medical information on social media, but there is no established method for objectively evaluating the quality of this information. Principles for Health-Related Information on Social Media (PRHISM) is a newly developed tool for objectively assessing the quality of health-related information on social media; however, there have been no reports evaluating its reliability and validity.</p><p><strong>Objective: </strong>The purpose of this study was to statistically examine the reliability and validity of PRHISM using videos about breast cancer treatment on YouTube (Google).</p><p><strong>Methods: </strong>In total, 60 YouTube videos were selected on January 5, 2024, with the Japanese words for \"breast cancer,\" \"treatment,\" and \"chemotherapy,\" and assessed by 6 Japanese physicians with expertise in breast cancer. These evaluators independently evaluated the videos using PRHISM and an established tool for assessing the quality of health-related information, DISCERN, as well as through subjective assessments. We calculated interrater and intrarater agreement among evaluators with CIs, measuring agreement using weighted Cohen kappa.</p><p><strong>Results: </strong>The interrater agreement for PRHISM overall quality was κ=0.52 (90% CI 0.49-0.55), indicating that the expected level of agreement, statistically defined by the lower limit of the 90% CI exceeding 0.53, was not achieved. However, PRHISM demonstrated higher agreement compared with DISCERN overall quality, which had a κ=0.45 (90% CI 0.41-0.48). In terms of validity, the intrarater agreement between PRHISM and subjective assessments by breast experts was κ=0.37 (95% CI 0.14-0.60), while DISCERN showed an agreement of κ=0.27 (95% CI 0.07-0.48), indicating fair agreement and no significant difference in validity.</p><p><strong>Conclusions: </strong>PRHISM has demonstrated sufficient reliability and validity for evaluating the quality of health-related information on YouTube, making it a promising new metric. To further enhance objectivity, it is necessary to explore the use of artificial intelligence and other approaches.</p>","PeriodicalId":73554,"journal":{"name":"JMIR infodemiology","volume":"5 ","pages":"e66416"},"PeriodicalIF":3.5000,"publicationDate":"2025-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12175871/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMIR infodemiology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2196/66416","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0
Abstract
Background: There is breast cancer-related medical information on social media, but there is no established method for objectively evaluating the quality of this information. Principles for Health-Related Information on Social Media (PRHISM) is a newly developed tool for objectively assessing the quality of health-related information on social media; however, there have been no reports evaluating its reliability and validity.
Objective: The purpose of this study was to statistically examine the reliability and validity of PRHISM using videos about breast cancer treatment on YouTube (Google).
Methods: In total, 60 YouTube videos were selected on January 5, 2024, with the Japanese words for "breast cancer," "treatment," and "chemotherapy," and assessed by 6 Japanese physicians with expertise in breast cancer. These evaluators independently evaluated the videos using PRHISM and an established tool for assessing the quality of health-related information, DISCERN, as well as through subjective assessments. We calculated interrater and intrarater agreement among evaluators with CIs, measuring agreement using weighted Cohen kappa.
Results: The interrater agreement for PRHISM overall quality was κ=0.52 (90% CI 0.49-0.55), indicating that the expected level of agreement, statistically defined by the lower limit of the 90% CI exceeding 0.53, was not achieved. However, PRHISM demonstrated higher agreement compared with DISCERN overall quality, which had a κ=0.45 (90% CI 0.41-0.48). In terms of validity, the intrarater agreement between PRHISM and subjective assessments by breast experts was κ=0.37 (95% CI 0.14-0.60), while DISCERN showed an agreement of κ=0.27 (95% CI 0.07-0.48), indicating fair agreement and no significant difference in validity.
Conclusions: PRHISM has demonstrated sufficient reliability and validity for evaluating the quality of health-related information on YouTube, making it a promising new metric. To further enhance objectivity, it is necessary to explore the use of artificial intelligence and other approaches.
背景:社交媒体上有乳腺癌相关的医学信息,但没有既定的方法来客观评估这些信息的质量。社交媒体健康相关信息原则(PRHISM)是一项新开发的工具,用于客观评估社交媒体健康相关信息的质量;然而,目前尚无评价其信度和效度的报告。目的:本研究的目的是利用YouTube(谷歌)上有关乳腺癌治疗的视频,对PRHISM的信度和效度进行统计检验。方法:在2024年1月5日,共选择60个YouTube视频,其中包含日语单词“乳腺癌”,“治疗”和“化疗”,并由6名具有乳腺癌专业知识的日本医生进行评估。这些评估人员使用PRHISM和一个用于评估健康相关信息质量的既定工具DISCERN,以及通过主观评估对视频进行独立评估。我们计算了具有ci的评估者之间的解释者和解释者之间的一致性,使用加权Cohen kappa测量一致性。结果:PRHISM整体质量的判读一致性为κ=0.52 (90% CI 0.49-0.55),表明未达到预期的一致性水平,即90% CI超过0.53的下限。然而,与DISCERN整体质量相比,PRHISM表现出更高的一致性,κ=0.45 (90% CI 0.41-0.48)。在效度方面,PRHISM与乳腺专家主观评价的内部一致性为κ=0.37 (95% CI 0.14-0.60),而DISCERN的内部一致性为κ=0.27 (95% CI 0.07-0.48),表明一致性相当,效度无显著差异。结论:PRHISM在评价YouTube上健康相关信息质量方面具有足够的信度和效度,使其成为一个有前景的新指标。为了进一步增强客观性,有必要探索使用人工智能等方法。