The Feasibility of Using Classification and Identification Techniques to Auto-Assess the Quality of Health Information on the Web

P. Chang, F. Huang, Min-Ling Lai
{"title":"The Feasibility of Using Classification and Identification Techniques to Auto-Assess the Quality of Health Information on the Web","authors":"P. Chang, F. Huang, Min-Ling Lai","doi":"10.4258/JKSMI.2009.15.3.247","DOIUrl":null,"url":null,"abstract":"Objective: An automatic detection tool was created for examining health-related webpage quality we went further by examining its feasibility and performance. Methods: We developed an automatic detection system to auto-assess the authorship quality indicator of an health-related information webpage for governmental websites in Taiwan. The system was integrated with the Chinese word segmentation system developed by the Academia Sinica in Taiwan and the SVM light , which serve as an SVM (Support Vector Machine) Classifiers and a method of information extraction and identification. The system was coded in Visual Basic 6.0, using SQL 2000. Results: We developed the first Chinese automatic webpage classification and information identifier to evaluate the quality of web information. The sensitivity and specificity of the classifier on the training set of webpages were both as high as 100% and only one health webpage in the test set was misclassified, due to the fact that it contained both health and non-health information content. The sensitivity of our authorship identifier is 75.3% ,with a specificity of 87.9%. Conclusion: The technical feasibility of auto-assessment for the quality of health information on the web is acceptable. Although it is not sufficient to assure the total quality of web contents, it is good enough to be used to support the entire quality assurance program. (Journal of Korean Society of Medical Informatics 15-3, 247-254, 2009)","PeriodicalId":255087,"journal":{"name":"Journal of Korean Society of Medical Informatics","volume":"96 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Korean Society of Medical Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4258/JKSMI.2009.15.3.247","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Objective: An automatic detection tool was created for examining health-related webpage quality we went further by examining its feasibility and performance. Methods: We developed an automatic detection system to auto-assess the authorship quality indicator of an health-related information webpage for governmental websites in Taiwan. The system was integrated with the Chinese word segmentation system developed by the Academia Sinica in Taiwan and the SVM light , which serve as an SVM (Support Vector Machine) Classifiers and a method of information extraction and identification. The system was coded in Visual Basic 6.0, using SQL 2000. Results: We developed the first Chinese automatic webpage classification and information identifier to evaluate the quality of web information. The sensitivity and specificity of the classifier on the training set of webpages were both as high as 100% and only one health webpage in the test set was misclassified, due to the fact that it contained both health and non-health information content. The sensitivity of our authorship identifier is 75.3% ,with a specificity of 87.9%. Conclusion: The technical feasibility of auto-assessment for the quality of health information on the web is acceptable. Although it is not sufficient to assure the total quality of web contents, it is good enough to be used to support the entire quality assurance program. (Journal of Korean Society of Medical Informatics 15-3, 247-254, 2009)
利用分类与识别技术自动评估网路健康资讯品质的可行性
目的:建立一种健康网页质量自动检测工具,并对其可行性和性能进行了进一步的研究。方法:开发一套自动检测系统,自动评估台湾政府网站健康资讯网页的作者质量指标。该系统与台湾中央研究院开发的中文分词系统和支持向量机集成,作为支持向量机(SVM)分类器和一种信息提取和识别方法。本系统是用Visual Basic 6.0编写的,使用SQL 2000。结果:开发了首个中文网页自动分类和信息标识符,用于评价网页信息质量。分类器在网页训练集上的灵敏度和特异性均高达100%,测试集中只有一个健康网页被误分类,这是由于该网页同时包含健康和非健康信息内容。作者标识符的敏感性为75.3%,特异性为87.9%。结论:网络健康信息质量自动评估的技术可行性是可以接受的。虽然它不足以保证网络内容的总质量,但它足以用于支持整个质量保证计划。(韩国医学信息学会杂志15- 3,247 -254,2009)
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信