HARISS:小样本参考区间的直方图分析仪,一个免费的Web应用程序来计算小样本的参考区间。

IF 1.1 4区 农林科学 Q3 VETERINARY SCIENCES
Kevin Le Boedec
{"title":"HARISS:小样本参考区间的直方图分析仪,一个免费的Web应用程序来计算小样本的参考区间。","authors":"Kevin Le Boedec","doi":"10.1111/vcp.70033","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Reference interval (RI) estimate inaccuracy is problematic at small sample sizes. Visual assessment of distribution histograms (VADH) may improve statistical technique selection, but its performance depends on the human operator.</p><p><strong>Objectives: </strong>This study aimed to develop a machine learning model for small-sample VADH.</p><p><strong>Methods: </strong>The training set consisted of 45 000 distribution histograms from samples ranging from 20 to 40 individuals extracted from simulated Gaussian, lognormal, and left-skewed populations. A convolutional neural network (CNN) was trained to predict the original population distribution by VADH. Its accuracy was tested on 900 human-classified histograms from a previous study (ranging from 20 to 60 individuals) and compared to the Shapiro-Wilk test performance in determining the original population distribution. A web application was developed to allow usage of the CNN, 95% RI estimation with 90% confidence intervals (CI) via bootstrapping, and outlier detection.</p><p><strong>Results: </strong>The CNN model properly predicted the original population distribution by VADH in 84.0% (95% CI: 83.7-84.4) and 94.4% (95% CI: 92.7-95.8) of the samples of the training and the test sets, respectively. Comparatively, on the test set, the Shapiro-Wilk test accuracy was 65.0% (95% CI: 61.8-68.1) and 72.3% (95% CI: 69.3-75.2) using a p-value threshold of 0.05 and 0.2, respectively. The web application (named HARISS) was successfully deployed and is accessible at: https://hariss.streamlit.app/.</p><p><strong>Conclusions: </strong>The CNN model demonstrated effective VADH and might enhance RI estimate accuracy through the use of the HARISS web app, though proper reference individual selection and preanalytical/analytical conditions remain paramount.</p>","PeriodicalId":23593,"journal":{"name":"Veterinary clinical pathology","volume":" ","pages":""},"PeriodicalIF":1.1000,"publicationDate":"2025-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"HARISS: Histogram Analyzer for Reference Intervals of Small Samples, a Free Web App to Calculate Reference Intervals of Small Samples.\",\"authors\":\"Kevin Le Boedec\",\"doi\":\"10.1111/vcp.70033\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Reference interval (RI) estimate inaccuracy is problematic at small sample sizes. Visual assessment of distribution histograms (VADH) may improve statistical technique selection, but its performance depends on the human operator.</p><p><strong>Objectives: </strong>This study aimed to develop a machine learning model for small-sample VADH.</p><p><strong>Methods: </strong>The training set consisted of 45 000 distribution histograms from samples ranging from 20 to 40 individuals extracted from simulated Gaussian, lognormal, and left-skewed populations. A convolutional neural network (CNN) was trained to predict the original population distribution by VADH. Its accuracy was tested on 900 human-classified histograms from a previous study (ranging from 20 to 60 individuals) and compared to the Shapiro-Wilk test performance in determining the original population distribution. A web application was developed to allow usage of the CNN, 95% RI estimation with 90% confidence intervals (CI) via bootstrapping, and outlier detection.</p><p><strong>Results: </strong>The CNN model properly predicted the original population distribution by VADH in 84.0% (95% CI: 83.7-84.4) and 94.4% (95% CI: 92.7-95.8) of the samples of the training and the test sets, respectively. Comparatively, on the test set, the Shapiro-Wilk test accuracy was 65.0% (95% CI: 61.8-68.1) and 72.3% (95% CI: 69.3-75.2) using a p-value threshold of 0.05 and 0.2, respectively. The web application (named HARISS) was successfully deployed and is accessible at: https://hariss.streamlit.app/.</p><p><strong>Conclusions: </strong>The CNN model demonstrated effective VADH and might enhance RI estimate accuracy through the use of the HARISS web app, though proper reference individual selection and preanalytical/analytical conditions remain paramount.</p>\",\"PeriodicalId\":23593,\"journal\":{\"name\":\"Veterinary clinical pathology\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":1.1000,\"publicationDate\":\"2025-07-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Veterinary clinical pathology\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://doi.org/10.1111/vcp.70033\",\"RegionNum\":4,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"VETERINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Veterinary clinical pathology","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1111/vcp.70033","RegionNum":4,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"VETERINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

背景:在小样本量下,参考区间(RI)估计不准确是有问题的。分布直方图的视觉评估(VADH)可以改善统计技术的选择,但其性能取决于人工操作者。目的:本研究旨在建立小样本VADH的机器学习模型。方法:训练集包括从模拟高斯、对数正态和左偏总体中提取的20 - 40个样本的45,000个分布直方图。通过VADH训练卷积神经网络(CNN)来预测原始种群分布。它的准确性在之前的一项研究(从20到60个个体)的900个人类分类直方图上进行了测试,并与夏皮罗-威尔克测试在确定原始种群分布方面的表现进行了比较。开发了一个web应用程序,允许使用CNN, 95% RI估计和90%置信区间(CI),通过自举和离群值检测。结果:CNN模型在训练集样本和测试集样本中分别有84.0% (95% CI: 83.7 ~ 84.4)和94.4% (95% CI: 92.7 ~ 95.8)能正确预测原始总体分布。相比之下,在测试集上,Shapiro-Wilk测试准确率分别为65.0% (95% CI: 61.8-68.1)和72.3% (95% CI: 69.3-75.2), p值阈值分别为0.05和0.2。web应用程序(名为HARISS)已成功部署,并可访问:https://hariss.streamlit.app/.Conclusions: CNN模型展示了有效的VADH,并可能通过使用HARISS web应用程序提高RI估计的准确性,尽管适当的参考个体选择和分析前/分析条件仍然是至关重要的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
HARISS: Histogram Analyzer for Reference Intervals of Small Samples, a Free Web App to Calculate Reference Intervals of Small Samples.

Background: Reference interval (RI) estimate inaccuracy is problematic at small sample sizes. Visual assessment of distribution histograms (VADH) may improve statistical technique selection, but its performance depends on the human operator.

Objectives: This study aimed to develop a machine learning model for small-sample VADH.

Methods: The training set consisted of 45 000 distribution histograms from samples ranging from 20 to 40 individuals extracted from simulated Gaussian, lognormal, and left-skewed populations. A convolutional neural network (CNN) was trained to predict the original population distribution by VADH. Its accuracy was tested on 900 human-classified histograms from a previous study (ranging from 20 to 60 individuals) and compared to the Shapiro-Wilk test performance in determining the original population distribution. A web application was developed to allow usage of the CNN, 95% RI estimation with 90% confidence intervals (CI) via bootstrapping, and outlier detection.

Results: The CNN model properly predicted the original population distribution by VADH in 84.0% (95% CI: 83.7-84.4) and 94.4% (95% CI: 92.7-95.8) of the samples of the training and the test sets, respectively. Comparatively, on the test set, the Shapiro-Wilk test accuracy was 65.0% (95% CI: 61.8-68.1) and 72.3% (95% CI: 69.3-75.2) using a p-value threshold of 0.05 and 0.2, respectively. The web application (named HARISS) was successfully deployed and is accessible at: https://hariss.streamlit.app/.

Conclusions: The CNN model demonstrated effective VADH and might enhance RI estimate accuracy through the use of the HARISS web app, though proper reference individual selection and preanalytical/analytical conditions remain paramount.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Veterinary clinical pathology
Veterinary clinical pathology 农林科学-兽医学
CiteScore
1.70
自引率
16.70%
发文量
133
审稿时长
18-36 weeks
期刊介绍: Veterinary Clinical Pathology is the official journal of the American Society for Veterinary Clinical Pathology (ASVCP) and the European Society of Veterinary Clinical Pathology (ESVCP). The journal''s mission is to provide an international forum for communication and discussion of scientific investigations and new developments that advance the art and science of laboratory diagnosis in animals. Veterinary Clinical Pathology welcomes original experimental research and clinical contributions involving domestic, laboratory, avian, and wildlife species in the areas of hematology, hemostasis, immunopathology, clinical chemistry, cytopathology, surgical pathology, toxicology, endocrinology, laboratory and analytical techniques, instrumentation, quality assurance, and clinical pathology education.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信