{"title":"HARISS:小样本参考区间的直方图分析仪,一个免费的Web应用程序来计算小样本的参考区间。","authors":"Kevin Le Boedec","doi":"10.1111/vcp.70033","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Reference interval (RI) estimate inaccuracy is problematic at small sample sizes. Visual assessment of distribution histograms (VADH) may improve statistical technique selection, but its performance depends on the human operator.</p><p><strong>Objectives: </strong>This study aimed to develop a machine learning model for small-sample VADH.</p><p><strong>Methods: </strong>The training set consisted of 45 000 distribution histograms from samples ranging from 20 to 40 individuals extracted from simulated Gaussian, lognormal, and left-skewed populations. A convolutional neural network (CNN) was trained to predict the original population distribution by VADH. Its accuracy was tested on 900 human-classified histograms from a previous study (ranging from 20 to 60 individuals) and compared to the Shapiro-Wilk test performance in determining the original population distribution. A web application was developed to allow usage of the CNN, 95% RI estimation with 90% confidence intervals (CI) via bootstrapping, and outlier detection.</p><p><strong>Results: </strong>The CNN model properly predicted the original population distribution by VADH in 84.0% (95% CI: 83.7-84.4) and 94.4% (95% CI: 92.7-95.8) of the samples of the training and the test sets, respectively. Comparatively, on the test set, the Shapiro-Wilk test accuracy was 65.0% (95% CI: 61.8-68.1) and 72.3% (95% CI: 69.3-75.2) using a p-value threshold of 0.05 and 0.2, respectively. The web application (named HARISS) was successfully deployed and is accessible at: https://hariss.streamlit.app/.</p><p><strong>Conclusions: </strong>The CNN model demonstrated effective VADH and might enhance RI estimate accuracy through the use of the HARISS web app, though proper reference individual selection and preanalytical/analytical conditions remain paramount.</p>","PeriodicalId":23593,"journal":{"name":"Veterinary clinical pathology","volume":" ","pages":""},"PeriodicalIF":1.1000,"publicationDate":"2025-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"HARISS: Histogram Analyzer for Reference Intervals of Small Samples, a Free Web App to Calculate Reference Intervals of Small Samples.\",\"authors\":\"Kevin Le Boedec\",\"doi\":\"10.1111/vcp.70033\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Reference interval (RI) estimate inaccuracy is problematic at small sample sizes. Visual assessment of distribution histograms (VADH) may improve statistical technique selection, but its performance depends on the human operator.</p><p><strong>Objectives: </strong>This study aimed to develop a machine learning model for small-sample VADH.</p><p><strong>Methods: </strong>The training set consisted of 45 000 distribution histograms from samples ranging from 20 to 40 individuals extracted from simulated Gaussian, lognormal, and left-skewed populations. A convolutional neural network (CNN) was trained to predict the original population distribution by VADH. Its accuracy was tested on 900 human-classified histograms from a previous study (ranging from 20 to 60 individuals) and compared to the Shapiro-Wilk test performance in determining the original population distribution. A web application was developed to allow usage of the CNN, 95% RI estimation with 90% confidence intervals (CI) via bootstrapping, and outlier detection.</p><p><strong>Results: </strong>The CNN model properly predicted the original population distribution by VADH in 84.0% (95% CI: 83.7-84.4) and 94.4% (95% CI: 92.7-95.8) of the samples of the training and the test sets, respectively. Comparatively, on the test set, the Shapiro-Wilk test accuracy was 65.0% (95% CI: 61.8-68.1) and 72.3% (95% CI: 69.3-75.2) using a p-value threshold of 0.05 and 0.2, respectively. The web application (named HARISS) was successfully deployed and is accessible at: https://hariss.streamlit.app/.</p><p><strong>Conclusions: </strong>The CNN model demonstrated effective VADH and might enhance RI estimate accuracy through the use of the HARISS web app, though proper reference individual selection and preanalytical/analytical conditions remain paramount.</p>\",\"PeriodicalId\":23593,\"journal\":{\"name\":\"Veterinary clinical pathology\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":1.1000,\"publicationDate\":\"2025-07-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Veterinary clinical pathology\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://doi.org/10.1111/vcp.70033\",\"RegionNum\":4,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"VETERINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Veterinary clinical pathology","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1111/vcp.70033","RegionNum":4,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"VETERINARY SCIENCES","Score":null,"Total":0}
HARISS: Histogram Analyzer for Reference Intervals of Small Samples, a Free Web App to Calculate Reference Intervals of Small Samples.
Background: Reference interval (RI) estimate inaccuracy is problematic at small sample sizes. Visual assessment of distribution histograms (VADH) may improve statistical technique selection, but its performance depends on the human operator.
Objectives: This study aimed to develop a machine learning model for small-sample VADH.
Methods: The training set consisted of 45 000 distribution histograms from samples ranging from 20 to 40 individuals extracted from simulated Gaussian, lognormal, and left-skewed populations. A convolutional neural network (CNN) was trained to predict the original population distribution by VADH. Its accuracy was tested on 900 human-classified histograms from a previous study (ranging from 20 to 60 individuals) and compared to the Shapiro-Wilk test performance in determining the original population distribution. A web application was developed to allow usage of the CNN, 95% RI estimation with 90% confidence intervals (CI) via bootstrapping, and outlier detection.
Results: The CNN model properly predicted the original population distribution by VADH in 84.0% (95% CI: 83.7-84.4) and 94.4% (95% CI: 92.7-95.8) of the samples of the training and the test sets, respectively. Comparatively, on the test set, the Shapiro-Wilk test accuracy was 65.0% (95% CI: 61.8-68.1) and 72.3% (95% CI: 69.3-75.2) using a p-value threshold of 0.05 and 0.2, respectively. The web application (named HARISS) was successfully deployed and is accessible at: https://hariss.streamlit.app/.
Conclusions: The CNN model demonstrated effective VADH and might enhance RI estimate accuracy through the use of the HARISS web app, though proper reference individual selection and preanalytical/analytical conditions remain paramount.
期刊介绍:
Veterinary Clinical Pathology is the official journal of the American Society for Veterinary Clinical Pathology (ASVCP) and the European Society of Veterinary Clinical Pathology (ESVCP). The journal''s mission is to provide an international forum for communication and discussion of scientific investigations and new developments that advance the art and science of laboratory diagnosis in animals. Veterinary Clinical Pathology welcomes original experimental research and clinical contributions involving domestic, laboratory, avian, and wildlife species in the areas of hematology, hemostasis, immunopathology, clinical chemistry, cytopathology, surgical pathology, toxicology, endocrinology, laboratory and analytical techniques, instrumentation, quality assurance, and clinical pathology education.