通过深度学习自动检测冷却塔,用于军团病爆发调查:模型开发与验证研究

IF 23.8 1区 医学 Q1 MEDICAL INFORMATICS
Karen K Wong MD , Thaddeus Segura MIDS , Gunnar Mein MIDS , Jia Lu PhD , Elizabeth J Hannapel MPH , Jasen M Kunz MPH , Troy Ritter PhD , Jessica C Smith MPH , Alberto Todeschini PhD , Fred Nugen PhD , Chris Edens PhD
{"title":"通过深度学习自动检测冷却塔,用于军团病爆发调查:模型开发与验证研究","authors":"Karen K Wong MD ,&nbsp;Thaddeus Segura MIDS ,&nbsp;Gunnar Mein MIDS ,&nbsp;Jia Lu PhD ,&nbsp;Elizabeth J Hannapel MPH ,&nbsp;Jasen M Kunz MPH ,&nbsp;Troy Ritter PhD ,&nbsp;Jessica C Smith MPH ,&nbsp;Alberto Todeschini PhD ,&nbsp;Fred Nugen PhD ,&nbsp;Chris Edens PhD","doi":"10.1016/S2589-7500(24)00094-3","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><p>Cooling towers containing <em>Legionella</em> spp are a high-risk source of Legionnaires’ disease outbreaks. Manually locating cooling towers from aerial imagery during outbreak investigations requires expertise, is labour intensive, and can be prone to errors. We aimed to train a deep learning computer vision model to automatically detect cooling towers that are aerially visible.</p></div><div><h3>Methods</h3><p>Between Jan 1 and 31, 2021, we extracted satellite view images of Philadelphia (PN, USA) and New York state (NY, USA) from Google Maps and annotated cooling towers to create training datasets. We augmented training data with synthetic data and model-assisted labelling of additional cities. Using 2051 images containing 7292 cooling towers, we trained a two-stage model using YOLOv5, a model that detects objects in images, and EfficientNet-b5, a model that classifies images. We assessed the primary outcomes of sensitivity and positive predictive value (PPV) of the model against manual labelling on test datasets of 548 images, including from two cities not seen in training (Boston [MA, USA] and Athens [GA, USA]). We compared the search speed of the model with that of manual searching by four epidemiologists.</p></div><div><h3>Findings</h3><p>The model identified visible cooling towers with 95·1% sensitivity (95% CI 94·0–96·1) and a PPV of 90·1% (95% CI 90·0–90·2) in New York City and Philadelphia. In Boston, sensitivity was 91·6% (89·2–93·7) and PPV was 80·8% (80·5–81·2). In Athens, sensitivity was 86·9% (75·8–94·2) and PPV was 85·5% (84·2–86·7). For an area of New York City encompassing 45 blocks (0·26 square miles), the model searched more than 600 times faster (7·6 s; 351 potential cooling towers identified) than did human investigators (mean 83·75 min [SD 29·5]; mean 310·8 cooling towers [42·2]).</p></div><div><h3>Interpretation</h3><p>The model could be used to accelerate investigation and source control during outbreaks of Legionnaires’ disease through the identification of cooling towers from aerial imagery, potentially preventing additional disease spread. The model has already been used by public health teams for outbreak investigations and to initialise cooling tower registries, which are considered best practice for preventing and responding to outbreaks of Legionnaires’ disease.</p></div><div><h3>Funding</h3><p>None.</p></div>","PeriodicalId":48534,"journal":{"name":"Lancet Digital Health","volume":"6 7","pages":"Pages e500-e506"},"PeriodicalIF":23.8000,"publicationDate":"2024-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2589750024000943/pdfft?md5=134e7afec7443d66f0fb73e4c1e6aabb&pid=1-s2.0-S2589750024000943-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Automated cooling tower detection through deep learning for Legionnaires’ disease outbreak investigations: a model development and validation study\",\"authors\":\"Karen K Wong MD ,&nbsp;Thaddeus Segura MIDS ,&nbsp;Gunnar Mein MIDS ,&nbsp;Jia Lu PhD ,&nbsp;Elizabeth J Hannapel MPH ,&nbsp;Jasen M Kunz MPH ,&nbsp;Troy Ritter PhD ,&nbsp;Jessica C Smith MPH ,&nbsp;Alberto Todeschini PhD ,&nbsp;Fred Nugen PhD ,&nbsp;Chris Edens PhD\",\"doi\":\"10.1016/S2589-7500(24)00094-3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><p>Cooling towers containing <em>Legionella</em> spp are a high-risk source of Legionnaires’ disease outbreaks. Manually locating cooling towers from aerial imagery during outbreak investigations requires expertise, is labour intensive, and can be prone to errors. We aimed to train a deep learning computer vision model to automatically detect cooling towers that are aerially visible.</p></div><div><h3>Methods</h3><p>Between Jan 1 and 31, 2021, we extracted satellite view images of Philadelphia (PN, USA) and New York state (NY, USA) from Google Maps and annotated cooling towers to create training datasets. We augmented training data with synthetic data and model-assisted labelling of additional cities. Using 2051 images containing 7292 cooling towers, we trained a two-stage model using YOLOv5, a model that detects objects in images, and EfficientNet-b5, a model that classifies images. We assessed the primary outcomes of sensitivity and positive predictive value (PPV) of the model against manual labelling on test datasets of 548 images, including from two cities not seen in training (Boston [MA, USA] and Athens [GA, USA]). We compared the search speed of the model with that of manual searching by four epidemiologists.</p></div><div><h3>Findings</h3><p>The model identified visible cooling towers with 95·1% sensitivity (95% CI 94·0–96·1) and a PPV of 90·1% (95% CI 90·0–90·2) in New York City and Philadelphia. In Boston, sensitivity was 91·6% (89·2–93·7) and PPV was 80·8% (80·5–81·2). In Athens, sensitivity was 86·9% (75·8–94·2) and PPV was 85·5% (84·2–86·7). For an area of New York City encompassing 45 blocks (0·26 square miles), the model searched more than 600 times faster (7·6 s; 351 potential cooling towers identified) than did human investigators (mean 83·75 min [SD 29·5]; mean 310·8 cooling towers [42·2]).</p></div><div><h3>Interpretation</h3><p>The model could be used to accelerate investigation and source control during outbreaks of Legionnaires’ disease through the identification of cooling towers from aerial imagery, potentially preventing additional disease spread. The model has already been used by public health teams for outbreak investigations and to initialise cooling tower registries, which are considered best practice for preventing and responding to outbreaks of Legionnaires’ disease.</p></div><div><h3>Funding</h3><p>None.</p></div>\",\"PeriodicalId\":48534,\"journal\":{\"name\":\"Lancet Digital Health\",\"volume\":\"6 7\",\"pages\":\"Pages e500-e506\"},\"PeriodicalIF\":23.8000,\"publicationDate\":\"2024-06-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2589750024000943/pdfft?md5=134e7afec7443d66f0fb73e4c1e6aabb&pid=1-s2.0-S2589750024000943-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Lancet Digital Health\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2589750024000943\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MEDICAL INFORMATICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Lancet Digital Health","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2589750024000943","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0

摘要

背景含有军团菌的冷却塔是军团病爆发的高危来源。在疫情调查期间,从航空图像中手动定位冷却塔需要专业技术,耗费大量人力,而且容易出错。我们的目标是训练一个深度学习计算机视觉模型,以自动检测空中可见的冷却塔。方法在 2021 年 1 月 1 日至 31 日期间,我们从谷歌地图中提取了费城(PN,美国)和纽约州(NY,美国)的卫星视图图像,并标注了冷却塔,以创建训练数据集。我们使用合成数据和模型辅助标注的其他城市来扩充训练数据。我们使用包含 7292 座冷却塔的 2051 幅图像,使用 YOLOv5(一种检测图像中物体的模型)和 EfficientNet-b5 (一种对图像进行分类的模型)训练了一个两阶段模型。我们在包含 548 张图片的测试数据集上评估了该模型与人工标注相比的灵敏度和阳性预测值 (PPV),其中包括两个在训练中未曾出现过的城市(波士顿[美国马萨诸塞州]和雅典[美国佐治亚州])。在纽约市和费城,该模型识别可见冷却塔的灵敏度为 95-1%(95% CI 94-0-96-1),PPV 为 90-1%(95% CI 90-0-90-2)。在波士顿,灵敏度为 91-6%(89-2-93-7),PPV 为 80-8%(80-5-81-2)。在雅典,灵敏度为 86-9%(75-8-94-2),PPV 为 85-5%(84-2-86-7)。在纽约市 45 个街区(0-26 平方英里)的区域内,该模型的搜索速度(7-6 秒;识别出 351 个潜在冷却塔)比人类调查人员(平均 83-75 分钟 [SD 29-5];平均 310-8 个冷却塔 [42-2])快 600 多倍。该模型已被公共卫生团队用于疫情调查和冷却塔登记初始化,这被认为是预防和应对军团病爆发的最佳做法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Automated cooling tower detection through deep learning for Legionnaires’ disease outbreak investigations: a model development and validation study

Background

Cooling towers containing Legionella spp are a high-risk source of Legionnaires’ disease outbreaks. Manually locating cooling towers from aerial imagery during outbreak investigations requires expertise, is labour intensive, and can be prone to errors. We aimed to train a deep learning computer vision model to automatically detect cooling towers that are aerially visible.

Methods

Between Jan 1 and 31, 2021, we extracted satellite view images of Philadelphia (PN, USA) and New York state (NY, USA) from Google Maps and annotated cooling towers to create training datasets. We augmented training data with synthetic data and model-assisted labelling of additional cities. Using 2051 images containing 7292 cooling towers, we trained a two-stage model using YOLOv5, a model that detects objects in images, and EfficientNet-b5, a model that classifies images. We assessed the primary outcomes of sensitivity and positive predictive value (PPV) of the model against manual labelling on test datasets of 548 images, including from two cities not seen in training (Boston [MA, USA] and Athens [GA, USA]). We compared the search speed of the model with that of manual searching by four epidemiologists.

Findings

The model identified visible cooling towers with 95·1% sensitivity (95% CI 94·0–96·1) and a PPV of 90·1% (95% CI 90·0–90·2) in New York City and Philadelphia. In Boston, sensitivity was 91·6% (89·2–93·7) and PPV was 80·8% (80·5–81·2). In Athens, sensitivity was 86·9% (75·8–94·2) and PPV was 85·5% (84·2–86·7). For an area of New York City encompassing 45 blocks (0·26 square miles), the model searched more than 600 times faster (7·6 s; 351 potential cooling towers identified) than did human investigators (mean 83·75 min [SD 29·5]; mean 310·8 cooling towers [42·2]).

Interpretation

The model could be used to accelerate investigation and source control during outbreaks of Legionnaires’ disease through the identification of cooling towers from aerial imagery, potentially preventing additional disease spread. The model has already been used by public health teams for outbreak investigations and to initialise cooling tower registries, which are considered best practice for preventing and responding to outbreaks of Legionnaires’ disease.

Funding

None.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
41.20
自引率
1.60%
发文量
232
审稿时长
13 weeks
期刊介绍: The Lancet Digital Health publishes important, innovative, and practice-changing research on any topic connected with digital technology in clinical medicine, public health, and global health. The journal’s open access content crosses subject boundaries, building bridges between health professionals and researchers.By bringing together the most important advances in this multidisciplinary field,The Lancet Digital Health is the most prominent publishing venue in digital health. We publish a range of content types including Articles,Review, Comment, and Correspondence, contributing to promoting digital technologies in health practice worldwide.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信