Open-Source Periorbital Segmentation Dataset for Ophthalmic Applications

IF 3.2 Q1 OPHTHALMOLOGY

Ophthalmology science Pub Date : 2025-03-05 DOI:10.1016/j.xops.2025.100757

George R. Nahass BA , Emma Koehler BS , Nicholas Tomaras BS , Danny Lopez BS , Madison Cheung BS , Alexander Palacios BA , Jeffrey C. Peterson MD, PhD , Sasha Hubschman MD , Kelsey Green BS , Chad A. Purnell MD , Pete Setabutr MD , Ann Q. Tran MD , Darvin Yi PhD

{"title":"Open-Source Periorbital Segmentation Dataset for Ophthalmic Applications","authors":"George R. Nahass BA , Emma Koehler BS , Nicholas Tomaras BS , Danny Lopez BS , Madison Cheung BS , Alexander Palacios BA , Jeffrey C. Peterson MD, PhD , Sasha Hubschman MD , Kelsey Green BS , Chad A. Purnell MD , Pete Setabutr MD , Ann Q. Tran MD , Darvin Yi PhD","doi":"10.1016/j.xops.2025.100757","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><div>We aimed to create and validate a dataset for oculoplastic segmentation and periorbital distance prediction.</div></div><div><h3>Design</h3><div>This was an experimental study.</div></div><div><h3>Subjects</h3><div>Images of faces from 2 open-source datasets were included in this study.</div></div><div><h3>Methods</h3><div>The images were sourced from 2 open-source datasets and cropped to include only the eyes. All images had the iris, sclera, lid, caruncle, and brow segmented by 5 trained annotators. Intergrader reliability analysis was done by having 5 annotators annotate the same 100 images randomly selected after at least a 2-week forgetting period. Intragrader analysis was done by having 5 annotators annotate the same 20 images after a 2-week forgetting period. Three DeepLabV3 segmentation models were trained for segmentation using the datasets following standard procedures.</div></div><div><h3>Main Outcome Measures</h3><div>The quality of the annotations was evaluated by Dice score through intragrader and intergrader experiments. Segmentation models were trained to demonstrate the dataset's utility for deep learning. The Dice score was used to evaluate deep learning models.</div></div><div><h3>Results</h3><div>We annotated 2842 images. Agreement between annotators (intergrader) on a randomly selected subset of 100 images was very high, with an average Dice score of 0.82 ± 0.01. Intragrader analysis also demonstrates that the same grader accurately reproduces annotations with an average Dice score, across all classes, of 0.81 ± 0.08. The average Dice score across all classes of a segmentation network trained on the Chicago Facial dataset, the CelebAMask-HQ dataset, and both combined was 0.90 ± 0.11, 0.81 ± 0.20, and 0.84 ± 0.18, respectively.</div></div><div><h3>Conclusions</h3><div>We have developed a first-of-its-kind dataset for use in oculoplastic and craniofacial segmentation tasks. All the annotations are publicly available for free download. Having access to segmentation datasets designed specifically for oculoplastic surgery will permit more rapid development of clinically useful segmentation networks that can be leveraged for periorbital distance prediction and other downstream tasks. In addition to the annotations, we also provide an open-source toolkit for periorbital distance prediction from segmentation masks, which are available via an application programming interface. The weights of all models have also been open-sourced and are publicly available for use by the community.</div></div><div><h3>Financial Disclosure(s)</h3><div>Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.</div></div>","PeriodicalId":74363,"journal":{"name":"Ophthalmology science","volume":"5 4","pages":"Article 100757"},"PeriodicalIF":3.2000,"publicationDate":"2025-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ophthalmology science","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666914525000557","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OPHTHALMOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Objective

We aimed to create and validate a dataset for oculoplastic segmentation and periorbital distance prediction.

Design

This was an experimental study.

Subjects

Images of faces from 2 open-source datasets were included in this study.

Methods

The images were sourced from 2 open-source datasets and cropped to include only the eyes. All images had the iris, sclera, lid, caruncle, and brow segmented by 5 trained annotators. Intergrader reliability analysis was done by having 5 annotators annotate the same 100 images randomly selected after at least a 2-week forgetting period. Intragrader analysis was done by having 5 annotators annotate the same 20 images after a 2-week forgetting period. Three DeepLabV3 segmentation models were trained for segmentation using the datasets following standard procedures.

Main Outcome Measures

The quality of the annotations was evaluated by Dice score through intragrader and intergrader experiments. Segmentation models were trained to demonstrate the dataset's utility for deep learning. The Dice score was used to evaluate deep learning models.

Results

We annotated 2842 images. Agreement between annotators (intergrader) on a randomly selected subset of 100 images was very high, with an average Dice score of 0.82 ± 0.01. Intragrader analysis also demonstrates that the same grader accurately reproduces annotations with an average Dice score, across all classes, of 0.81 ± 0.08. The average Dice score across all classes of a segmentation network trained on the Chicago Facial dataset, the CelebAMask-HQ dataset, and both combined was 0.90 ± 0.11, 0.81 ± 0.20, and 0.84 ± 0.18, respectively.

Conclusions

We have developed a first-of-its-kind dataset for use in oculoplastic and craniofacial segmentation tasks. All the annotations are publicly available for free download. Having access to segmentation datasets designed specifically for oculoplastic surgery will permit more rapid development of clinically useful segmentation networks that can be leveraged for periorbital distance prediction and other downstream tasks. In addition to the annotations, we also provide an open-source toolkit for periorbital distance prediction from segmentation masks, which are available via an application programming interface. The weights of all models have also been open-sourced and are publicly available for use by the community.

Financial Disclosure(s)

Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.

查看原文本刊更多论文

开源眼科眶周分割数据集

目的建立和验证眼整形分割和眶周距离预测数据集。这是一项实验性研究。本研究采用了来自2个开源数据集的人脸图像。方法图像来源于2个开源数据集，裁剪后只包括眼睛。所有的图像都有虹膜、巩膜、眼睑、腕部和眉毛被5个训练有素的注释者分割。通过在至少2周的遗忘期后随机选择5名注释者对相同的100幅图像进行注释，进行综合可靠性分析。在两周的遗忘期后，让5名注释者对同样的20幅图像进行注释，从而完成内部分析。按照标准程序使用数据集训练三个DeepLabV3分割模型进行分割。主要观察指标通过内部和内部实验，用Dice评分来评价标注的质量。对分割模型进行了训练，以展示数据集对深度学习的实用性。Dice分数用于评估深度学习模型。结果共注释2842张图像。在随机选择的100张图像子集上，注释者（整合者）之间的一致性非常高，平均Dice得分为0.82±0.01。Intragrader分析还表明，同一评分器准确地再现了所有类别的平均Dice分数为0.81±0.08的注释。在芝加哥面部数据集、CelebAMask-HQ数据集和两者组合上训练的分割网络的所有类别的平均Dice得分分别为0.90±0.11、0.81±0.20和0.84±0.18。我们开发了一个首个用于眼整形和颅面分割任务的数据集。所有的注释都是公开的，可以免费下载。获得专为眼科整形手术设计的分割数据集将允许临床有用的分割网络更快地发展，这些网络可以用于眶周距离预测和其他下游任务。除了注释之外，我们还提供了一个开源工具包，用于从分割掩码预测轨道周围距离，该工具包可通过应用程序编程接口获得。所有模型的权重也都是开源的，可供社区公开使用。财务披露专有或商业披露可在本文末尾的脚注和披露中找到。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊