GestaltMatcher Database - A global reference for facial phenotypic variability in rare human diseases.

Hellen Lesmann, Alexander Hustinx, Shahida Moosa, Hannah Klinkhammer, Elaine Marchi, Pilar Caro, Ibrahim M Abdelrazek, Jean Tori Pantel, Merle Ten Hagen, Meow-Keong Thong, Rifhan Azwani Binti Mazlan, Sok Kun Tae, Tom Kamphans, Wolfgang Meiswinkel, Jing-Mei Li, Behnam Javanmardi, Alexej Knaus, Annette Uwineza, Cordula Knopp, Tinatin Tkemaladze, Miriam Elbracht, Larissa Mattern, Rami Abou Jamra, Clara Velmans, Vincent Strehlow, Maureen Jacob, Angela Peron, Cristina Dias, Beatriz Carvalho Nunes, Thainá Vilella, Isabel Furquim Pinheiro, Chong Ae Kim, Maria Isabel Melaragno, Hannah Weiland, Sophia Kaptain, Karolina Chwiałkowska, Miroslaw Kwasniewski, Ramy Saad, Sarah Wiethoff, Himanshu Goel, Clara Tang, Anna Hau, Tahsin Stefan Barakat, Przemysław Panek, Amira Nabil, Julia Suh, Frederik Braun, Israel Gomy, Luisa Averdunk, Ekanem Ekure, Gaber Bergant, Borut Peterlin, Claudio Graziano, Nagwa Gaboon, Moisés Fiesco-Roa, Alessandro Mauro Spinelli, Nina-Maria Wilpert, Prasit Phowthongkum, Nergis Güzel, Tobias B Haack, Rana Bitar, Andreas Tzschach, Agusti Rodriguez-Palmero, Theresa Brunet, Sabine Rudnik-Schöneborn, Silvina Noemi Contreras-Capetillo, Ava Oberlack, Carole Samango-Sprouse, Teresa Sadeghin, Margaret Olaya, Konrad Platzer, Artem Borovikov, Franziska Schnabel, Lara Heuft, Vera Herrmann, Renske Oegema, Nour Elkhateeb, Sheetal Kumar, Katalin Komlosi, Khoushoua Mohamed, Silvia Kalantari, Fabio Sirchia, Antonio F Martinez-Monseny, Matthias Höller, Louiza Toutouna, Amal Mohamed, Amaia Lasa-Aranzasti, John A Sayer, Nadja Ehmke, Magdalena Danyel, Henrike Sczakiel, Sarina Schwartzmann, Felix Boschann, Max Zhao, Ronja Adam, Lara Einicke, Denise Horn, Kee Seang Chew, Choy Chen Kam, Miray Karakoyun, Ben Pode-Shakked, Aviva Eliyahu, Rachel Rock, Teresa Carrion, Odelia Chorin, Yuri A Zarate, Marcelo Martinez Conti, Mert Karakaya, Moon Ley Tung, Bharatendu Chandra, Arjan Bouman, Aime Lumaka, Naveed Wasif, Marwan Shinawi, Patrick R Blackburn, Tianyun Wang, Tim Niehues, Axel Schmidt, Regina Rita Roth, Dagmar Wieczorek, Ping Hu, Rebekah L Waikel, Suzanna E Ledgister Hanchard, Gehad Elmakkawy, Sylvia Safwat, Frédéric Ebstein, Elke Krüger, Sébastien Küry, Stéphane Bézieau, Annabelle Arlt, Eric Olinger, Felix Marbach, Dong Li, Lucie Dupuis, Roberto Mendoza-Londono, Sofia Douzgou Houge, Denisa Weis, Brian Hon-Yin Chung, Christopher C Y Mak, Hülya Kayserili, Nursel Elcioglu, Ayca Aykut, Peli Özlem Şimşek-Kiper, Nina Bögershausen, Bernd Wollnik, Heidi Beate Bentzen, Ingo Kurth, Christian Netzer, Aleksandra Jezela-Stanek, Koen Devriendt, Karen W Gripp, Martin Mücke, Alain Verloes, Christian P Schaaf, Christoffer Nellåker, Benjamin D Solomon, Markus M Nöthen, Ebtesam Abdalla, Gholson J Lyon, Peter M Krawitz, Tzung-Chien Hsieh
{"title":"GestaltMatcher Database - A global reference for facial phenotypic variability in rare human diseases.","authors":"Hellen Lesmann, Alexander Hustinx, Shahida Moosa, Hannah Klinkhammer, Elaine Marchi, Pilar Caro, Ibrahim M Abdelrazek, Jean Tori Pantel, Merle Ten Hagen, Meow-Keong Thong, Rifhan Azwani Binti Mazlan, Sok Kun Tae, Tom Kamphans, Wolfgang Meiswinkel, Jing-Mei Li, Behnam Javanmardi, Alexej Knaus, Annette Uwineza, Cordula Knopp, Tinatin Tkemaladze, Miriam Elbracht, Larissa Mattern, Rami Abou Jamra, Clara Velmans, Vincent Strehlow, Maureen Jacob, Angela Peron, Cristina Dias, Beatriz Carvalho Nunes, Thainá Vilella, Isabel Furquim Pinheiro, Chong Ae Kim, Maria Isabel Melaragno, Hannah Weiland, Sophia Kaptain, Karolina Chwiałkowska, Miroslaw Kwasniewski, Ramy Saad, Sarah Wiethoff, Himanshu Goel, Clara Tang, Anna Hau, Tahsin Stefan Barakat, Przemysław Panek, Amira Nabil, Julia Suh, Frederik Braun, Israel Gomy, Luisa Averdunk, Ekanem Ekure, Gaber Bergant, Borut Peterlin, Claudio Graziano, Nagwa Gaboon, Moisés Fiesco-Roa, Alessandro Mauro Spinelli, Nina-Maria Wilpert, Prasit Phowthongkum, Nergis Güzel, Tobias B Haack, Rana Bitar, Andreas Tzschach, Agusti Rodriguez-Palmero, Theresa Brunet, Sabine Rudnik-Schöneborn, Silvina Noemi Contreras-Capetillo, Ava Oberlack, Carole Samango-Sprouse, Teresa Sadeghin, Margaret Olaya, Konrad Platzer, Artem Borovikov, Franziska Schnabel, Lara Heuft, Vera Herrmann, Renske Oegema, Nour Elkhateeb, Sheetal Kumar, Katalin Komlosi, Khoushoua Mohamed, Silvia Kalantari, Fabio Sirchia, Antonio F Martinez-Monseny, Matthias Höller, Louiza Toutouna, Amal Mohamed, Amaia Lasa-Aranzasti, John A Sayer, Nadja Ehmke, Magdalena Danyel, Henrike Sczakiel, Sarina Schwartzmann, Felix Boschann, Max Zhao, Ronja Adam, Lara Einicke, Denise Horn, Kee Seang Chew, Choy Chen Kam, Miray Karakoyun, Ben Pode-Shakked, Aviva Eliyahu, Rachel Rock, Teresa Carrion, Odelia Chorin, Yuri A Zarate, Marcelo Martinez Conti, Mert Karakaya, Moon Ley Tung, Bharatendu Chandra, Arjan Bouman, Aime Lumaka, Naveed Wasif, Marwan Shinawi, Patrick R Blackburn, Tianyun Wang, Tim Niehues, Axel Schmidt, Regina Rita Roth, Dagmar Wieczorek, Ping Hu, Rebekah L Waikel, Suzanna E Ledgister Hanchard, Gehad Elmakkawy, Sylvia Safwat, Frédéric Ebstein, Elke Krüger, Sébastien Küry, Stéphane Bézieau, Annabelle Arlt, Eric Olinger, Felix Marbach, Dong Li, Lucie Dupuis, Roberto Mendoza-Londono, Sofia Douzgou Houge, Denisa Weis, Brian Hon-Yin Chung, Christopher C Y Mak, Hülya Kayserili, Nursel Elcioglu, Ayca Aykut, Peli Özlem Şimşek-Kiper, Nina Bögershausen, Bernd Wollnik, Heidi Beate Bentzen, Ingo Kurth, Christian Netzer, Aleksandra Jezela-Stanek, Koen Devriendt, Karen W Gripp, Martin Mücke, Alain Verloes, Christian P Schaaf, Christoffer Nellåker, Benjamin D Solomon, Markus M Nöthen, Ebtesam Abdalla, Gholson J Lyon, Peter M Krawitz, Tzung-Chien Hsieh","doi":"10.21203/rs.3.rs-4438861/v1","DOIUrl":null,"url":null,"abstract":"<p><p>The most important factor that complicates the work of dysmorphologists is the significant phenotypic variability of the human face. Next-Generation Phenotyping (NGP) tools that assist clinicians with recognizing characteristic syndromic patterns are particularly challenged when confronted with patients from populations different from their training data. To that end, we systematically analyzed the impact of genetic ancestry on facial dysmorphism. For that purpose, we established the GestaltMatcher Database (GMDB) as a reference dataset for medical images of patients with rare genetic disorders from around the world. We collected 10,980 frontal facial images - more than a quarter previously unpublished - from 8,346 patients, representing 581 rare disorders. Although the predominant ancestry is still European (67%), data from underrepresented populations have been increased considerably via global collaborations (19% Asian and 7% African). This includes previously unpublished reports for more than 40% of the African patients. The NGP analysis on this diverse dataset revealed characteristic performance differences depending on the composition of training and test sets corresponding to genetic relatedness. For clinical use of NGP, incorporating non-European patients resulted in a profound enhancement of GestaltMatcher performance. The top-5 accuracy rate increased by +11.29%. Importantly, this improvement in delineating the correct disorder from a facial portrait was achieved without decreasing the performance on European patients. By design, GMDB complies with the FAIR principles by rendering the curated medical data findable, accessible, interoperable, and reusable. This means GMDB can also serve as data for training and benchmarking. In summary, our study on facial dysmorphism on a global sample revealed a considerable cross ancestral phenotypic variability confounding NGP that should be counteracted by international efforts for increasing data diversity. GMDB will serve as a vital reference database for clinicians and a transparent training set for advancing NGP technology.</p>","PeriodicalId":94282,"journal":{"name":"Research square","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11188141/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Research square","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21203/rs.3.rs-4438861/v1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The most important factor that complicates the work of dysmorphologists is the significant phenotypic variability of the human face. Next-Generation Phenotyping (NGP) tools that assist clinicians with recognizing characteristic syndromic patterns are particularly challenged when confronted with patients from populations different from their training data. To that end, we systematically analyzed the impact of genetic ancestry on facial dysmorphism. For that purpose, we established the GestaltMatcher Database (GMDB) as a reference dataset for medical images of patients with rare genetic disorders from around the world. We collected 10,980 frontal facial images - more than a quarter previously unpublished - from 8,346 patients, representing 581 rare disorders. Although the predominant ancestry is still European (67%), data from underrepresented populations have been increased considerably via global collaborations (19% Asian and 7% African). This includes previously unpublished reports for more than 40% of the African patients. The NGP analysis on this diverse dataset revealed characteristic performance differences depending on the composition of training and test sets corresponding to genetic relatedness. For clinical use of NGP, incorporating non-European patients resulted in a profound enhancement of GestaltMatcher performance. The top-5 accuracy rate increased by +11.29%. Importantly, this improvement in delineating the correct disorder from a facial portrait was achieved without decreasing the performance on European patients. By design, GMDB complies with the FAIR principles by rendering the curated medical data findable, accessible, interoperable, and reusable. This means GMDB can also serve as data for training and benchmarking. In summary, our study on facial dysmorphism on a global sample revealed a considerable cross ancestral phenotypic variability confounding NGP that should be counteracted by international efforts for increasing data diversity. GMDB will serve as a vital reference database for clinicians and a transparent training set for advancing NGP technology.

GestaltMatcher 数据库 - 罕见人类疾病面部表型变异的全球参考资料。
使畸形学家的工作复杂化的最重要因素是人类面部的显著表型变异。下一代表型分析(NGP)工具可帮助临床医生识别特征性综合征模式,但在面对与训练数据不同人群的患者时,这些工具尤其面临挑战。为此,我们系统分析了遗传血统对面部畸形的影响。为此,我们建立了 GestaltMatcher 数据库(GMDB),作为全球罕见遗传疾病患者医学图像的参考数据集。我们收集了来自 8346 名患者的 10980 张面部正面图像,其中超过四分之一之前未曾公开发表过,代表了 581 种罕见疾病。虽然主要血统仍是欧洲人(67%),但通过全球合作,来自代表性不足人群的数据已大大增加(19% 亚洲人和 7% 非洲人)。其中包括 40% 以上的非洲患者此前未发表的报告。对这一多样化数据集进行的 NGP 分析表明,根据与遗传相关性相对应的训练集和测试集的组成不同,其性能也存在明显差异。在 NGP 的临床应用中,纳入非欧洲患者可显著提高 GestaltMatcher 的性能。前五名的准确率提高了 11.29%。重要的是,这种从面部肖像中划分出正确疾病的改进并没有降低欧裔患者的性能。在设计上,GMDB 遵循 FAIR 原则,使整理后的医疗数据可查找、可访问、可互操作和可重复使用。这意味着 GMDB 也可以作为培训和基准测试的数据。总之,我们对全球样本面部畸形的研究揭示了相当大的跨祖先表型变异性对 NGP 的干扰,国际社会应努力提高数据的多样性以抵消这种干扰。全球面部畸形数据库将成为临床医生的重要参考数据库,也是促进 NGP 技术发展的透明培训集。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信