Methods for Inclusive Underwriting of Breast Cancer Risk with Machine Learning and Innovative Algorithms.

Q3 Medicine
Manuel Plisson, Antoine Moll, Valentine Sarrazin, Denis Charles, Thibault Antoine, Razvan Ionescu, Odile Koehren, Eric Raymond
{"title":"Methods for Inclusive Underwriting of Breast Cancer Risk with Machine Learning and Innovative Algorithms.","authors":"Manuel Plisson,&nbsp;Antoine Moll,&nbsp;Valentine Sarrazin,&nbsp;Denis Charles,&nbsp;Thibault Antoine,&nbsp;Razvan Ionescu,&nbsp;Odile Koehren,&nbsp;Eric Raymond","doi":"10.17849/insm-50-1-36-48.1","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>-Due to early detection and improved therapies, the prevalence of long-term breast cancer survivors is increasing. This has increased the need for more inclusive underwriting in individuals with a history of breast cancer. Herein, we developed a method using algorithm aiming facilitating the underwriting of multiple parameters in breast cancer survivors.</p><p><strong>Methods: </strong>-Variables and data were extracted from the SEER database and analyzed using 4 different machine learning based algorithms (Logistic Regression, GA2M, Random Forest, and XGBoost) that were compared with Kaplan Meier survival estimates. The performances of these algorithms have been compared with multiple metrics (Log Loss, AUC, and SMR). In situ (non-invasive) and metastatic breast cancer were excluded from this analysis.</p><p><strong>Results: </strong>-Parameters included the pathological subtype, pTNM staging (T: tumor size, N; number of nodes; M presence or absence of metastases), Scarff-Bloom-Richardson grading, the expression of estrogen and progesterone hormone receptors were selected to predict the individual outcome at any time point from diagnosis. While all models had identical performance in terms of statistical metrics (AUC, Log Loss, and SMR), the logistic regression was the one and only model that respects all business constraints and was intelligible for medical and underwriting users.</p><p><strong>Conclusion: </strong>-This study provides insight to develop algorithms to set underwriter-friendly calculators for more accurate risk estimations that can be used to rationalize insurance pricing for breast cancer survivors. This study supports the development of a more inclusive underwriting based on models that can encompass the heterogeneity of several malignancies such as breast cancer.</p>","PeriodicalId":39345,"journal":{"name":"Journal of insurance medicine (New York, N.Y.)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of insurance medicine (New York, N.Y.)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17849/insm-50-1-36-48.1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0

Abstract

Introduction: -Due to early detection and improved therapies, the prevalence of long-term breast cancer survivors is increasing. This has increased the need for more inclusive underwriting in individuals with a history of breast cancer. Herein, we developed a method using algorithm aiming facilitating the underwriting of multiple parameters in breast cancer survivors.

Methods: -Variables and data were extracted from the SEER database and analyzed using 4 different machine learning based algorithms (Logistic Regression, GA2M, Random Forest, and XGBoost) that were compared with Kaplan Meier survival estimates. The performances of these algorithms have been compared with multiple metrics (Log Loss, AUC, and SMR). In situ (non-invasive) and metastatic breast cancer were excluded from this analysis.

Results: -Parameters included the pathological subtype, pTNM staging (T: tumor size, N; number of nodes; M presence or absence of metastases), Scarff-Bloom-Richardson grading, the expression of estrogen and progesterone hormone receptors were selected to predict the individual outcome at any time point from diagnosis. While all models had identical performance in terms of statistical metrics (AUC, Log Loss, and SMR), the logistic regression was the one and only model that respects all business constraints and was intelligible for medical and underwriting users.

Conclusion: -This study provides insight to develop algorithms to set underwriter-friendly calculators for more accurate risk estimations that can be used to rationalize insurance pricing for breast cancer survivors. This study supports the development of a more inclusive underwriting based on models that can encompass the heterogeneity of several malignancies such as breast cancer.

癌症风险的机器学习和创新算法包容性承保方法。
简介:由于早期发现和改进的治疗方法,癌症长期幸存者的患病率正在增加。这增加了对有癌症病史的个人进行更具包容性承保的需求。在此,我们开发了一种使用算法的方法,旨在促进癌症幸存者的多个参数承保。方法:从SEER数据库中提取变量和数据,并使用4种不同的基于机器学习的算法(Logistic回归、GA2M、随机森林和XGBoost)进行分析,这些算法与Kaplan-Meier生存估计进行比较。将这些算法的性能与多种指标(对数损失、AUC和SMR)进行了比较。原位(非侵入性)和转移性癌症乳腺癌被排除在本分析之外。结果:-参数包括病理亚型、pTNM分期(T:肿瘤大小,N;淋巴结数量;M是否存在转移)、Scarff-Bloom-Richardson分级、雌激素和孕激素受体的表达,以预测诊断后任何时间点的个体结果。虽然所有模型在统计指标(AUC、对数损失和SMR)方面都具有相同的性能,但逻辑回归是唯一一个尊重所有业务约束的模型,并且对医疗和承保用户来说是可理解的。结论:本研究为开发算法提供了见解,以设置更准确的风险估计的保险商友好型计算器,可用于合理化癌症幸存者的保险定价。这项研究支持基于模型开发更具包容性的承保,该模型可以涵盖癌症等几种恶性肿瘤的异质性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
0.50
自引率
0.00%
发文量
6
期刊介绍: The Journal of Insurance Medicine is a peer reviewed scientific journal sponsored by the American Academy of Insurance Medicine, and is published quarterly. Subscriptions to the Journal of Insurance Medicine are included in your AAIM membership.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信