Recursive Feature Elimination-based Biomarker Identification for Open Neural Tube Defects.

IF 1.8 4区 生物学 Q4 BIOCHEMISTRY & MOLECULAR BIOLOGY
Kadhir Velu Karthik, Aruna Rajalingam, Mallaiah Shivashankar, Anjali Ganjiwale
{"title":"Recursive Feature Elimination-based Biomarker Identification for Open Neural Tube Defects.","authors":"Kadhir Velu Karthik,&nbsp;Aruna Rajalingam,&nbsp;Mallaiah Shivashankar,&nbsp;Anjali Ganjiwale","doi":"10.2174/1389202923666220511162038","DOIUrl":null,"url":null,"abstract":"<p><p><b><i>Background</i>:</b> Open spina bifida (myelomeningocele) is the result of the failure of spinal cord closing completely and is the second most common and severe birth defect. Open neural tube defects are multifactorial, and the exact molecular mechanism of the pathogenesis is not clear due to disease complexity for which prenatal treatment options remain limited worldwide. Artificial intelligence techniques like machine learning tools have been increasingly used in precision diagnosis. <b><i>Objective</i>:</b> The primary objective of this study is to identify key genes for open neural tube defects using a machine learning approach that provides additional information about myelomeningocele in order to obtain a more accurate diagnosis. <b><i>Materials and Methods</i>:</b> Our study reports differential gene expression analysis from multiple datasets (GSE4182 and GSE101141) of amniotic fluid samples with open neural tube defects. The sample outliers in the datasets were detected using principal component analysis (PCA). We report a combination of the differential gene expression analysis with recursive feature elimination (RFE), a machine learning approach to get 4 key genes for open neural tube defects. The features selected were validated using five binary classifiers for diseased and healthy samples: Logistic Regression (LR), Decision tree classifier (DT), Support Vector Machine (SVM), Random Forest classifier (RF), and K-nearest neighbour (KNN) with 5-fold cross-validation. <b><i>Results</i>:</b> Growth Associated Protein 43 (GAP43), Glial fibrillary acidic protein (GFAP), Repetin (RPTN), and CD44 are the important genes identified in the study. These genes are known to be involved in axon growth, astrocyte differentiation in the central nervous system, post-traumatic brain repair, neuroinflammation, and inflammation-linked neuronal injuries. These key genes represent a promising tool for further studies in the diagnosis and early detection of open neural tube defects. <b><i>Conclusion</i>:</b> These key biomarkers help in the diagnosis and early detection of open neural tube defects, thus evaluating the progress and seriousness in diseases condition. This study strengthens previous literature sources of confirming these biomarkers linked with open NTD's. Thus, among other prenatal treatment options present until now, these biomarkers help in the early detection of open neural tube defects, which provides success in both treatment and prevention of these defects in the advanced stage.</p>","PeriodicalId":10803,"journal":{"name":"Current Genomics","volume":null,"pages":null},"PeriodicalIF":1.8000,"publicationDate":"2022-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/66/4f/CG-23-195.PMC9878829.pdf","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current Genomics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.2174/1389202923666220511162038","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 1

Abstract

Background: Open spina bifida (myelomeningocele) is the result of the failure of spinal cord closing completely and is the second most common and severe birth defect. Open neural tube defects are multifactorial, and the exact molecular mechanism of the pathogenesis is not clear due to disease complexity for which prenatal treatment options remain limited worldwide. Artificial intelligence techniques like machine learning tools have been increasingly used in precision diagnosis. Objective: The primary objective of this study is to identify key genes for open neural tube defects using a machine learning approach that provides additional information about myelomeningocele in order to obtain a more accurate diagnosis. Materials and Methods: Our study reports differential gene expression analysis from multiple datasets (GSE4182 and GSE101141) of amniotic fluid samples with open neural tube defects. The sample outliers in the datasets were detected using principal component analysis (PCA). We report a combination of the differential gene expression analysis with recursive feature elimination (RFE), a machine learning approach to get 4 key genes for open neural tube defects. The features selected were validated using five binary classifiers for diseased and healthy samples: Logistic Regression (LR), Decision tree classifier (DT), Support Vector Machine (SVM), Random Forest classifier (RF), and K-nearest neighbour (KNN) with 5-fold cross-validation. Results: Growth Associated Protein 43 (GAP43), Glial fibrillary acidic protein (GFAP), Repetin (RPTN), and CD44 are the important genes identified in the study. These genes are known to be involved in axon growth, astrocyte differentiation in the central nervous system, post-traumatic brain repair, neuroinflammation, and inflammation-linked neuronal injuries. These key genes represent a promising tool for further studies in the diagnosis and early detection of open neural tube defects. Conclusion: These key biomarkers help in the diagnosis and early detection of open neural tube defects, thus evaluating the progress and seriousness in diseases condition. This study strengthens previous literature sources of confirming these biomarkers linked with open NTD's. Thus, among other prenatal treatment options present until now, these biomarkers help in the early detection of open neural tube defects, which provides success in both treatment and prevention of these defects in the advanced stage.

Abstract Image

Abstract Image

Abstract Image

基于递归特征消除的开放性神经管缺陷生物标志物识别。
背景:开放性脊柱裂(脊髓脊膜膨出)是脊髓不能完全闭合的结果,是第二常见和严重的出生缺陷。开放式神经管缺陷是多因素的,由于疾病的复杂性,其发病机制的确切分子机制尚不清楚,全球产前治疗选择仍然有限。像机器学习工具这样的人工智能技术已经越来越多地用于精确诊断。目的:本研究的主要目的是使用机器学习方法识别开放神经管缺陷的关键基因,该方法提供了关于髓膜膨出的额外信息,以便获得更准确的诊断。材料和方法:本研究报道了对开放性神经管缺陷羊水样本的多个数据集(GSE4182和GSE101141)的差异基因表达分析。使用主成分分析(PCA)检测数据集中的样本异常值。我们报告了将差异基因表达分析与递归特征消除(RFE)相结合的方法,一种机器学习方法,以获得开放神经管缺陷的4个关键基因。选择的特征使用五种二元分类器对患病和健康样本进行验证:逻辑回归(LR),决策树分类器(DT),支持向量机(SVM),随机森林分类器(RF)和k -近邻(KNN)进行5次交叉验证。结果:生长相关蛋白43 (Growth Associated Protein 43, GAP43)、胶质纤维酸性蛋白(Glial fibrillary acid Protein, GFAP)、重复蛋白(Repetin, RPTN)和CD44是研究中发现的重要基因。已知这些基因参与轴突生长、中枢神经系统星形胶质细胞分化、创伤后脑修复、神经炎症和炎症相关的神经元损伤。这些关键基因为进一步研究开放神经管缺陷的诊断和早期检测提供了有希望的工具。结论:这些关键生物标志物有助于开放性神经管缺陷的早期诊断和发现,从而评估疾病的进展和严重程度。这项研究加强了先前的文献来源,证实了这些与开放性NTD相关的生物标志物。因此,到目前为止,在其他产前治疗方案中,这些生物标志物有助于早期发现开放神经管缺陷,这在治疗和预防这些晚期缺陷方面都取得了成功。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Current Genomics
Current Genomics 生物-生化与分子生物学
CiteScore
5.20
自引率
0.00%
发文量
29
审稿时长
>0 weeks
期刊介绍: Current Genomics is a peer-reviewed journal that provides essential reading about the latest and most important developments in genome science and related fields of research. Systems biology, systems modeling, machine learning, network inference, bioinformatics, computational biology, epigenetics, single cell genomics, extracellular vesicles, quantitative biology, and synthetic biology for the study of evolution, development, maintenance, aging and that of human health, human diseases, clinical genomics and precision medicine are topics of particular interest. The journal covers plant genomics. The journal will not consider articles dealing with breeding and livestock. Current Genomics publishes three types of articles including: i) Research papers from internationally-recognized experts reporting on new and original data generated at the genome scale level. Position papers dealing with new or challenging methodological approaches, whether experimental or mathematical, are greatly welcome in this section. ii) Authoritative and comprehensive full-length or mini reviews from widely recognized experts, covering the latest developments in genome science and related fields of research such as systems biology, statistics and machine learning, quantitative biology, and precision medicine. Proposals for mini-hot topics (2-3 review papers) and full hot topics (6-8 review papers) guest edited by internationally-recognized experts are welcome in this section. Hot topic proposals should not contain original data and they should contain articles originating from at least 2 different countries. iii) Opinion papers from internationally recognized experts addressing contemporary questions and issues in the field of genome science and systems biology and basic and clinical research practices.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信