{"title":"利用基于机器学习的特征选择识别 SARS CoV-2 感染的自身抗原标记:洞察COVIDS症状","authors":"Aruna Rajalingam, Chaitra Mallasandra Krishnappa, Shanker G, Anjali Ganjiwale","doi":"10.2174/0126667975296293240320041641","DOIUrl":null,"url":null,"abstract":"\n\nSevere acute respiratory syndrome coronavirus 2 (SARS\nCoV-2) infection has been shown to trigger autoimmunity, and the phenomenon leads to several\nchronic human diseases such as Type-1 diabetes, Crohn’s disease, vasculitis, Guillian-Barrė syndrome,\netc. The mechanism underlying SARS CoV-2-induced autoimmune response is unknown and\nis an active area of interest for the researchers.\n\n\n\nThe primary objective of this study is to identify the autoantigen markers for the classification\nof SARS CoV-2 (COVID-19 positive and negative samples) that trigger an immune response\nleading to autoimmunity using a machine learning approach that provides information to obtain a\nmore accurate diagnosis for COVID-induced diseases.\n\n\n\nOur study reports the transcriptomic profile of the COVID patient's whole\nblood samples collected from 0 to 35th day of acute infection as described in the GSE215865 dataset.\nThe binary classification algorithm from the sci-kit learn python library, namely logistic regression\nand random forest with 10-fold cross-validation, was applied to the processed data, followed by a\nselection of the 20 best gene features with recursive feature elimination from a set of 10,719 gene\nfeatures to obtain the classification accuracy of 87%.\n\n\n\nThe fidgetin, microtubule severing factor (FIGN), SH3 and cysteine-rich domain (STAC),\nCadherin-6 (CDH6), docking protein 6 (DOK6), nuclear RNA export factor 3 (NXF3) and maternally\nexpressed 3 (MEG3) are the autoantigens markers identified for classification of COVID-positive\nand negative samples.\n\n\n\nThe identified autoantigen markers from transcriptomic datasets using machine learning\ntechniques provide a deeper understanding of COVID-induced diseases and may play an important\nrole as potential diagnostic and drug targets for COVID-19.\n","PeriodicalId":10815,"journal":{"name":"Coronaviruses","volume":" March","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Identification of Autoantigen Markers for SARS CoV-2 Infection with\\nMachine Learning-based Feature Selection: An Insight into COVID\\nSymptoms\",\"authors\":\"Aruna Rajalingam, Chaitra Mallasandra Krishnappa, Shanker G, Anjali Ganjiwale\",\"doi\":\"10.2174/0126667975296293240320041641\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n\\nSevere acute respiratory syndrome coronavirus 2 (SARS\\nCoV-2) infection has been shown to trigger autoimmunity, and the phenomenon leads to several\\nchronic human diseases such as Type-1 diabetes, Crohn’s disease, vasculitis, Guillian-Barrė syndrome,\\netc. The mechanism underlying SARS CoV-2-induced autoimmune response is unknown and\\nis an active area of interest for the researchers.\\n\\n\\n\\nThe primary objective of this study is to identify the autoantigen markers for the classification\\nof SARS CoV-2 (COVID-19 positive and negative samples) that trigger an immune response\\nleading to autoimmunity using a machine learning approach that provides information to obtain a\\nmore accurate diagnosis for COVID-induced diseases.\\n\\n\\n\\nOur study reports the transcriptomic profile of the COVID patient's whole\\nblood samples collected from 0 to 35th day of acute infection as described in the GSE215865 dataset.\\nThe binary classification algorithm from the sci-kit learn python library, namely logistic regression\\nand random forest with 10-fold cross-validation, was applied to the processed data, followed by a\\nselection of the 20 best gene features with recursive feature elimination from a set of 10,719 gene\\nfeatures to obtain the classification accuracy of 87%.\\n\\n\\n\\nThe fidgetin, microtubule severing factor (FIGN), SH3 and cysteine-rich domain (STAC),\\nCadherin-6 (CDH6), docking protein 6 (DOK6), nuclear RNA export factor 3 (NXF3) and maternally\\nexpressed 3 (MEG3) are the autoantigens markers identified for classification of COVID-positive\\nand negative samples.\\n\\n\\n\\nThe identified autoantigen markers from transcriptomic datasets using machine learning\\ntechniques provide a deeper understanding of COVID-induced diseases and may play an important\\nrole as potential diagnostic and drug targets for COVID-19.\\n\",\"PeriodicalId\":10815,\"journal\":{\"name\":\"Coronaviruses\",\"volume\":\" March\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-03-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Coronaviruses\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2174/0126667975296293240320041641\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Coronaviruses","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2174/0126667975296293240320041641","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Identification of Autoantigen Markers for SARS CoV-2 Infection with
Machine Learning-based Feature Selection: An Insight into COVID
Symptoms
Severe acute respiratory syndrome coronavirus 2 (SARS
CoV-2) infection has been shown to trigger autoimmunity, and the phenomenon leads to several
chronic human diseases such as Type-1 diabetes, Crohn’s disease, vasculitis, Guillian-Barrė syndrome,
etc. The mechanism underlying SARS CoV-2-induced autoimmune response is unknown and
is an active area of interest for the researchers.
The primary objective of this study is to identify the autoantigen markers for the classification
of SARS CoV-2 (COVID-19 positive and negative samples) that trigger an immune response
leading to autoimmunity using a machine learning approach that provides information to obtain a
more accurate diagnosis for COVID-induced diseases.
Our study reports the transcriptomic profile of the COVID patient's whole
blood samples collected from 0 to 35th day of acute infection as described in the GSE215865 dataset.
The binary classification algorithm from the sci-kit learn python library, namely logistic regression
and random forest with 10-fold cross-validation, was applied to the processed data, followed by a
selection of the 20 best gene features with recursive feature elimination from a set of 10,719 gene
features to obtain the classification accuracy of 87%.
The fidgetin, microtubule severing factor (FIGN), SH3 and cysteine-rich domain (STAC),
Cadherin-6 (CDH6), docking protein 6 (DOK6), nuclear RNA export factor 3 (NXF3) and maternally
expressed 3 (MEG3) are the autoantigens markers identified for classification of COVID-positive
and negative samples.
The identified autoantigen markers from transcriptomic datasets using machine learning
techniques provide a deeper understanding of COVID-induced diseases and may play an important
role as potential diagnostic and drug targets for COVID-19.