Praveenkumar Kochuthakidiyel Suresh, Gnanasoundari Sekar, Kavya Mallady, Wan Suriana Wan Ab Rahman, Wan Nazatul Shima Shahidan, Gokulakannan Venkatesan
{"title":"The Identification of Potential Drugs for Dengue Hemorrhagic Fever: Network-Based Drug Reprofiling Study.","authors":"Praveenkumar Kochuthakidiyel Suresh, Gnanasoundari Sekar, Kavya Mallady, Wan Suriana Wan Ab Rahman, Wan Nazatul Shima Shahidan, Gokulakannan Venkatesan","doi":"10.2196/37306","DOIUrl":"10.2196/37306","url":null,"abstract":"<p><strong>Background: </strong>Dengue fever can progress to dengue hemorrhagic fever (DHF), a more serious and occasionally fatal form of the disease. Indicators of serious disease arise about the time the fever begins to reduce (typically 3 to 7 days following symptom onset). There are currently no effective antivirals available. Drug repurposing is an emerging drug discovery process for rapidly developing effective DHF therapies. Through network pharmacology modeling, several US Food and Drug Administration (FDA)-approved medications have already been researched for various viral outbreaks.</p><p><strong>Objective: </strong>We aimed to identify potentially repurposable drugs for DHF among existing FDA-approved drugs for viral attacks, symptoms of viral fevers, and DHF.</p><p><strong>Methods: </strong>Using target identification databases (GeneCards and DrugBank), we identified human-DHF virus interacting genes and drug targets against these genes. We determined hub genes and potential drugs with a network-based analysis. We performed functional enrichment and network analyses to identify pathways, protein-protein interactions, tissues where the gene expression was high, and disease-gene associations.</p><p><strong>Results: </strong>Analyzing virus-host interactions and therapeutic targets in the human genome network revealed 45 repurposable medicines. Hub network analysis of host-virus-drug associations suggested that aspirin, captopril, and rilonacept might efficiently treat DHF. Gene enrichment analysis supported these findings. According to a Mayo Clinic report, using aspirin in the treatment of dengue fever may increase the risk of bleeding complications, but several studies from around the world suggest that thrombosis is associated with DHF. The human interactome contains the genes prostaglandin-endoperoxide synthase 2 (PTGS2), angiotensin converting enzyme (ACE), and coagulation factor II, thrombin (F2), which have been documented to have a role in the pathogenesis of disease progression in DHF, and our analysis of most of the drugs targeting these genes showed that the hub gene module (human-virus-drug) was highly enriched in tissues associated with the immune system (P=7.29 × 10<sup>-24</sup>) and human umbilical vein endothelial cells (P=1.83 × 10<sup>-20</sup>); this group of tissues acts as an anticoagulant barrier between the vessel walls and blood. Kegg analysis showed an association with genes linked to cancer (P=1.13 × 10<sup>-14</sup>) and the advanced glycation end products-receptor for advanced glycation end products signaling pathway in diabetic complications (P=3.52 × 10<sup>-14</sup>), which indicates that DHF patients with diabetes and cancer are at risk of higher pathogenicity. Thus, gene-targeting medications may play a significant part in limiting or worsening the condition of DHF patients.</p><p><strong>Conclusions: </strong>Aspirin is not usually prescribed for dengue fever because of bleeding complications, but it ","PeriodicalId":73552,"journal":{"name":"JMIR bioinformatics and biotechnology","volume":" ","pages":"e37306"},"PeriodicalIF":0.0,"publicationDate":"2023-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11135182/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43878118","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The Differentially Expressed Genes Responsible for the Development of T Helper 9 Cells From T Helper 2 Cells in Various Disease States: Immuno-Interactomics Study.","authors":"Manoj Khokhar, Purvi Purohit, Ashita Gadwal, Sojit Tomo, Nitin Kumar Bajpai, Ravindra Shukla","doi":"10.2196/42421","DOIUrl":"10.2196/42421","url":null,"abstract":"<p><strong>Background: </strong>T helper (Th) 9 cells are a novel subset of Th cells that develop independently from Th2 cells and are characterized by the secretion of interleukin (IL)-9. Studies have suggested the involvement of Th9 cells in variable diseases such as allergic and pulmonary diseases (eg, asthma, chronic obstructive airway disease, chronic rhinosinusitis, nasal polyps, and pulmonary hypoplasia), metabolic diseases (eg, acute leukemia, myelocytic leukemia, breast cancer, lung cancer, melanoma, pancreatic cancer), neuropsychiatric disorders (eg, Alzheimer disease), autoimmune diseases (eg, Graves disease, Crohn disease, colitis, psoriasis, systemic lupus erythematosus, systemic scleroderma, rheumatoid arthritis, multiple sclerosis, inflammatory bowel disease, atopic dermatitis, eczema), and infectious diseases (eg, tuberculosis, hepatitis). However, there is a dearth of information on its involvement in other metabolic, neuropsychiatric, and infectious diseases.</p><p><strong>Objective: </strong>This study aims to identify significant differentially altered genes in the conversion of Th2 to Th9 cells, and their regulating microRNAs (miRs) from publicly available Gene Expression Omnibus data sets of the mouse model using in silico analysis to unravel various pathogenic pathways involved in disease processes.</p><p><strong>Methods: </strong>Using differentially expressed genes (DEGs) identified from 2 publicly available data sets (GSE99166 and GSE123501) we performed functional enrichment and network analyses to identify pathways, protein-protein interactions, miR-messenger RNA associations, and disease-gene associations related to significant differentially altered genes implicated in the conversion of Th2 to Th9 cells.</p><p><strong>Results: </strong>We extracted 260 common downregulated, 236 common upregulated, and 634 common DEGs from the expression profiles of data sets GSE99166 and GSE123501. Codifferentially expressed ILs, cytokines, receptors, and transcription factors (TFs) were enriched in 7 crucial Kyoto Encyclopedia of Genes and Genomes pathways and Gene Ontology. We constructed the protein-protein interaction network and predicted the top regulatory miRs involved in the Th2 to Th9 differentiation pathways. We also identified various metabolic, allergic and pulmonary, neuropsychiatric, autoimmune, and infectious diseases as well as carcinomas where the differentiation of Th2 to Th9 may play a crucial role.</p><p><strong>Conclusions: </strong>This study identified hitherto unexplored possible associations between Th9 and disease states. Some important ILs, including CCL1 (chemokine [C-C motif] ligand 1), CCL20 (chemokine [C-C motif] ligand 20), IL-13, IL-4, IL-12A, and IL-9; receptors, including IL-12RB1, IL-4RA (interleukin 9 receptor alpha), CD53 (cluster of differentiation 53), CD6 (cluster of differentiation 6), CD5 (cluster of differentiation 5), CD83 (cluster of differentiation 83), CD197 (cluster of differentiation ","PeriodicalId":73552,"journal":{"name":"JMIR bioinformatics and biotechnology","volume":" ","pages":"e42421"},"PeriodicalIF":0.0,"publicationDate":"2023-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11135241/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46910282","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ashutosh Kumar, Adil Asghar, Himanshu N Singh, Muneeb A Faiq, Sujeet Kumar, Ravi K Narayan, Gopichand Kumar, Prakhar Dwivedi, Chetan Sahni, Rakesh K Jha, Maheswari Kulandhasamy, Pranav Prasoon, Kishore Sesham, Kamla Kant, Sada N Pandey
{"title":"SARS-CoV-2 Omicron Variant Genomic Sequences and Their Epidemiological Correlates Regarding the End of the Pandemic: In Silico Analysis.","authors":"Ashutosh Kumar, Adil Asghar, Himanshu N Singh, Muneeb A Faiq, Sujeet Kumar, Ravi K Narayan, Gopichand Kumar, Prakhar Dwivedi, Chetan Sahni, Rakesh K Jha, Maheswari Kulandhasamy, Pranav Prasoon, Kishore Sesham, Kamla Kant, Sada N Pandey","doi":"10.2196/42700","DOIUrl":"10.2196/42700","url":null,"abstract":"<p><strong>Background: </strong>Emergence of the new SARS-CoV-2 variant B.1.1.529 worried health policy makers worldwide due to a large number of mutations in its genomic sequence, especially in the spike protein region. The World Health Organization (WHO) designated this variant as a global variant of concern (VOC), which was named \"Omicron.\" Following Omicron's emergence, a surge of new COVID-19 cases was reported globally, primarily in South Africa.</p><p><strong>Objective: </strong>The aim of this study was to understand whether Omicron had an epidemiological advantage over existing variants.</p><p><strong>Methods: </strong>We performed an in silico analysis of the complete genomic sequences of Omicron available on the Global Initiative on Sharing Avian Influenza Data (GISAID) database to analyze the functional impact of the mutations present in this variant on virus-host interactions in terms of viral transmissibility, virulence/lethality, and immune escape. In addition, we performed a correlation analysis of the relative proportion of the genomic sequences of specific SARS-CoV-2 variants (in the period from October 1 to November 29, 2021) with matched epidemiological data (new COVID-19 cases and deaths) from South Africa.</p><p><strong>Results: </strong>Compared with the current list of global VOCs/variants of interest (VOIs), as per the WHO, Omicron bears more sequence variation, specifically in the spike protein and host receptor-binding motif (RBM). Omicron showed the closest nucleotide and protein sequence homology with the Alpha variant for the complete sequence and the RBM. The mutations were found to be primarily condensed in the spike region (n=28-48) of the virus. Further mutational analysis showed enrichment for the mutations decreasing binding affinity to angiotensin-converting enzyme 2 receptor and receptor-binding domain protein expression, and for increasing the propensity of immune escape. An inverse correlation of Omicron with the Delta variant was noted (r=-0.99, <i>P</i><.001; 95% CI -0.99 to -0.97) in the sequences reported from South Africa postemergence of the new variant, subsequently showing a decrease. There was a steep rise in new COVID-19 cases in parallel with the increase in the proportion of Omicron isolates since the report of the first case (74%-100%). By contrast, the incidence of new deaths did not increase (r=-0.04, <i>P</i>>.05; 95% CI -0.52 to 0.58).</p><p><strong>Conclusions: </strong>In silico analysis of viral genomic sequences suggests that the Omicron variant has more remarkable immune-escape ability than existing VOCs/VOIs, including Delta, but reduced virulence/lethality than other reported variants. The higher power for immune escape for Omicron was a likely reason for the resurgence in COVID-19 cases and its rapid rise as the globally dominant strain. Being more infectious but less lethal than the existing variants, Omicron could have plausibly led to widespread unnoticed new, repeated, and vacci","PeriodicalId":73552,"journal":{"name":"JMIR bioinformatics and biotechnology","volume":"4 ","pages":"e42700"},"PeriodicalIF":0.0,"publicationDate":"2023-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9843602/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10598394","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
David Chen, Gurjit S Randhawa, Maximillian Pm Soltysiak, Camila Pe de Souza, Lila Kari, Shiva M Singh, Kathleen A Hill
{"title":"Mutational Patterns Observed in SARS-CoV-2 Genomes Sampled From Successive Epochs Delimited by Major Public Health Events in Ontario, Canada: Genomic Surveillance Study.","authors":"David Chen, Gurjit S Randhawa, Maximillian Pm Soltysiak, Camila Pe de Souza, Lila Kari, Shiva M Singh, Kathleen A Hill","doi":"10.2196/42243","DOIUrl":"10.2196/42243","url":null,"abstract":"<p><strong>Background: </strong>The emergence of SARS-CoV-2 variants with mutations associated with increased transmissibility and virulence is a public health concern in Ontario, Canada. Characterizing how the mutational patterns of the SARS-CoV-2 genome have changed over time can shed light on the driving factors, including selection for increased fitness and host immune response, that may contribute to the emergence of novel variants. Moreover, the study of SARS-CoV-2 in the microcosm of Ontario, Canada can reveal how different province-specific public health policies over time may be associated with observed mutational patterns as a model system.</p><p><strong>Objective: </strong>This study aimed to perform a comprehensive analysis of single base substitution (SBS) types, counts, and genomic locations observed in SARS-CoV-2 genomic sequences sampled in Ontario, Canada. Comparisons of mutational patterns were conducted between sequences sampled during 4 different epochs delimited by major public health events to track the evolution of the SARS-CoV-2 mutational landscape over 2 years.</p><p><strong>Methods: </strong>In total, 24,244 SARS-CoV-2 genomic sequences and associated metadata sampled in Ontario, Canada from January 1, 2020, to December 31, 2021, were retrieved from the Global Initiative on Sharing All Influenza Data database. Sequences were assigned to 4 epochs delimited by major public health events based on the sampling date. SBSs from each SARS-CoV-2 sequence were identified relative to the MN996528.1 reference genome. Catalogues of SBS types and counts were generated to estimate the impact of selection in each open reading frame, and identify mutation clusters. The estimation of mutational fitness over time was performed using the Augur pipeline.</p><p><strong>Results: </strong>The biases in SBS types and proportions observed support previous reports of host antiviral defense activity involving the SARS-CoV-2 genome. There was an increase in U>C substitutions associated with adenosine deaminase acting on RNA (ADAR) activity uniquely observed during Epoch 4. The burden of novel SBSs observed in SARS-CoV-2 genomic sequences was the greatest in Epoch 2 (median 5), followed by Epoch 3 (median 4). Clusters of SBSs were observed in the spike protein open reading frame, ORF1a, and ORF3a. The high proportion of nonsynonymous SBSs and increasing dN/dS metric (ratio of nonsynonymous to synonymous mutations in a given open reading frame) to above 1 in Epoch 4 indicate positive selection of the spike protein open reading frame.</p><p><strong>Conclusions: </strong>Quantitative analysis of the mutational patterns of the SARS-CoV-2 genome in the microcosm of Ontario, Canada within early consecutive epochs of the pandemic tracked the mutational dynamics in the context of public health events that instigate significant shifts in selection and mutagenesis. Continued genomic surveillance of emergent variants will be useful for the design of public he","PeriodicalId":73552,"journal":{"name":"JMIR bioinformatics and biotechnology","volume":" ","pages":"e42243"},"PeriodicalIF":0.0,"publicationDate":"2022-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11135226/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44573612","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Zahra Sharifiheris, Amir Rahmani, Joseph Onwuka, Miriam Bender
{"title":"The Utilization of Heart Rate Variability for Autonomic Nervous System Assessment in Healthy Pregnant Women: Systematic Review.","authors":"Zahra Sharifiheris, Amir Rahmani, Joseph Onwuka, Miriam Bender","doi":"10.2196/36791","DOIUrl":"10.2196/36791","url":null,"abstract":"<p><strong>Background: </strong>The autonomic nervous system (ANS) plays a central role in pregnancy-induced adaptations, and failure in the required adaptations is associated with adverse neonatal and maternal outcomes. Mapping maternal ANS function in healthy pregnancy may help to understand ANS function.</p><p><strong>Objective: </strong>This study aimed to systematically review studies on the use of heart rate variability (HRV) monitoring to measure ANS function during pregnancy and determine whether specific HRV patterns representing normal ANS function have been identified during pregnancy.</p><p><strong>Methods: </strong>The Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guideline was used to guide the systematic review. The CINAHL, PubMed, SCOPUS, and Web of Science databases were searched to comprehensively identify articles without a time span limitation. Studies were included if they assessed HRV in healthy pregnant individuals at least once during pregnancy or labor, with or without a comparison group (eg, complicated pregnancy). Quality assessment of the included literature was performed using the National Heart, Lung, and Blood Institute (NHLBI) tool. A narrative synthesis approach was used for data extraction and analysis, as the articles were heterogenous in scope, approaches, methods, and variables assessed, which precluded traditional meta-analysis approaches being used.</p><p><strong>Results: </strong>After full screening, 8 studies met the inclusion criteria. In 88% (7/8) of the studies, HRV was measured using electrocardiogram and operationalized in 3 different ways: linear frequency domain (FD), linear time domain (TD), and nonlinear methods. FD was measured in all (8/8), TD in 75% (6/8), and nonlinear methods in 25% (2/8) of the studies. The assessment duration varied from 5 minutes to 24 hours. TD indexes and most of the FD indexes decreased from the first to the third trimesters in the majority (5/7, 71%) of the studies. Of the FD indexes, low frequency (LF [nu]) and the LF/high frequency (HF) ratio showed an ascending trend from early to late pregnancy, indicating an increase in sympathetic activity toward the end of the pregnancy.</p><p><strong>Conclusions: </strong>We identified 3 HRV operationalization methods along with potentially indicative HRV patterns. However, we found no justification for the selection of measurement tools, measurement time frames, and operationalization methods, which threaten the generalizability and reliability of pattern findings. More research is needed to determine the criteria and methods for determining HRV patterns corresponding to ANS functioning in healthy pregnant persons.</p>","PeriodicalId":73552,"journal":{"name":"JMIR bioinformatics and biotechnology","volume":" ","pages":"e36791"},"PeriodicalIF":0.0,"publicationDate":"2022-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11135217/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45914601","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Antonio Vilasi, Vincenzo Antonio Panuccio, Salvatore Morante, Antonino Villa, Maria Carmela Versace, Sabrina Mezzatesta, Sergio Mercuri, Rosalinda Inguanta, Giuseppe Aiello, Demetrio Cutrupi, Rossella Puglisi, Salvatore Capria, Maurizio Li Vigni, Giovanni Tripepi, Claudia Torino
{"title":"Monitoring Risk Factors and Improving Adherence to Therapy in Patients With Chronic Kidney Disease (Smit-CKD Project): Pilot Observational Study.","authors":"Antonio Vilasi, Vincenzo Antonio Panuccio, Salvatore Morante, Antonino Villa, Maria Carmela Versace, Sabrina Mezzatesta, Sergio Mercuri, Rosalinda Inguanta, Giuseppe Aiello, Demetrio Cutrupi, Rossella Puglisi, Salvatore Capria, Maurizio Li Vigni, Giovanni Tripepi, Claudia Torino","doi":"10.2196/36766","DOIUrl":"10.2196/36766","url":null,"abstract":"<p><strong>Background: </strong>Chronic kidney disease is a major public health issue, with about 13% of the general adult population and 30% of the elderly affected. Patients in the last stage of this disease have an almost uniquely high risk of death and cardiovascular events, with reduced adherence to therapy representing an additional risk factor for cardiovascular morbidity and mortality. Considering the increased penetration of mobile phones, a mobile app could educate patients to autonomously monitor cardiorenal risk factors.</p><p><strong>Objective: </strong>With this background in mind, we developed an integrated system of a server and app with the aim of improving self-monitoring of cardiovascular and renal risk factors and adherence to therapy.</p><p><strong>Methods: </strong>The software infrastructure for both the Smit-CKD server and Smit-CKD app was developed using standard web-oriented development methodologies preferring open source tools when available. To make the Smit-CKD app suitable for Android and iOS, platforms that allow the development of a multiplatform app starting from a single source code were used. The integrated system was field tested with the help of 22 participants. User satisfaction and adherence to therapy were measured by questionnaires specifically designed for this study; regular use of the app was measured using the daily reports available on the platform.</p><p><strong>Results: </strong>The Smit-CKD app allows the monitoring of cardiorenal risk factors, such as blood pressure, weight, and blood glucose. Collected data are transmitted in real time to the referring general practitioner. In addition, special reminders improve adherence to the medication regimen. Via the Smit-CKD server, general practitioners can monitor the clinical status of their patients and their adherence to therapy. During the test phase, 73% (16/22) of subjects entered all the required data regularly and sent feedback on drug intake. After 6 months of use, the percentage of regular intake of medications rose from 64% (14/22) to 82% (18/22). Analysis of the evaluation questionnaires showed that both the app and server components were well accepted by the users.</p><p><strong>Conclusions: </strong>Our study demonstrated that a simple mobile app, created to self-monitor modifiable cardiorenal risk factors and adherence to therapy, is well tolerated by patients affected by chronic kidney disease. Further studies are required to clarify if the use of this integrated system will have long-term effects on therapy adherence and if self-monitoring of risk factors will improve clinical outcomes in this population.</p>","PeriodicalId":73552,"journal":{"name":"JMIR bioinformatics and biotechnology","volume":" ","pages":"e36766"},"PeriodicalIF":0.0,"publicationDate":"2022-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11135230/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42836823","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Prediction of Antibody-Antigen Binding via Machine Learning: Development of Data Sets and Evaluation of Methods.","authors":"Chao Ye, Wenxing Hu, Bruno Gaeta","doi":"10.2196/29404","DOIUrl":"10.2196/29404","url":null,"abstract":"<p><strong>Background: </strong>The mammalian immune system is able to generate antibodies against a huge variety of antigens, including bacteria, viruses, and toxins. The ultradeep DNA sequencing of rearranged immunoglobulin genes has considerable potential in furthering our understanding of the immune response, but it is limited by the lack of a high-throughput, sequence-based method for predicting the antigen(s) that a given immunoglobulin recognizes.</p><p><strong>Objective: </strong>As a step toward the prediction of antibody-antigen binding from sequence data alone, we aimed to compare a range of machine learning approaches that were applied to a collated data set of antibody-antigen pairs in order to predict antibody-antigen binding from sequence data.</p><p><strong>Methods: </strong>Data for training and testing were extracted from the Protein Data Bank and the Coronavirus Antibody Database, and additional antibody-antigen pair data were generated by using a molecular docking protocol. Several machine learning methods, including the weighted nearest neighbor method, the nearest neighbor method with the BLOSUM62 matrix, and the random forest method, were applied to the problem.</p><p><strong>Results: </strong>The final data set contained 1157 antibodies and 57 antigens that were combined in 5041 antibody-antigen pairs. The best performance for the prediction of interactions was obtained by using the nearest neighbor method with the BLOSUM62 matrix, which resulted in around 82% accuracy on the full data set. These results provide a useful frame of reference, as well as protocols and considerations, for machine learning and data set creation in the prediction of antibody-antigen binding.</p><p><strong>Conclusions: </strong>Several machine learning approaches were compared to predict antibody-antigen interaction from protein sequences. Both the data set (in CSV format) and the machine learning program (coded in Python) are freely available for download on GitHub.</p>","PeriodicalId":73552,"journal":{"name":"JMIR bioinformatics and biotechnology","volume":" ","pages":"e29404"},"PeriodicalIF":0.0,"publicationDate":"2022-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11135222/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44803964","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multiple-Inputs Convolutional Neural Network for COVID-19 Classification and Critical Region Screening From Chest X-ray Radiographs: Model Development and Performance Evaluation.","authors":"Zhongqiang Li, Zheng Li, Luke Yao, Qing Chen, Jian Zhang, Xin Li, Ji-Ming Feng, Yanping Li, Jian Xu","doi":"10.2196/36660","DOIUrl":"10.2196/36660","url":null,"abstract":"<p><strong>Background: </strong>The COVID-19 pandemic is becoming one of the largest, unprecedented health crises, and chest X-ray radiography (CXR) plays a vital role in diagnosing COVID-19. However, extracting and finding useful image features from CXRs demand a heavy workload for radiologists.</p><p><strong>Objective: </strong>The aim of this study was to design a novel multiple-inputs (MI) convolutional neural network (CNN) for the classification of COVID-19 and extraction of critical regions from CXRs. We also investigated the effect of the number of inputs on the performance of our new MI-CNN model.</p><p><strong>Methods: </strong>A total of 6205 CXR images (including 3021 COVID-19 CXRs and 3184 normal CXRs) were used to test our MI-CNN models. CXRs could be evenly segmented into different numbers (2, 4, and 16) of individual regions. Each region could individually serve as one of the MI-CNN inputs. The CNN features of these MI-CNN inputs would then be fused for COVID-19 classification. More importantly, the contributions of each CXR region could be evaluated through assessing the number of images that were accurately classified by their corresponding regions in the testing data sets.</p><p><strong>Results: </strong>In both the whole-image and left- and right-lung region of interest (LR-ROI) data sets, MI-CNNs demonstrated good efficiency for COVID-19 classification. In particular, MI-CNNs with more inputs (2-, 4-, and 16-input MI-CNNs) had better efficiency in recognizing COVID-19 CXRs than the 1-input CNN. Compared to the whole-image data sets, the efficiency of LR-ROI data sets showed approximately 4% lower accuracy, sensitivity, specificity, and precision (over 91%). In considering the contributions of each region, one of the possible reasons for this reduced performance was that nonlung regions (eg, region 16) provided false-positive contributions to COVID-19 classification. The MI-CNN with the LR-ROI data set could provide a more accurate evaluation of the contribution of each region and COVID-19 classification. Additionally, the right-lung regions had higher contributions to the classification of COVID-19 CXRs, whereas the left-lung regions had higher contributions to identifying normal CXRs.</p><p><strong>Conclusions: </strong>Overall, MI-CNNs could achieve higher accuracy with an increasing number of inputs (eg, 16-input MI-CNN). This approach could assist radiologists in identifying COVID-19 CXRs and in screening the critical regions related to COVID-19 classifications.</p>","PeriodicalId":73552,"journal":{"name":"JMIR bioinformatics and biotechnology","volume":" ","pages":"e36660"},"PeriodicalIF":0.0,"publicationDate":"2022-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9578294/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40676552","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Treatment Discontinuation Prediction in Patients With Diabetes Using a Ranking Model: Machine Learning Model Development.","authors":"Hisashi Kurasawa, Kayo Waki, Akihiro Chiba, Tomohisa Seki, Katsuyoshi Hayashi, Akinori Fujino, Tsuneyuki Haga, Takashi Noguchi, Kazuhiko Ohe","doi":"10.2196/37951","DOIUrl":"10.2196/37951","url":null,"abstract":"<p><strong>Background: </strong>Treatment discontinuation (TD) is one of the major prognostic issues in diabetes care, and several models have been proposed to predict a missed appointment that may lead to TD in patients with diabetes by using binary classification models for the early detection of TD and for providing intervention support for patients. However, as binary classification models output the probability of a missed appointment occurring within a predetermined period, they are limited in their ability to estimate the magnitude of TD risk in patients with inconsistent intervals between appointments, making it difficult to prioritize patients for whom intervention support should be provided.</p><p><strong>Objective: </strong>This study aimed to develop a machine-learned prediction model that can output a TD risk score defined by the length of time until TD and prioritize patients for intervention according to their TD risk.</p><p><strong>Methods: </strong>This model included patients with diagnostic codes indicative of diabetes at the University of Tokyo Hospital between September 3, 2012, and May 17, 2014. The model was internally validated with patients from the same hospital from May 18, 2014, to January 29, 2016. The data used in this study included 7551 patients who visited the hospital after January 1, 2004, and had diagnostic codes indicative of diabetes. In particular, data that were recorded in the electronic medical records between September 3, 2012, and January 29, 2016, were used. The main outcome was the TD of a patient, which was defined as missing a scheduled clinical appointment and having no hospital visits within 3 times the average number of days between the visits of the patient and within 60 days. The TD risk score was calculated by using the parameters derived from the machine-learned ranking model. The prediction capacity was evaluated by using test data with the C-index for the performance of ranking patients, area under the receiver operating characteristic curve, and area under the precision-recall curve for discrimination, in addition to a calibration plot.</p><p><strong>Results: </strong>The means (95% confidence limits) of the C-index, area under the receiver operating characteristic curve, and area under the precision-recall curve for the TD risk score were 0.749 (0.655, 0.823), 0.758 (0.649, 0.857), and 0.713 (0.554, 0.841), respectively. The observed and predicted probabilities were correlated with the calibration plots.</p><p><strong>Conclusions: </strong>A TD risk score was developed for patients with diabetes by combining a machine-learned method with electronic medical records. The score calculation can be integrated into medical records to identify patients at high risk of TD, which would be useful in supporting diabetes care and preventing TD.</p>","PeriodicalId":73552,"journal":{"name":"JMIR bioinformatics and biotechnology","volume":" ","pages":"e37951"},"PeriodicalIF":0.0,"publicationDate":"2022-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11135228/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46360380","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ashutosh Kumar, Adil Asghar, Prakhar Dwivedi, Gopichand Kumar, Ravi K Narayan, Rakesh K Jha, Rakesh Parashar, Chetan Sahni, Sada N Pandey
{"title":"A Bioinformatics Tool for Predicting Future COVID-19 Waves Based on a Retrospective Analysis of the Second Wave in India: Model Development Study.","authors":"Ashutosh Kumar, Adil Asghar, Prakhar Dwivedi, Gopichand Kumar, Ravi K Narayan, Rakesh K Jha, Rakesh Parashar, Chetan Sahni, Sada N Pandey","doi":"10.2196/36860","DOIUrl":"10.2196/36860","url":null,"abstract":"<p><strong>Background: </strong>Since the start of the COVID-19 pandemic, health policymakers globally have been attempting to predict an impending wave of COVID-19. India experienced a devastating second wave of COVID-19 in the late first week of May 2021. We retrospectively analyzed the viral genomic sequences and epidemiological data reflecting the emergence and spread of the second wave of COVID-19 in India to construct a prediction model.</p><p><strong>Objective: </strong>We aimed to develop a bioinformatics tool that can predict an impending COVID-19 wave.</p><p><strong>Methods: </strong>We analyzed the time series distribution of genomic sequence data for SARS-CoV-2 and correlated it with epidemiological data for new cases and deaths for the corresponding period of the second wave. In addition, we analyzed the phylodynamics of circulating SARS-CoV-2 variants in the Indian population during the study period.</p><p><strong>Results: </strong>Our prediction analysis showed that the first signs of the arrival of the second wave could be seen by the end of January 2021, about 2 months before its peak in May 2021. By the end of March 2021, it was distinct. B.1.617 lineage variants powered the wave, most notably B.1.617.2 (Delta variant).</p><p><strong>Conclusions: </strong>Based on the observations of this study, we propose that genomic surveillance of SARS-CoV-2 variants, complemented with epidemiological data, can be a promising tool to predict impending COVID-19 waves.</p>","PeriodicalId":73552,"journal":{"name":"JMIR bioinformatics and biotechnology","volume":" ","pages":"e36860"},"PeriodicalIF":0.0,"publicationDate":"2022-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9516867/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33486448","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}