{"title":"Fractal dimensionality of a coiled helical coil.","authors":"Subhash Kak","doi":"10.1142/S0219720025710015","DOIUrl":"https://doi.org/10.1142/S0219720025710015","url":null,"abstract":"<p><p>The helical coil is ubiquitous in biological and natural systems, and it is often the basic form in complex structures. This paper considers the question of its dimensionality, <i>D</i>, in biological information as the helical coil goes through recursive coiling as in DNA and RNA molecules in chromatin, in which the <i>D</i>-value is a function of the lengthening of the curve. It is shown that the dimensionality of coiled coils is virtually equal to <i>e</i>. Of the three forms of DNA, the dimensionality of the B-form is nearest to the optimal value, and this might be the reason why it is most common.</p>","PeriodicalId":48910,"journal":{"name":"Journal of Bioinformatics and Computational Biology","volume":" ","pages":"2571001"},"PeriodicalIF":0.9,"publicationDate":"2025-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144286929","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Computational modeling and dynamical analysis for B. subtilis competence genic regulation circuit with multiple time delays and external noisy regulation.","authors":"Na Zhao, Haihong Liu, Fang Yan","doi":"10.1142/S0219720025500052","DOIUrl":"https://doi.org/10.1142/S0219720025500052","url":null,"abstract":"<p><p>Bacillus subtilis (B. subtilis), a bacterium known to enter a competent state spontaneously, has garnered significant attention due to its intricate internal regulatory mechanisms. This study proposes a six-dimensional continuous delay differential equation (DDE) model incorporating two-time delays and a stochastic model that accounts for noise, aimed at delving deeper into the dynamic behaviors of the B. subtilis competence gene regulation circuit. Our investigation reveals that time delays play a crucial role in inducing oscillatory behavior within the continuous DDE model. Analyzing the dynamics of multiple time delays proves to be more intricate than studying a single delay. Furthermore, certain parameter adjustments significantly influence the system's dynamic characteristics. The introduction of noise also triggers oscillations, with the irregular oscillation patterns closely aligning with real-world observations. Intriguingly, the effects of parameters and noise regulation undergo significant changes when time delays are jointly considered. This analysis offers a fresh perspective on understanding B. subtilis competence and provides essential theoretical support for subsequent experimental endeavors in this domain of biomathematics.</p>","PeriodicalId":48910,"journal":{"name":"Journal of Bioinformatics and Computational Biology","volume":"23 2","pages":"2550005"},"PeriodicalIF":0.9,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144267689","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Valentina A Grushina, Ivan S Yevshin, Oleg A Gusev, Fedor A Kolpakov, Olga I Stanishevskaya, Elena S Fedorova, Natalia A Zinovieva, Sergey S Pintus
{"title":"Prediction and annotation of alternative transcription starts and promoter shift in the chicken genome.","authors":"Valentina A Grushina, Ivan S Yevshin, Oleg A Gusev, Fedor A Kolpakov, Olga I Stanishevskaya, Elena S Fedorova, Natalia A Zinovieva, Sergey S Pintus","doi":"10.1142/S0219720025500040","DOIUrl":"https://doi.org/10.1142/S0219720025500040","url":null,"abstract":"<p><p>Promoter shifting, characterized by alterations in Transcription Start Site (TSS) coordinates, is a well-documented phenomenon. The impact and statistical significance of promoter shifting can be assessed through analysis of Cap Analysis of Gene Expression (CAGE) data. This phenomenon is associated with developmental stage transitions, tissue differentiation, and cellular responses to environmental stimuli. Differential promoter usage suggests nonconstitutive expression of the regulated gene, indicative of focused promoter utilization. Conversely, housekeeping genes are typically characterized by stable expression levels driven by multiple dispersed promoters and are commonly expressed across a wide range of tissues. However, our findings demonstrate that many ubiquitously expressed genes utilize single, focused promoters and undergo significant promoter shifting, adding a layer of complexity to the definition of a housekeeping gene. Differential gene expression is commonly used to study gene responses to external stimuli in cells and tissues. Here, we employ an alternative approach based on differential promoter usage, identifying genes exhibiting significant promoter shifting as signatures of tissue response and phenotypic effects. Our results suggest that variations in chicken growth rate are regulated by nutrient metabolism rates, mediated through differential promoter usage of relevant genes.</p>","PeriodicalId":48910,"journal":{"name":"Journal of Bioinformatics and Computational Biology","volume":"23 2","pages":"2550004"},"PeriodicalIF":0.9,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144267691","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Analysis of clonal evolution in cancer: A computational perspective.","authors":"Paulo Henrique Ribeiro, Adenilso Simao","doi":"10.1142/S0219720025310018","DOIUrl":"https://doi.org/10.1142/S0219720025310018","url":null,"abstract":"<p><p>Cancer is a complex disease that progresses through Darwinian evolution in cells with genetic mutations, leading to the development of multiple distinct cell populations within tumors, a process known as clonal evolution. While computational methods aid in the analysis of clonal evolution in cancer samples using genetic sequencing data, accurately identifying the clonal structure of tumor samples remains one of the biggest challenges in Cancer Genomics. Several computational methods for analyzing clonal evolution in cancer have been developed in recent years. However, the algorithms of these computational methods are complex and often described at a high level of abstraction. This paper provides a detailed explanation of some computational methods for clonal evolution analysis from a computational perspective, aiding in understanding their mechanisms. Additionally, some methods have been implemented on an online platform, enabling researchers to easily run and analyze the algorithms, as well as adapt these methods to their specific needs.</p>","PeriodicalId":48910,"journal":{"name":"Journal of Bioinformatics and Computational Biology","volume":"23 2","pages":"2531001"},"PeriodicalIF":0.9,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144267688","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"M<sup>3</sup>-20M: A large-scale multi-modal molecule dataset for AI-driven drug design and discovery.","authors":"Siyuan Guo, Lexuan Wang, Chang Jin, Jinxian Wang, Han Peng, Huayang Shi, Wengen Li, Jihong Guan, Shuigeng Zhou","doi":"10.1142/S0219720025500064","DOIUrl":"https://doi.org/10.1142/S0219720025500064","url":null,"abstract":"<p><p>This paper introduces M<sup>3</sup>-20M, a large-scale <i>Multi-Modal Molecule</i> dataset that contains over <i>20 million</i> molecules, with the data mainly being integrated from existing databases and partially generated by large language models. Designed to support AI-driven drug design and discovery, M<sup>3</sup>-20M is 71 times more in the number of molecules than the largest existing dataset, providing an unprecedented scale that can highly benefit the training or fine-tuning of models, including large language models for drug design and discovery tasks. This dataset integrates one-dimensional SMILES, two-dimensional molecular graphs, three-dimensional molecular structures, physicochemical properties, and textual descriptions collected through web crawling and generated using GPT-3.5, offering a comprehensive view of each molecule. To demonstrate the power of M<sup>3</sup>-20M in drug design and discovery, we conduct extensive experiments on two key tasks: molecule generation and molecular property prediction, using large language models including GLM4, GPT-3.5, GPT-4, and Llama3-8b. Our experimental results show that M<sup>3</sup>-20M can significantly boost model performance in both tasks. Specifically, it enables the models to generate more diverse and valid molecular structures and achieve higher property prediction accuracy than existing single-modal datasets, which validates the value and potential of M<sup>3</sup>-20M in supporting AI-driven drug design and discovery. The dataset is available at https://github.com/bz99bz/M-3.</p>","PeriodicalId":48910,"journal":{"name":"Journal of Bioinformatics and Computational Biology","volume":"23 2","pages":"2550006"},"PeriodicalIF":0.9,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144267690","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Michael Khristichenko, Yuri Nechepurenko, Dmitry Grebennikov, Gennady Bocharov
{"title":"Computation and analysis of stationary and periodic solutions of the COVID-19 infection dynamics model.","authors":"Michael Khristichenko, Yuri Nechepurenko, Dmitry Grebennikov, Gennady Bocharov","doi":"10.1142/S0219720025400013","DOIUrl":"10.1142/S0219720025400013","url":null,"abstract":"<p><p>In this work, we search for the conditions for the occurrence of long COVID using the recently developed COVID-19 disease dynamics model which is a system of delay differential equations. To do so, we search for stable stationary or periodic solutions of this model with low viral load that can be interpreted as long COVID using our recently developed technology for analysing time-delay systems. The results of the bifurcation and sensitivity analysis of the mathematical model of SARS-CoV-2 infection suggest the following biological conclusions concerning the mechanisms of pathogenesis of long COVID-19. First, the possibility of SARS-CoV-2 persistence requires a 3-time reduction of the virus production rate per infected cell, or 18-times increase of the antibody-mediated elimination rate of free viruses as compared to an acute infection baseline estimates. Second, the loss of kinetic coordination between virus-induced type I IFN, antibody, and cytotoxic T lymphocyte (CTL) responses can result in the development of mild severity long-lasting infection. Third, the low-level persistent SARS-CoV-2 infection is robust to up to 100-fold perturbations (increase) in viral load and most sensitive to parameters of the humoral immune response.</p>","PeriodicalId":48910,"journal":{"name":"Journal of Bioinformatics and Computational Biology","volume":"23 1","pages":"2540001"},"PeriodicalIF":0.9,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143765517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Boon How Low, Kaushal Krishna Kaliskar, Stefano Perna, Bernett Lee
{"title":"Cross-cellular analysis of chromatin accessibility markers H3K4me3 and DNase in the context of detecting cell-identity genes: An \"all-or-nothing\" approach.","authors":"Boon How Low, Kaushal Krishna Kaliskar, Stefano Perna, Bernett Lee","doi":"10.1142/S0219720025400025","DOIUrl":"10.1142/S0219720025400025","url":null,"abstract":"<p><p>Cell identity is often associated to a subset of highly-expressed genes that define the cell processes, as opposed to essential genes that are always active. Cell-specific genes may be defined in opposition to essential genes, or via experimental means. Detection of said cell-specific genes is often a primary goal in the study of novel biosamples. Chromatin accessibility markers (such as DNase and H3K4me3) help identify actively transcribed genes, but data can be difficult to come by for entirely novel biosamples. In this study, we investigate the possibility of associating the cell-specificity status of genes with chromatin accessibility markers from different cell lines, and we suggest that the number of cell lines in which a gene is found to be marked by DNase/H3K4me3 is predictive of the essentiality status itself. We define a measure called the Cross-cellular Chromatin Openness (CCO) level, and show that it is associated with the essentiality status using two differentiation experiments. We then compare the CCO-level predictive power to existing scRNA-Seq and bulk RNA-Seq methods, showing it has good concordance when applicable.</p>","PeriodicalId":48910,"journal":{"name":"Journal of Bioinformatics and Computational Biology","volume":"23 1","pages":"2540002"},"PeriodicalIF":0.9,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143765527","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"SS-DTI: A deep learning method integrating semantic and structural information for drug-target interaction prediction.","authors":"Yujie Chun, Huaihu Li, Shunfang Wang","doi":"10.1142/S0219720025500027","DOIUrl":"10.1142/S0219720025500027","url":null,"abstract":"<p><p>Drug-target interaction (DTI) prediction is pivotal in drug discovery and repurposing, providing a more efficient alternative to traditional wet-lab experiments by saving time and resources and expediting the identification of potential targets. Current DTI methods predominantly focus on extracting semantic features from drug and protein sequences or utilizing structural information, often neglecting the integration of both. This gap hinders the achievement of a comprehensive representation of drug and protein molecules. To address this, we propose SS-DTI, a novel end-to-end deep learning approach that integrates both semantic and structural information. Our method features a multi-scale semantic feature extraction block to capture local and global information from sequences and employs Graph Convolutional Networks (GCNs) to learn structural features. Evaluations on four benchmark datasets demonstrate that SS-DTI outperforms state-of-the-art methods, showcasing its superior predictive performance. Our code is available at https://github.com/RobinChun/SS-DTI.</p>","PeriodicalId":48910,"journal":{"name":"Journal of Bioinformatics and Computational Biology","volume":" ","pages":"2550002"},"PeriodicalIF":0.9,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143711799","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Drug repurposing for non-small cell lung cancer by predicting drug response using pathway-level graph convolutional network.","authors":"I T Anjusha, K A Abdul Nazeer, N Saleena","doi":"10.1142/S0219720025500015","DOIUrl":"10.1142/S0219720025500015","url":null,"abstract":"<p><p>Drug repurposing is the process of identifying new clinical indications for an existing drug. Some of the recent studies utilized drug response prediction models to identify drugs that can be repurposed. By representing cell-line features as a pathway-pathway interaction network, we can better understand the connections between cellular processes and drug response mechanisms. Existing deep learning models for drug response prediction do not integrate known biological pathway-pathway interactions into the model. This paper presents a drug response prediction model that applies a graph convolution operation on a pathway-pathway interaction network to represent features of cancer cell-lines effectively. The model is used to identify potential drug repurposing candidates for Non-Small Cell Lung Cancer (NSCLC). Experiment results show that the inclusion of graph convolutional model applied on a pathway-pathway interaction network makes the proposed model more effective in predicting drug response than the state-of-the-art methods. Specifically, the model has shown better performance in terms of Root Mean Squared Error, Coefficient of Determination, and Pearson's Correlation Coefficient when applied to the GDSC1000 dataset. Also, most of the drugs that the model predicted as top candidates for NSCLC treatment are either undergoing clinical studies or have some evidence in the PubMed literature database.</p>","PeriodicalId":48910,"journal":{"name":"Journal of Bioinformatics and Computational Biology","volume":" ","pages":"2550001"},"PeriodicalIF":0.9,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143711795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
K Soni Sharmila, Thanga Revathi S, Pokkuluri Kiran Sree
{"title":"DDINet: Drug-drug interaction prediction network based on multi-molecular fingerprint features and multi-head attention centered weighted autoencoder.","authors":"K Soni Sharmila, Thanga Revathi S, Pokkuluri Kiran Sree","doi":"10.1142/S0219720025500039","DOIUrl":"10.1142/S0219720025500039","url":null,"abstract":"<p><p>Drug-drug interactions (DDIs) pose a major concern in polypharmacy due to their potential to cause unexpected side effects that can adversely affect a patient's health. Therefore, it is crucial to identify DDIs effectively during the early stages of drug discovery and development. In this paper, a novel DDI prediction network (DDINet) is proposed to enhance the predictive performance over conventional DDI methods. Leveraging the DrugBank dataset, drugs are represented using the Simplified Molecular Input Line-Entry System (SMILES), with the RDKit software pre-processing the SMILES strings into their canonical forms. Multiple molecular fingerprinting techniques such as Extended Connectivity Fingerprints (ECFPs), Molecular ACCess System keys (MACCSkeys), PubChem Fingerprints, 3D molecular fingerprints (3D-FP), and molecular dynamics fingerprints (MDFPs) are employed to encode drug chemical structures into feature vectors. Drug similarities are computed using the Tanimoto coefficient (TC), and the final Structural Similarity Profile (SSP) is obtained by averaging the five molecular fingerprint types. The novelty of the approach lies in the integration of a Multi-head Attention centered Weighted Autoencoder (Mul_WAE) as the interaction prediction module, which leverages the Multi-head Attention (MHA) layer to focus on the most significant input features. Furthermore, we introduce the Upgraded Bald Eagle Search Optimization (UBesO) algorithm, which optimally selects the learnable parameters of the Mul_WAE based on cross-entropy loss, improving the model's convergence and performance. The proposed DDINet model achieves an accuracy of 99.77%, 99.66% of AUC, 99.5% average precision, 99.4% precision, and 99.49% recall, providing a comprehensive evaluation of the model's robustness. Beyond high accuracy, DDINet offers advantages in scalability, making it well suited for handling large datasets due to its efficient feature extraction and optimization processes. The unique combination of multiple molecular fingerprinting methods with the MHA layer and UBesO algorithm highlights the innovative aspects of our model and significantly improves prediction performance compared to existing approaches.</p>","PeriodicalId":48910,"journal":{"name":"Journal of Bioinformatics and Computational Biology","volume":"23 1","pages":"2550003"},"PeriodicalIF":0.9,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143765530","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}