Donn Liew, Akesha Dinuli Dharmatilleke, Edwin See, Ee Hou Yong
{"title":"G4STAB:基于序列和盐浓度预测g -四联体热力学稳定性的多输入深度学习模型。","authors":"Donn Liew, Akesha Dinuli Dharmatilleke, Edwin See, Ee Hou Yong","doi":"10.1093/bioinformatics/btaf545","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>G-quadruplexes (G4s) are non-canonical nucleic acid structures formed in guanine-rich regions that modulate gene regulation and genomic stability. The thermodynamic stability of G4s directly influences their biological functions and potential as therapeutic targets. However, current quantitative frameworks for predicting G4 stability rely on predetermined structural features, limiting their effectiveness for diverse G4 topologies, and fail to account for environmental factors such as ion concentration and pH that significantly modulate G4 stability in cellular contexts.</p><p><strong>Results: </strong>We present G4STAB, a multi-input deep learning neural network that accurately predicts DNA G4 melting temperatures based on sequence features, salt concentration, and pH. Trained on 2,382 diverse DNA G4 sequences, our model achieves high accuracy (R2 = 0.8) without relying on predetermined G4 structural features. G4STAB successfully captures established G4 stability determinants and proposes previously unobserved sequence-stability relationships. Analysis of 391,502 experimentally validated G4s reveals that cancer-like ionic environments alter G4 stability profiles, with a 13.5-fold increase in number of structures exhibiting physiological melting temperatures (36-42°C). These findings suggest systematic genomic patterns in G4 stability responses across chromosomes and gene types.</p><p><strong>Availability and implementation: </strong>G4STAB is available at https://github.com/donn-liew/G4STAB; G4STAB web database interface is available at https://donn-liew.github.io/g4stab-web-database/.</p><p><strong>Supplementary information: </strong>Supplementary data are available at Bioinformatics online.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":5.4000,"publicationDate":"2025-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"G4STAB: A multi-input deep learning model to predict G-quadruplex thermodynamic stability based on sequence and salt concentration.\",\"authors\":\"Donn Liew, Akesha Dinuli Dharmatilleke, Edwin See, Ee Hou Yong\",\"doi\":\"10.1093/bioinformatics/btaf545\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Motivation: </strong>G-quadruplexes (G4s) are non-canonical nucleic acid structures formed in guanine-rich regions that modulate gene regulation and genomic stability. The thermodynamic stability of G4s directly influences their biological functions and potential as therapeutic targets. However, current quantitative frameworks for predicting G4 stability rely on predetermined structural features, limiting their effectiveness for diverse G4 topologies, and fail to account for environmental factors such as ion concentration and pH that significantly modulate G4 stability in cellular contexts.</p><p><strong>Results: </strong>We present G4STAB, a multi-input deep learning neural network that accurately predicts DNA G4 melting temperatures based on sequence features, salt concentration, and pH. Trained on 2,382 diverse DNA G4 sequences, our model achieves high accuracy (R2 = 0.8) without relying on predetermined G4 structural features. G4STAB successfully captures established G4 stability determinants and proposes previously unobserved sequence-stability relationships. Analysis of 391,502 experimentally validated G4s reveals that cancer-like ionic environments alter G4 stability profiles, with a 13.5-fold increase in number of structures exhibiting physiological melting temperatures (36-42°C). These findings suggest systematic genomic patterns in G4 stability responses across chromosomes and gene types.</p><p><strong>Availability and implementation: </strong>G4STAB is available at https://github.com/donn-liew/G4STAB; G4STAB web database interface is available at https://donn-liew.github.io/g4stab-web-database/.</p><p><strong>Supplementary information: </strong>Supplementary data are available at Bioinformatics online.</p>\",\"PeriodicalId\":93899,\"journal\":{\"name\":\"Bioinformatics (Oxford, England)\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":5.4000,\"publicationDate\":\"2025-09-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Bioinformatics (Oxford, England)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1093/bioinformatics/btaf545\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics (Oxford, England)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioinformatics/btaf545","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
G4STAB: A multi-input deep learning model to predict G-quadruplex thermodynamic stability based on sequence and salt concentration.
Motivation: G-quadruplexes (G4s) are non-canonical nucleic acid structures formed in guanine-rich regions that modulate gene regulation and genomic stability. The thermodynamic stability of G4s directly influences their biological functions and potential as therapeutic targets. However, current quantitative frameworks for predicting G4 stability rely on predetermined structural features, limiting their effectiveness for diverse G4 topologies, and fail to account for environmental factors such as ion concentration and pH that significantly modulate G4 stability in cellular contexts.
Results: We present G4STAB, a multi-input deep learning neural network that accurately predicts DNA G4 melting temperatures based on sequence features, salt concentration, and pH. Trained on 2,382 diverse DNA G4 sequences, our model achieves high accuracy (R2 = 0.8) without relying on predetermined G4 structural features. G4STAB successfully captures established G4 stability determinants and proposes previously unobserved sequence-stability relationships. Analysis of 391,502 experimentally validated G4s reveals that cancer-like ionic environments alter G4 stability profiles, with a 13.5-fold increase in number of structures exhibiting physiological melting temperatures (36-42°C). These findings suggest systematic genomic patterns in G4 stability responses across chromosomes and gene types.
Availability and implementation: G4STAB is available at https://github.com/donn-liew/G4STAB; G4STAB web database interface is available at https://donn-liew.github.io/g4stab-web-database/.
Supplementary information: Supplementary data are available at Bioinformatics online.