{"title":"Prediction of Drug-Target Interactions Using BERT for Protein Sequences and Drug Compound","authors":"Essmily Simon, Sanjay S Bankapur","doi":"10.1109/COMSNETS59351.2024.10427536","DOIUrl":null,"url":null,"abstract":"The current drug development crucially depends on identifying potential relationships between medicines and targets. However, anticipating such relationships is difficult due to the limits of current computational techniques. Hence, the use of deep learning is essential for identifying potential therapeutic drug compounds and providing support throughout the entire drug development process. This study discusses the deep learning technique of using bidirectional encoder representations from the Transformers (BERT) model which helped to build representations using protein and drug SMILES (Simplified Molecular Input Line Entry System) dataset to enhance DTI prediction. We used the pretrained Protein BERT model and ChemBERT for protein sequences and drug SMILES data respectively for feature extraction and resulting features are concatenated together and fed into a random forest (RF) for classification. BERT model helps to use protein and drug datasets for feature extraction without using the descriptor dataset for finding the interaction between drugs and proteins. .","PeriodicalId":518748,"journal":{"name":"2024 16th International Conference on COMmunication Systems & NETworkS (COMSNETS)","volume":"7 1","pages":"436-438"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2024 16th International Conference on COMmunication Systems & NETworkS (COMSNETS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/COMSNETS59351.2024.10427536","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The current drug development crucially depends on identifying potential relationships between medicines and targets. However, anticipating such relationships is difficult due to the limits of current computational techniques. Hence, the use of deep learning is essential for identifying potential therapeutic drug compounds and providing support throughout the entire drug development process. This study discusses the deep learning technique of using bidirectional encoder representations from the Transformers (BERT) model which helped to build representations using protein and drug SMILES (Simplified Molecular Input Line Entry System) dataset to enhance DTI prediction. We used the pretrained Protein BERT model and ChemBERT for protein sequences and drug SMILES data respectively for feature extraction and resulting features are concatenated together and fed into a random forest (RF) for classification. BERT model helps to use protein and drug datasets for feature extraction without using the descriptor dataset for finding the interaction between drugs and proteins. .