Boyu Zhang, Daniel T Yehdego, Kyle L Johnson, Ming-Ying Leung, Michela Taufer
{"title":"Enhancement of accuracy and efficiency for RNA secondary structure prediction by sequence segmentation and MapReduce","authors":"Boyu Zhang, Daniel T Yehdego, Kyle L Johnson, Ming-Ying Leung, Michela Taufer","doi":"10.1186/1472-6807-13-S1-S3","DOIUrl":"https://doi.org/10.1186/1472-6807-13-S1-S3","url":null,"abstract":"<p>Ribonucleic acid (RNA) molecules play important roles in many biological processes including gene expression and regulation. Their secondary structures are crucial for the RNA functionality, and the prediction of the secondary structures is widely studied. Our previous research shows that cutting long sequences into shorter chunks, predicting secondary structures of the chunks independently using thermodynamic methods, and reconstructing the entire secondary structure from the predicted chunk structures can yield better accuracy than predicting the secondary structure using the RNA sequence as a whole. The chunking, prediction, and reconstruction processes can use different methods and parameters, some of which produce more accurate predictions than others. In this paper, we study the prediction accuracy and efficiency of three different chunking methods using seven popular secondary structure prediction programs that apply to two datasets of RNA with known secondary structures, which include both pseudoknotted and non-pseudoknotted sequences, as well as a family of viral genome RNAs whose structures have not been predicted before. Our modularized MapReduce framework based on Hadoop allows us to study the problem in a parallel and robust environment.</p><p>On average, the maximum accuracy retention values are larger than one for our chunking methods and the seven prediction programs over 50 non-pseudoknotted sequences, meaning that the secondary structure predicted using chunking is more similar to the real structure than the secondary structure predicted by using the whole sequence. We observe similar results for the 23 pseudoknotted sequences, except for the NUPACK program using the centered chunking method. The performance analysis for 14 long RNA sequences from the <i>Nodaviridae</i> virus family outlines how the coarse-grained mapping of chunking and predictions in the MapReduce framework exhibits shorter turnaround times for short RNA sequences. However, as the lengths of the RNA sequences increase, the fine-grained mapping can surpass the coarse-grained mapping in performance.</p><p>By using our MapReduce framework together with statistical analysis on the accuracy retention results, we observe how the inversion-based chunking methods can outperform predictions using the whole sequence. Our chunk-based approach also enables us to predict secondary structures for very long RNA sequences, which is not feasible with traditional methods alone.</p>","PeriodicalId":51240,"journal":{"name":"BMC Structural Biology","volume":"13 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2013-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1472-6807-13-S1-S3","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"4354094","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Elucidating the ensemble of functionally-relevant transitions in protein systems with a robotics-inspired method","authors":"Kevin Molloy, Amarda Shehu","doi":"10.1186/1472-6807-13-S1-S8","DOIUrl":"https://doi.org/10.1186/1472-6807-13-S1-S8","url":null,"abstract":"<p>Many proteins tune their biological function by transitioning between different functional states, effectively acting as dynamic molecular machines. Detailed structural characterization of transition trajectories is central to understanding the relationship between protein dynamics and function. Computational approaches that build on the Molecular Dynamics framework are in principle able to model transition trajectories at great detail but also at considerable computational cost. Methods that delay consideration of dynamics and focus instead on elucidating energetically-credible conformational paths connecting two functionally-relevant structures provide a complementary approach. Effective sampling-based path planning methods originating in robotics have been recently proposed to produce conformational paths. These methods largely model short peptides or address large proteins by simplifying conformational space.</p><p>We propose a robotics-inspired method that connects two given structures of a protein by sampling conformational paths. The method focuses on small- to medium-size proteins, efficiently modeling structural deformations through the use of the molecular fragment replacement technique. In particular, the method grows a tree in conformational space rooted at the start structure, steering the tree to a goal region defined around the goal structure. We investigate various bias schemes over a progress coordinate for balance between coverage of conformational space and progress towards the goal. A geometric projection layer promotes path diversity. A reactive temperature scheme allows sampling of rare paths that cross energy barriers.</p><p>Experiments are conducted on small- to medium-size proteins of length up to 214 amino acids and with multiple known functionally-relevant states, some of which are more than 13? apart of each-other. Analysis reveals that the method effectively obtains conformational paths connecting structural states that are significantly different. A detailed analysis on the depth and breadth of the tree suggests that a soft global bias over the progress coordinate enhances sampling and results in higher path diversity. The explicit geometric projection layer that biases the exploration away from over-sampled regions further increases coverage, often improving proximity to the goal by forcing the exploration to find new paths. The reactive temperature scheme is shown effective in increasing path diversity, particularly in difficult structural transitions with known high-energy barriers.</p>","PeriodicalId":51240,"journal":{"name":"BMC Structural Biology","volume":"13 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2013-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1472-6807-13-S1-S8","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"4354942","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Four-body atomic potential for modeling protein-ligand binding affinity: application to enzyme-inhibitor binding energy prediction","authors":"Majid Masso","doi":"10.1186/1472-6807-13-S1-S1","DOIUrl":"https://doi.org/10.1186/1472-6807-13-S1-S1","url":null,"abstract":"<p>Models that are capable of reliably predicting binding affinities for protein-ligand complexes play an important role the field of structure-guided drug design.</p><p>Here, we begin by applying the computational geometry technique of Delaunay tessellation to each set of atomic coordinates for over 1400 diverse macromolecular structures, for the purpose of deriving a four-body statistical potential that serves as a topological scoring function. Next, we identify a second, independent set of three hundred protein-ligand complexes, having both high-resolution structures and known dissociation constants. Two-thirds of these complexes are randomly selected to train a predictive model of binding affinity as follows: two tessellations are generated in each case, one for the entire complex and another strictly for the isolated protein without its bound ligand, and a topological score is computed for each tessellation with the four-body potential. Predicted protein-ligand binding affinity is then based on an empirically derived linear function of the difference between both topological scores, one that appropriately scales the value of this difference.</p><p>A comparison between experimental and calculated binding affinity values over the two hundred complexes reveals a Pearson's correlation coefficient of <i>r</i> = 0.79 with a standard error of <i>SE</i> = 1.98 kcal/mol. To validate the method, we similarly generated two tessellations for each of the remaining protein-ligand complexes, computed their topological scores and the difference between the two scores for each complex, and applied the previously derived linear transformation of this topological score difference to predict binding affinities. For these one hundred complexes, we again observe a correlation of <i>r</i> = 0.79 (<i>SE</i> = 1.93 kcal/mol) between known and calculated binding affinities. Applying our model to an independent test set of high-resolution structures for three hundred diverse enzyme-inhibitor complexes, each with an experimentally known inhibition constant, also yields a correlation of <i>r</i> = 0.79 (<i>SE</i> = 2.39 kcal/mol) between experimental and calculated binding energies.</p><p>Lastly, we generate predictions with our model on a diverse test set of one hundred protein-ligand complexes previously used to benchmark 15 related methods, and our correlation of <i>r</i> = 0.66 between the calculated and experimental binding energies for this dataset exceeds those of the other approaches. Compared with these related prediction methods, our approach stands out based on salient features that include the reliability of our model, combined with the rapidity of the generated predictions, which are less than one second for an average sized complex.</p>","PeriodicalId":51240,"journal":{"name":"BMC Structural Biology","volume":"13 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2013-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1472-6807-13-S1-S1","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"4357914","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Andrew McKnight, Dong Si, Kamal Al Nasr, Andrey Chernikov, Nikos Chrisochoides, Jing He
{"title":"Estimating loop length from CryoEM images at medium resolutions","authors":"Andrew McKnight, Dong Si, Kamal Al Nasr, Andrey Chernikov, Nikos Chrisochoides, Jing He","doi":"10.1186/1472-6807-13-S1-S5","DOIUrl":"https://doi.org/10.1186/1472-6807-13-S1-S5","url":null,"abstract":"<p>De novo protein modeling approaches utilize 3-dimensional (3D) images derived from electron cryomicroscopy (CryoEM) experiments. The skeleton connecting two secondary structures such as <i>α</i>-helices represent the loop in the 3D image. The accuracy of the skeleton and of the detected secondary structures are critical in De novo modeling. It is important to measure the length along the skeleton accurately since the length can be used as a constraint in modeling the protein.</p><p>We have developed a novel computational geometric approach to derive a simplified curve in order to estimate the loop length along the skeleton. The method was tested using fifty simulated density images of helix-loop-helix segments of atomic structures and eighteen experimentally derived density data from Electron Microscopy Data Bank (EMDB). The test using simulated density maps shows that it is possible to estimate within 0.5? of the expected length for 48 of the 50 cases. The experiments, involving eighteen experimentally derived CryoEM images, show that twelve cases have error within 2?.</p><p>The tests using both simulated and experimentally derived images show that it is possible for our proposed method to estimate the loop length along the skeleton if the secondary structure elements, such as <i>α</i>-helices, can be detected accurately, and there is a continuous skeleton linking the <i>α</i>-helices.</p>","PeriodicalId":51240,"journal":{"name":"BMC Structural Biology","volume":"13 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2013-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1472-6807-13-S1-S5","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"4351895","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Unbiased, scalable sampling of protein loop conformations from probabilistic priors","authors":"Yajia Zhang, Kris Hauser","doi":"10.1186/1472-6807-13-S1-S9","DOIUrl":"https://doi.org/10.1186/1472-6807-13-S1-S9","url":null,"abstract":"<p>Protein loops are flexible structures that are intimately tied to function, but understanding loop motion and generating loop conformation ensembles remain significant computational challenges. Discrete search techniques scale poorly to large loops, optimization and molecular dynamics techniques are prone to local minima, and inverse kinematics techniques can only incorporate structural preferences in adhoc fashion. This paper presents Sub-Loop Inverse Kinematics Monte Carlo (SLIKMC), a new Markov chain Monte Carlo algorithm for generating conformations of closed loops according to experimentally available, heterogeneous structural preferences.</p><p>Our simulation experiments demonstrate that the method computes high-scoring conformations of large loops (<i>></i> 10 residues) orders of magnitude faster than standard Monte Carlo and discrete search techniques. Two new developments contribute to the scalability of the new method. First, structural preferences are specified via a probabilistic graphical model (PGM) that links conformation variables, spatial variables (e.g., atom positions), constraints and prior information in a unified framework. The method uses a sparse PGM that exploits locality of interactions between atoms and residues. Second, a novel method for sampling sub-loops is developed to generate statistically unbiased samples of probability densities restricted by loop-closure constraints.</p><p>Numerical experiments confirm that SLIKMC generates conformation ensembles that are statistically consistent with specified structural preferences. Protein conformations with 100+ residues are sampled on standard PC hardware in seconds. Application to proteins involved in ion-binding demonstrate its potential as a tool for loop ensemble generation and missing structure completion.</p>","PeriodicalId":51240,"journal":{"name":"BMC Structural Biology","volume":"13 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2013-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1472-6807-13-S1-S9","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"4354972","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"DINC: A new AutoDock-based protocol for docking large ligands","authors":"Ankur Dhanik, John S McMurray, Lydia E Kavraki","doi":"10.1186/1472-6807-13-S1-S11","DOIUrl":"https://doi.org/10.1186/1472-6807-13-S1-S11","url":null,"abstract":"<p>Using the popular program AutoDock, computer-aided docking of small ligands with 6 or fewer rotatable bonds, is reasonably fast and accurate. However, docking large ligands using AutoDock's recommended standard docking protocol is less accurate and computationally slow.</p><p>In our earlier work, we presented a novel AutoDock-based incremental protocol (DINC) that addresses the limitations of AutoDock's standard protocol by enabling improved docking of large ligands. Instead of docking a large ligand to a target protein in one single step as done in the standard protocol, our protocol docks the large ligand in increments. In this paper, we present three detailed examples of docking using DINC and compare the docking results with those obtained using AutoDock's standard protocol. We summarize the docking results from an extended docking study that was done on 73 protein-ligand complexes comprised of large ligands. We demonstrate not only that DINC is up to 2 orders of magnitude faster than AutoDock's standard protocol, but that it also achieves the speed-up without sacrificing docking accuracy. We also show that positional restraints can be applied to the large ligand using DINC: this is useful when computing a docked conformation of the ligand. Finally, we introduce a webserver for docking large ligands using DINC.</p><p>Docking large ligands using DINC is significantly faster than AutoDock's standard protocol without any loss of accuracy. Therefore, DINC could be used as an alternative protocol for docking large ligands. DINC has been implemented as a webserver and is available at http://dinc.kavrakilab.org. Applications such as therapeutic drug design, rational vaccine design, and others involving large ligands could benefit from DINC and its webserver implementation.</p>","PeriodicalId":51240,"journal":{"name":"BMC Structural Biology","volume":"13 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2013-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1472-6807-13-S1-S11","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"4354075","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ibrahim Al-Bluwi, Marc Vaisset, Thierry Siméon, Juan Cortés
{"title":"Modeling protein conformational transitions by a combination of coarse-grained normal mode analysis and robotics-inspired methods","authors":"Ibrahim Al-Bluwi, Marc Vaisset, Thierry Siméon, Juan Cortés","doi":"10.1186/1472-6807-13-S1-S2","DOIUrl":"https://doi.org/10.1186/1472-6807-13-S1-S2","url":null,"abstract":"<p>Obtaining atomic-scale information about large-amplitude conformational transitions in proteins is a challenging problem for both experimental and computational methods. Such information is, however, important for understanding the mechanisms of interaction of many proteins.</p><p>This paper presents a computationally efficient approach, combining methods originating from robotics and computational biophysics, to model protein conformational transitions. The ability of normal mode analysis to predict directions of collective, large-amplitude motions is applied to bias the conformational exploration performed by a motion planning algorithm. To reduce the dimension of the problem, normal modes are computed for a coarse-grained elastic network model built on short fragments of three residues. Nevertheless, the validity of intermediate conformations is checked using the all-atom model, which is accurately reconstructed from the coarse-grained one using closed-form inverse kinematics.</p><p>Tests on a set of ten proteins demonstrate the ability of the method to model conformational transitions of proteins within a few hours of computing time on a single processor. These results also show that the computing time scales linearly with the protein size, independently of the protein topology. Further experiments on adenylate kinase show that main features of the transition between the open and closed conformations of this protein are well captured in the computed path.</p><p>The proposed method enables the simulation of large-amplitude conformational transitions in proteins using very few computational resources. The resulting paths are a first approximation that can directly provide important information on the molecular mechanisms involved in the conformational transition. This approximation can be subsequently refined and analyzed using state-of-the-art energy models and molecular modeling methods.</p>","PeriodicalId":51240,"journal":{"name":"BMC Structural Biology","volume":"13 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2013-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1472-6807-13-S1-S2","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"4354946","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Paul J DePietro, Elchin S Julfayev, William A McLaughlin
{"title":"Quantification of the impact of PSI:Biology according to the annotations of the determined structures","authors":"Paul J DePietro, Elchin S Julfayev, William A McLaughlin","doi":"10.1186/1472-6807-13-24","DOIUrl":"https://doi.org/10.1186/1472-6807-13-24","url":null,"abstract":"<p>Protein Structure Initiative:Biology (PSI:Biology) is the third phase of PSI where protein structures are determined in high-throughput to characterize their biological functions. The transition to the third phase entailed the formation of PSI:Biology Partnerships which are composed of structural genomics centers and biomedical science laboratories. We present a method to examine the impact of protein structures determined under the auspices of PSI:Biology by measuring their rates of annotations. The mean numbers of annotations per structure and per residue are examined. These are designed to provide measures of the amount of structure to function connections that can be leveraged from each structure.</p><p>One result is that PSI:Biology structures are found to have a higher rate of annotations than structures determined during the first two phases of PSI. A second result is that the subset of PSI:Biology structures determined through PSI:Biology Partnerships have a higher rate of annotations than those determined exclusive of those partnerships. Both results hold when the annotation rates are examined either at the level of the entire protein or for annotations that are known to fall at specific residues within the portion of the protein that has a determined structure.</p><p>We conclude that PSI:Biology determines structures that are estimated to have a higher degree of biomedical interest than those determined during the first two phases of PSI based on a broad array of biomedical annotations. For the PSI:Biology Partnerships, we see that there is an associated added value that represents part of the progress toward the goals of PSI:Biology. We interpret the added value to mean that team-based structural biology projects that utilize the expertise and technologies of structural genomics centers together with biological laboratories in the community are conducted in a synergistic manner. We show that the annotation rates can be used in conjunction with established metrics, i.e. the numbers of structures and impact of publication records, to monitor the progress of PSI:Biology towards its goals of examining structure to function connections of high biomedical relevance. The metric provides an objective means to quantify the overall impact of PSI:Biology as it uses biomedical annotations from external sources.</p>","PeriodicalId":51240,"journal":{"name":"BMC Structural Biology","volume":"13 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2013-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1472-6807-13-24","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"4837187","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Thai V Hoang, Xavier Cavin, Patrick Schultz, David W Ritchie
{"title":"gEMpicker: a highly parallel GPU-accelerated particle picking tool for cryo-electron microscopy","authors":"Thai V Hoang, Xavier Cavin, Patrick Schultz, David W Ritchie","doi":"10.1186/1472-6807-13-25","DOIUrl":"https://doi.org/10.1186/1472-6807-13-25","url":null,"abstract":"<p>Picking images of particles in cryo-electron micrographs is an important step in solving the 3D structures of large macromolecular assemblies. However, in order to achieve sub-nanometre resolution it is often necessary to capture and process many thousands or even several millions of 2D particle images. Thus, a computational bottleneck in reaching high resolution is the accurate and automatic picking of particles from raw cryo-electron micrographs.</p><p>We have developed “gEMpicker”, a highly parallel correlation-based particle picking tool. To our knowledge, gEMpicker is the first particle picking program to use multiple graphics processor units (GPUs) to accelerate the calculation. When tested on the publicly available keyhole limpet hemocyanin dataset, we find that gEMpicker gives similar results to the FindEM program. However, compared to calculating correlations on one core of a contemporary central processor unit (CPU), running gEMpicker on a modern GPU gives a speed-up of about 27 ×. To achieve even higher processing speeds, the basic correlation calculations are accelerated considerably by using a hierarchy of parallel programming techniques to distribute the calculation over multiple GPUs and CPU cores attached to multiple nodes of a computer cluster. By using a theoretically optimal reduction algorithm to collect and combine the cluster calculation results, the speed of the overall calculation scales almost linearly with the number of cluster nodes available.</p><p>The very high picking throughput that is now possible using GPU-powered workstations or computer clusters will help experimentalists to achieve higher resolution 3D reconstructions more rapidly than before.</p>","PeriodicalId":51240,"journal":{"name":"BMC Structural Biology","volume":"13 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2013-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1472-6807-13-25","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"4837190","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Nicholas Chim, Christine A Harmston, David J Guzman, Celia W Goulding
{"title":"Structural and biochemical characterization of the essential DsbA-like disulfide bond forming protein from Mycobacterium tuberculosis","authors":"Nicholas Chim, Christine A Harmston, David J Guzman, Celia W Goulding","doi":"10.1186/1472-6807-13-23","DOIUrl":"https://doi.org/10.1186/1472-6807-13-23","url":null,"abstract":"<p>Bacterial <i>D</i> i<i>s</i> ulfide <i>b</i> ond forming (Dsb) proteins facilitate proper folding and disulfide bond formation of periplasmic and secreted proteins. Previously, we have shown that <i>Mycobacterium tuberculosis</i> Mt-DsbE and Mt-DsbF aid <i>in vitro</i> oxidative folding of proteins. The <i>M. tuberculosis</i> proteome contains another predicted membrane-tethered Dsb protein, Mt-DsbA, which is encoded by an essential gene.</p><p>Herein, we present structural and biochemical analyses of Mt-DsbA. The X-ray crystal structure of Mt-DsbA reveals a two-domain structure, comprising a canonical thioredoxin domain with the conserved CXXC active site cysteines in their reduced form, and an inserted α-helical domain containing a structural disulfide bond. The overall fold of Mt-DsbA resembles that of other DsbA-like proteins and not Mt-DsbE or Mt-DsbF. Biochemical characterization demonstrates that, unlike Mt-DsbE and Mt-DsbF, Mt-DsbA is unable to oxidatively fold reduced, denatured hirudin. Moreover, on the substrates tested in this study, Mt-DsbA has disulfide bond isomerase activity contrary to Mt-DsbE and Mt-DsbF.</p><p>These results suggest that Mt-DsbA acts upon a distinct subset of substrates as compared to Mt-DsbE and Mt-DsbF. One could speculate that Mt-DsbE and Mt-DsbF are functionally redundant whereas Mt-DsbA is not, offering an explanation for the essentiality of Mt-DsbA in <i>M. tuberculosis.</i>\u0000</p>","PeriodicalId":51240,"journal":{"name":"BMC Structural Biology","volume":"13 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2013-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1472-6807-13-23","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"4733830","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}