Proceedings of the 8th International Conference on Computational Systems-Biology and Bioinformatics最新文献

AI and Big Data Analytics for Health and Bioinformatics 健康和生物信息学的人工智能和大数据分析

Proceedings of the 8th International Conference on Computational Systems-Biology and Bioinformatics Pub Date : 2017-12-07 DOI: 10.1145/3156346.3156347

C. Kwoh

引用次数: 0

Meta-analysis of whole-transcriptome data for prediction of novel genes associated with autism spectrum disorder 预测自闭症谱系障碍相关新基因的全转录组数据荟萃分析

Proceedings of the 8th International Conference on Computational Systems-Biology and Bioinformatics Pub Date : 2017-12-07 DOI: 10.1145/3156346.3156357

Duc-Hau Le, N. Van

{"title":"Meta-analysis of whole-transcriptome data for prediction of novel genes associated with autism spectrum disorder","authors":"Duc-Hau Le, N. Van","doi":"10.1145/3156346.3156357","DOIUrl":"https://doi.org/10.1145/3156346.3156357","url":null,"abstract":"Autism spectrum disorder (ASD) is a common heterogeneous neurodevelopmental disorder with typical symptoms such as impaired social interaction, language and communication abnormalities and stereotypical behavior. Since the genetics of ASDs is so diverse, information on genome function as provided by transcriptomic data is essential to further our understanding. This is transcriptome is a key link between measuring protein levels and genetic information. Transcriptome-based studies have been often performed by comparing ASD and control groups to identify which genes are dysregulated in the ASD group using statistical techniques. However, these statistical techniques can only find genes solely related to ASD, but cannot reflect relationship among genes which could be the etiology of ASD. In this study, we propose a novel method to find the ASD-associated genes, which are predictive for ASD. For this purpose, we metaanalyze whole-transcriptomic data of previous studies for ASD, which were performed using some expression profiling platforms on different issues of interest. These predictive genes, which can differentiate a sample into either ASD or non-ASD, are selected by an optimization process. Comparing subsets selected from different tissues/platforms, we conclude that tissues contain different gene sets associated with ASD. In addition, a platform can supply other ASD-associated genes of which other platforms cannot. Identified genes are then compared to those which have been well documented in SFARI, which is the most comprehensive and up-to-date data of ASD. Interestingly, we can find two novel genes with evidences from literature, which have not yet been recorded in this database. In summary, meta-analysis on whole-transcriptome data of ASD could shed light on the etiology of ASD.","PeriodicalId":415207,"journal":{"name":"Proceedings of the 8th International Conference on Computational Systems-Biology and Bioinformatics","volume":"82 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126287962","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

Extraction of disease-related genes from PubMed paper using word2vec 利用word2vec从PubMed论文中提取疾病相关基因

Proceedings of the 8th International Conference on Computational Systems-Biology and Bioinformatics Pub Date : 2017-12-07 DOI: 10.1145/3156346.3156355

Takahiro Koiwa, H. Ohwada

{"title":"Extraction of disease-related genes from PubMed paper using word2vec","authors":"Takahiro Koiwa, H. Ohwada","doi":"10.1145/3156346.3156355","DOIUrl":"https://doi.org/10.1145/3156346.3156355","url":null,"abstract":"Finding disease-related genes is important in drug discovery. Many genes are involved in the disease, and many studies have been conducted and reported for each disease. However, it is very costly to check these one by one. Therefore, machine learning is a suitable method to address this problem. By extracting study results from research papers by text mining, it is possible to make use of that knowledge. In this research, we aim to extract disease-related genes from PubMed papers using word2vec, which is a text mining method. The method extracts the top 10 genes whose known disease genes and vectors are close to those obtained by word2vec. Based on these, genes other than known disease-related genes are extracted and used as disease-related genes. We conducted experiments using schizophrenia, and confirmed the likelihood of this disease-related gene using xgboost. Pattern 1: Only known genes. Pattern 2: Pattern 1 plus disease-related genes extracted in this study. Pattern 3: Pattern 1 plus the same number of random genes. Using these three patterns, we performed a xgboost with microarray data and compared the classification accuracy. The result was that Pattern 2 had the highest accuracy. Therefore, we could extract genes with using genes related to disease by our method.","PeriodicalId":415207,"journal":{"name":"Proceedings of the 8th International Conference on Computational Systems-Biology and Bioinformatics","volume":"47 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116651880","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

The effect of reference species on reference-guided genome assembly 参考物种对参考基因组组装的影响

Proceedings of the 8th International Conference on Computational Systems-Biology and Bioinformatics Pub Date : 2017-12-07 DOI: 10.1145/3156346.3156351

Juyeon Kim, Daehwan Lee, Mikang Sim, Jongin Lee, Jaebum Kim

{"title":"The effect of reference species on reference-guided genome assembly","authors":"Juyeon Kim, Daehwan Lee, Mikang Sim, Jongin Lee, Jaebum Kim","doi":"10.1145/3156346.3156351","DOIUrl":"https://doi.org/10.1145/3156346.3156351","url":null,"abstract":"The rapid improvement of the next-generation sequencing (NGS) technologies has enabled unprecedented production of huge DNA sequence data at low cost. However, the NGS technologies are still limited to generate short DNA sequences, which has led to the development of many assembly algorithms to recover whole genome sequences from those short sequences. Unfortunately, the assembly algorithms alone can only construct scaffold sequences, which are generally much shorter than chromosome sequences. To generate chromosome sequences, additional expensive experimental data is required. To overcome this problem, there have been many studies to develop new computational algorithms to further merge the scaffold sequences, and produce chromosome-level sequences by utilizing an existing genome assembly of a related species called a reference. However, even though the quality of the chosen reference assembly is critical for generating a good final assembly, its effect is not well uncovered yet. In this study, we measured the effect of the reference genome assembly on the quality of the final assembly generated by reference-guided assembly algorithms. By using the genome assemblies of total eleven reference species (eight primates and three rodents), the human genome sequences were assembled from scaffold sequences by one of the reference-guided assembly algorithms, called RACA, and they were compared with known genome sequences to measure their quality in terms of the number of misassemblies. The effect of the quality of the reference assemblies was investigated in terms of divergence time against human, alignment coverage between the reference and human, and the amount of inclusion of core eukaryotic genes. We found that the divergence time is a good indicator of the quality of the final assembly when reference assemblies with high quality are used. We believe this study will contribute to broaden our understanding of the effect and importance of a reference assembly on the reference-guided assembly task.","PeriodicalId":415207,"journal":{"name":"Proceedings of the 8th International Conference on Computational Systems-Biology and Bioinformatics","volume":"179 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126950032","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Proposal of application method of Inductive Logic Programming to microarray data 归纳逻辑程序设计在微阵列数据中的应用方法

Proceedings of the 8th International Conference on Computational Systems-Biology and Bioinformatics Pub Date : 2017-12-07 DOI: 10.1145/3156346.3156356

Hiromu Ide, M. Umezawa, H. Ohwada

{"title":"Proposal of application method of Inductive Logic Programming to microarray data","authors":"Hiromu Ide, M. Umezawa, H. Ohwada","doi":"10.1145/3156346.3156356","DOIUrl":"https://doi.org/10.1145/3156346.3156356","url":null,"abstract":"This paper describing a method of specifying common terms of genes from microarray data in 3 steps. First, we use random forest for extracting disease-related genes and it give each gene variable importance. The higher the variable importance, the more effective feature for classification. We extract genes whose variable importance more than 0 and set them positive samples and the rest set negative samples for ILP. Next, we annotate extracted genes by using Gene Ontology (GO) and use the term as predicate for ILP. Annotation is the process of assigning GO terms to gene products. Finally, we obtain rules about common terms in positive samples by using ILP. ILP is a subfield of machine learning which uses logic programming as a uniform representation technique for examples, background knowledge and hypotheses. ILP learns based on background knowledge. Background knowledge is represented in first-order logic. In the result, we extracted 1051 mRNA as positive samples for ILP from random forest and its F-measure score was 65.1%. We obtained about 4000 terms at each dataset and use them as predicates for ILP. We got eventually some rules about positive samples.","PeriodicalId":415207,"journal":{"name":"Proceedings of the 8th International Conference on Computational Systems-Biology and Bioinformatics","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115200891","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A machine learning approach for drug discovery from herbal medicine: Metabolite profiles to Therapeutic effects 从草药药物发现的机器学习方法:代谢物谱到治疗效果

Proceedings of the 8th International Conference on Computational Systems-Biology and Bioinformatics Pub Date : 2017-12-07 DOI: 10.1145/3156346.3156352

P. T. Duy, Nguyen Minh Thanh, N. Vu, Ly Le

{"title":"A machine learning approach for drug discovery from herbal medicine: Metabolite profiles to Therapeutic effects","authors":"P. T. Duy, Nguyen Minh Thanh, N. Vu, Ly Le","doi":"10.1145/3156346.3156352","DOIUrl":"https://doi.org/10.1145/3156346.3156352","url":null,"abstract":"Vietnam has an abundant of herbal traditional medicine with accumulated experience for thousands of years. They play an important role in the drug development. However, several therapeutic effects remain unknown among these plants. To explore active ingredients in the effective Vietnamese herbal medicine formulations for individual diseases and to understand therapeutic effects under scientific viewpoint, this project predicts therapeutic effects based on metabolite profiles. The herbal medicine database has been processed to get the useful information by the supporting of computational approach, particularly Random forest algorithm, Generalized Boosted Model and Support Vector Machine. Three specific therapeutic effects which are \"Edema treatment\", \"Astrictive treatment\" and \"Cure sore eyes\" - metabolites binary classification model to deal with multi-class classification and unbalanced class data problem. Since this project can reveal the main predictors of specific therapeutic effect, they are valuable information for further research of drug development.","PeriodicalId":415207,"journal":{"name":"Proceedings of the 8th International Conference on Computational Systems-Biology and Bioinformatics","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131116252","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Identifying microRNA targets in epithelial-mesenchymal transition using joint-intervention causal inference 利用联合干预因果推理识别上皮-间质转化中的microRNA靶点

Proceedings of the 8th International Conference on Computational Systems-Biology and Bioinformatics Pub Date : 2017-12-07 DOI: 10.1145/3156346.3156353

T. Le, Junpeng Zhang, Lin Liu, B. Truong, Shu Hu, Taosheng Xu, Jiuyong Li

{"title":"Identifying microRNA targets in epithelial-mesenchymal transition using joint-intervention causal inference","authors":"T. Le, Junpeng Zhang, Lin Liu, B. Truong, Shu Hu, Taosheng Xu, Jiuyong Li","doi":"10.1145/3156346.3156353","DOIUrl":"https://doi.org/10.1145/3156346.3156353","url":null,"abstract":"microRNAs (miRNAs) are important gene regulators, controlling a wide range of biological processes and being involved in several types of cancers. Currently, several computational approaches have been developed to elucidate the miRNA-mRNA regulatory relationships. However, these approaches have their own limitations and we are still far from understanding the miRNA-mRNA relationships, especially in specific biological processes. In this paper, we adapt a causal inference method to infer miRNA targets from the Epithelial Mesenchymal Transition (EMT) dataset. Our method utilises a causality based method that estimates the causal effect of each miRNA on a mRNA while controlling the effects of other miRNAs on the mRNA. The inferred causal effect is similar to the effect of a miRNA on a mRNA when we knockout all the other miRNAs. The experimental results show that our method is better than existing benchmark methods in finding experimentally confirmed miRNA targets. Moreover, we have found that the miR-200 family members (miR-141, miR-200a/b/c, and miR-429) synergistically regulate a number of target genes in EMT, suggesting their roles in controlling cancer metastasis. In addition, functional and pathway enrichment analyses show that the discovered miRNA-mRNA regulatory relationships are highly enriched in EMT, implying the validity of the proposed method. Novel miRNA-mRNA regulatory relationships discovered by our method provide a rich resource for follow up wet-lab experiments and EMT related studies.","PeriodicalId":415207,"journal":{"name":"Proceedings of the 8th International Conference on Computational Systems-Biology and Bioinformatics","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130803838","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

Revealing deep proteome diversity with community-scale proteomics big data 利用群落规模的蛋白质组学大数据揭示深层蛋白质组多样性

Proceedings of the 8th International Conference on Computational Systems-Biology and Bioinformatics Pub Date : 2017-12-07 DOI: 10.1145/3156346.3156694

N. Bandeira

{"title":"Revealing deep proteome diversity with community-scale proteomics big data","authors":"N. Bandeira","doi":"10.1145/3156346.3156694","DOIUrl":"https://doi.org/10.1145/3156346.3156694","url":null,"abstract":"Translating the growing volumes of proteomics mass spectrometry data into reusable evidence of the occurrence and provenance of proteomics events requires the development of novel algorithms and community-scale computational workflows. MassIVE (http://massive.ucsd.edu) proposes to address this challenge in three stages. First, systematic annotation of human proteomics big data requires automated reanalysis of all public data using open source workflows with detailed records of search parameters and of individual Peptide Spectrum Matches (PSMs). As such, our large-scale reanalysis of tens of terabytes of human data has now increased the total number of proper public PSMs by over 10-fold to over 320 million PSMs whose coverage includes over 95 Second, proper synthesis of community-scale search results into a reusable knowledge base (KB) requires scalable workflows imposing strict statistical controls. Our MassIVE-KB spectral library has thus properly assembled 2+ million precursors from over 1.5 million peptides covering over 6.2 million amino acids in the human proteome, all of which at least double the numbers covered by the popular NIST spectral libraries. Moreover, MassIVE-KB detects 723 novel proteins (PE 2-5) for a total of 16,852 proteins observed in non-synthetic LCMS runs and 19,610 total proteins when including the recent ProteomeTools data. Third, we show how advanced identification algorithms combine with public data to reveal dozens of unexpected putative modifications supported by multiple highly-correlated spectra. These show that protein regions can be observed in over 100 different variants with various combinations of post-translational modifications and cleavage events, thus suggesting that current coverage of proteome diversity (at 1.3 variants per protein region) is far below what is observable in experimental data.","PeriodicalId":415207,"journal":{"name":"Proceedings of the 8th International Conference on Computational Systems-Biology and Bioinformatics","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129303615","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Formal Validation of Neural Networks as Timed Automata 神经网络作为时间自动机的形式化验证

Proceedings of the 8th International Conference on Computational Systems-Biology and Bioinformatics Pub Date : 2017-12-07 DOI: 10.1145/3156346.3156350

Elisabetta De Maria, C. Giusto, Giovanni Ciatto

引用次数: 4

Drug Repurposing: Targeting mTOR Inhibitors for Anticancer Activity 药物再利用:靶向mTOR抑制剂的抗癌活性

Proceedings of the 8th International Conference on Computational Systems-Biology and Bioinformatics Pub Date : 2017-12-07 DOI: 10.1145/3156346.3156359

E. Kabir, Mohammad Kawsar Sharif Siam, S. M. Kabir, Arafat H Khan, Samiul Alam Rajib

{"title":"Drug Repurposing: Targeting mTOR Inhibitors for Anticancer Activity","authors":"E. Kabir, Mohammad Kawsar Sharif Siam, S. M. Kabir, Arafat H Khan, Samiul Alam Rajib","doi":"10.1145/3156346.3156359","DOIUrl":"https://doi.org/10.1145/3156346.3156359","url":null,"abstract":"In the search of safer and more effective drugs while reducing costs and increasing productivity of novel drug discovery, scientists are changing their focus to an approach known as drug repurposing. This involves finding a new therapeutic effect of an already existing drug. It is a method that can effectively be addressed in the drug discovery and development challenges of targeting different disorders. Many drugs which have failed clinical trials for not being effective in their intended therapeutic indication have also been repurposed which has been of great benefit for pharmaceutical industries. For instance, sildenafil failed its clinical trials and was repurposed and currently in use as a repurposed drug. Many methods are available for drug repurposing but in silico method is a cost effective and convenient method for drug repurposing which uses computer software to find a possible binding site of a drug within a protein. For its advantages, computational docking approach was used for the present drug repurposing study of mTOR protein, where the drugs chosen were metformin, aspirin and rosuvastatin. Autodock Vina and PyMOL was used to complete the study and it was found that aspirin and metformin have poor affinity (-5.8 kcal/mol) for this protein which is upregulated in various types of cancer such as breast cancer and ovarian cancer. On the other hand, rosuvastatin was found to have a high affinity (-7.8 kcal/mol in case of flexible docking and -10.2 kcal/mol in case of rigid docking) for mTOR and binds to the same binding pocket where the immunosuppressant and anticancer drug rapamycin binds. The study therefore indicates that rosuvastatin might have significant immunosuppressive and anticancer activity by downregulating the activity of mTOR and needs further studies to prove it.","PeriodicalId":415207,"journal":{"name":"Proceedings of the 8th International Conference on Computational Systems-Biology and Bioinformatics","volume":"216 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123027254","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 7