{"title":"基于统计原理的mirna -靶基因相互作用检测方法","authors":"Nai-Wen Chang, Hong-Jie Dai, Yu-Lun Hsieh, W. Hsu","doi":"10.1109/BIBE.2016.60","DOIUrl":null,"url":null,"abstract":"MicroRNAs (miRNAs) are small non-coding RNAs of approximately 23 nucleotides, which negatively regulate the gene expression at the post-transcriptional level. miRNAs have been considered as good candidates for early detection or prognosis biomarkers for various diseases. Validated miRNA targets are usually reported in literature, necessitating researchers to manually screen through the related literature to keep up-to-date with novel findings. However, the amount of miRNA-related literature is increasing rapidly which makes it difficult for researchers to keep up to date. This study develops a text mining pipeline based on the statistical principle-based approach (SPBA) to detect MiRNA-Target Interactions (MTIs) mentioned in literatures. SPBA uses a collection of principles to represent linguistic concepts or rules used by human for describing MTIs. Each principle is composed of a collection of slots, which can be automatically learned from training data by merging the labeled slot sequences into more representative principles through a dominating set algorithm. Followed by a partial matching algorithm, the proposed approach can successfully recognize miRNA mentions and extract their MTIs in articles with a promising F-score of 98.8% and an accuracy of 71.43%.","PeriodicalId":377504,"journal":{"name":"2016 IEEE 16th International Conference on Bioinformatics and Bioengineering (BIBE)","volume":"19 2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Statistical Principle-Based Approach for Detecting miRNA-Target Gene Interaction Articles\",\"authors\":\"Nai-Wen Chang, Hong-Jie Dai, Yu-Lun Hsieh, W. Hsu\",\"doi\":\"10.1109/BIBE.2016.60\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"MicroRNAs (miRNAs) are small non-coding RNAs of approximately 23 nucleotides, which negatively regulate the gene expression at the post-transcriptional level. miRNAs have been considered as good candidates for early detection or prognosis biomarkers for various diseases. Validated miRNA targets are usually reported in literature, necessitating researchers to manually screen through the related literature to keep up-to-date with novel findings. However, the amount of miRNA-related literature is increasing rapidly which makes it difficult for researchers to keep up to date. This study develops a text mining pipeline based on the statistical principle-based approach (SPBA) to detect MiRNA-Target Interactions (MTIs) mentioned in literatures. SPBA uses a collection of principles to represent linguistic concepts or rules used by human for describing MTIs. Each principle is composed of a collection of slots, which can be automatically learned from training data by merging the labeled slot sequences into more representative principles through a dominating set algorithm. Followed by a partial matching algorithm, the proposed approach can successfully recognize miRNA mentions and extract their MTIs in articles with a promising F-score of 98.8% and an accuracy of 71.43%.\",\"PeriodicalId\":377504,\"journal\":{\"name\":\"2016 IEEE 16th International Conference on Bioinformatics and Bioengineering (BIBE)\",\"volume\":\"19 2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 IEEE 16th International Conference on Bioinformatics and Bioengineering (BIBE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BIBE.2016.60\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE 16th International Conference on Bioinformatics and Bioengineering (BIBE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBE.2016.60","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Statistical Principle-Based Approach for Detecting miRNA-Target Gene Interaction Articles
MicroRNAs (miRNAs) are small non-coding RNAs of approximately 23 nucleotides, which negatively regulate the gene expression at the post-transcriptional level. miRNAs have been considered as good candidates for early detection or prognosis biomarkers for various diseases. Validated miRNA targets are usually reported in literature, necessitating researchers to manually screen through the related literature to keep up-to-date with novel findings. However, the amount of miRNA-related literature is increasing rapidly which makes it difficult for researchers to keep up to date. This study develops a text mining pipeline based on the statistical principle-based approach (SPBA) to detect MiRNA-Target Interactions (MTIs) mentioned in literatures. SPBA uses a collection of principles to represent linguistic concepts or rules used by human for describing MTIs. Each principle is composed of a collection of slots, which can be automatically learned from training data by merging the labeled slot sequences into more representative principles through a dominating set algorithm. Followed by a partial matching algorithm, the proposed approach can successfully recognize miRNA mentions and extract their MTIs in articles with a promising F-score of 98.8% and an accuracy of 71.43%.