转座要素层次分类中正反例的选择策略

Bruna Zamith Santos, G. Pereira, F. Nakano, R. Cerri
{"title":"转座要素层次分类中正反例的选择策略","authors":"Bruna Zamith Santos, G. Pereira, F. Nakano, R. Cerri","doi":"10.1109/BRACIS.2018.00079","DOIUrl":null,"url":null,"abstract":"Transposable Elements (TEs) are DNA sequences capable of changing the gene's activity through transposition within the cells of a host. Once TEs insert themselves in other genes, they can change or reduce the activity of certain proteins, which in some cases could unfeasible the survival of such organisms or even provide genetic variability. A variety of methods has been proposed for the identification and classification of TEs, but most of them still involve a lot of manual work or are too class-specific, which restricts its applicability. Besides, the classes involved in such problems are often hierarchically structured, which is ignored by most of these methods. In this scenario, one problem that still needs further investigation is the use of strategies for selecting positive and negative instances during the induction of hierarchical models. Therefore, in this paper we explore four distinct strategies for selecting training instances, making use of several Machine Learning classifiers with different biases which were applied to the Hierarchical Classification of TEs using a local approach. Thus, we recommend the best strategy based on the results experimentally obtained.","PeriodicalId":405190,"journal":{"name":"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Strategies for Selection of Positive and Negative Instances in the Hierarchical Classification of Transposable Elements\",\"authors\":\"Bruna Zamith Santos, G. Pereira, F. Nakano, R. Cerri\",\"doi\":\"10.1109/BRACIS.2018.00079\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Transposable Elements (TEs) are DNA sequences capable of changing the gene's activity through transposition within the cells of a host. Once TEs insert themselves in other genes, they can change or reduce the activity of certain proteins, which in some cases could unfeasible the survival of such organisms or even provide genetic variability. A variety of methods has been proposed for the identification and classification of TEs, but most of them still involve a lot of manual work or are too class-specific, which restricts its applicability. Besides, the classes involved in such problems are often hierarchically structured, which is ignored by most of these methods. In this scenario, one problem that still needs further investigation is the use of strategies for selecting positive and negative instances during the induction of hierarchical models. Therefore, in this paper we explore four distinct strategies for selecting training instances, making use of several Machine Learning classifiers with different biases which were applied to the Hierarchical Classification of TEs using a local approach. Thus, we recommend the best strategy based on the results experimentally obtained.\",\"PeriodicalId\":405190,\"journal\":{\"name\":\"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)\",\"volume\":\"30 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BRACIS.2018.00079\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BRACIS.2018.00079","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

摘要

转座因子(te)是能够通过在宿主细胞内的转座改变基因活性的DNA序列。一旦te插入到其他基因中,它们可以改变或降低某些蛋白质的活性,这在某些情况下可能会使这些生物体的生存无法实现,甚至会产生遗传变异。对于TEs的识别和分类已经提出了多种方法,但大多数方法仍然涉及大量的手工工作或过于特定于类别,这限制了其适用性。此外,这些问题所涉及的类通常是分层结构的,这一点被大多数方法所忽略。在这种情况下,仍然需要进一步研究的一个问题是在分层模型的归纳过程中选择积极和消极实例的策略的使用。因此,在本文中,我们探索了四种不同的策略来选择训练实例,利用几种具有不同偏差的机器学习分类器,这些分类器使用局部方法应用于te的分层分类。因此,我们根据实验结果推荐最佳策略。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Strategies for Selection of Positive and Negative Instances in the Hierarchical Classification of Transposable Elements
Transposable Elements (TEs) are DNA sequences capable of changing the gene's activity through transposition within the cells of a host. Once TEs insert themselves in other genes, they can change or reduce the activity of certain proteins, which in some cases could unfeasible the survival of such organisms or even provide genetic variability. A variety of methods has been proposed for the identification and classification of TEs, but most of them still involve a lot of manual work or are too class-specific, which restricts its applicability. Besides, the classes involved in such problems are often hierarchically structured, which is ignored by most of these methods. In this scenario, one problem that still needs further investigation is the use of strategies for selecting positive and negative instances during the induction of hierarchical models. Therefore, in this paper we explore four distinct strategies for selecting training instances, making use of several Machine Learning classifiers with different biases which were applied to the Hierarchical Classification of TEs using a local approach. Thus, we recommend the best strategy based on the results experimentally obtained.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信