Design of a state-machine based genomic simulator and development of a system for prediction of Rheumatoid Arthritis (RA) using signal processing techniques.

T. Lakshmi, K. B. Ramesh, V. Niranjan, Aishwarya Shetty, N. Monica, Aishwarya Rao
{"title":"Design of a state-machine based genomic simulator and development of a system for prediction of Rheumatoid Arthritis (RA) using signal processing techniques.","authors":"T. Lakshmi, K. B. Ramesh, V. Niranjan, Aishwarya Shetty, N. Monica, Aishwarya Rao","doi":"10.1109/CCUBE.2017.8394131","DOIUrl":null,"url":null,"abstract":"Rheumatic Arthritis (RA) is a chronic, autoimmune, inflammatory disease involving primarily the peripheral synovial joints. The diagnosis of RA in its pre-clinical phase is of at most importance as it can prevent progressive and irreversible joint damage if treated early. As RA is a genetic disorder, diagnosis through genomic sequence analysis has proven to be an appropriate solution to achieve the above goal [2]. Digital Signal Processing (DSP) applications in bio- informatics has received great attention in recent years, where computationally efficient methods for genome sequence analysis have been developed by utilizing existing signal processing algorithms. In the proposed work, a software module that uses signal processing techniques to predict probability of the future occurrence of RA has been developed. This is done by reviewing medical literature to identify the genes responsible for causing the disease and subsequently obtaining the nucleotide sequences of these genes through GenBank, a standard open-access gene database. The nucleotides are then mapped onto a unit circle in the complex plane so that complimentary base pairs are complex conjugates of each other and the magnitudes of the nucleotides are normalized at unity. Risk gene patterns are then searched in the chromosome sequence under test. Cross-correlation, which is a signal processing algorithm, was used for recognition of presence of risk genes in the chromosome sequence. The usage of cross- correlation not only allowed the identification of mutated sequences but also reduced the time complexity to O[Nlog2(N)].A relative genetic risk score and overall genetic risk score of probability of developing RA was then calculated using statistical methods. In order to test the system, a genome sequence simulator whose underlying architecture is that of a state machine, was created. Using this simulator multiple datasets containing several combinations of risk genes were generated. The system tested using the datasets thus obtained was found to be 95% accurate when the risk magnitudes obtained by the system was compared against the ground truth values given in RAVariome database for the same set of genes chosen. Hence by ensuring early diagnosis, the system will assist doctors to formulate effective treatment plans and thus prevent joint deterioration and permanent functional disability.","PeriodicalId":443423,"journal":{"name":"2017 International Conference on Circuits, Controls, and Communications (CCUBE)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Circuits, Controls, and Communications (CCUBE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCUBE.2017.8394131","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Rheumatic Arthritis (RA) is a chronic, autoimmune, inflammatory disease involving primarily the peripheral synovial joints. The diagnosis of RA in its pre-clinical phase is of at most importance as it can prevent progressive and irreversible joint damage if treated early. As RA is a genetic disorder, diagnosis through genomic sequence analysis has proven to be an appropriate solution to achieve the above goal [2]. Digital Signal Processing (DSP) applications in bio- informatics has received great attention in recent years, where computationally efficient methods for genome sequence analysis have been developed by utilizing existing signal processing algorithms. In the proposed work, a software module that uses signal processing techniques to predict probability of the future occurrence of RA has been developed. This is done by reviewing medical literature to identify the genes responsible for causing the disease and subsequently obtaining the nucleotide sequences of these genes through GenBank, a standard open-access gene database. The nucleotides are then mapped onto a unit circle in the complex plane so that complimentary base pairs are complex conjugates of each other and the magnitudes of the nucleotides are normalized at unity. Risk gene patterns are then searched in the chromosome sequence under test. Cross-correlation, which is a signal processing algorithm, was used for recognition of presence of risk genes in the chromosome sequence. The usage of cross- correlation not only allowed the identification of mutated sequences but also reduced the time complexity to O[Nlog2(N)].A relative genetic risk score and overall genetic risk score of probability of developing RA was then calculated using statistical methods. In order to test the system, a genome sequence simulator whose underlying architecture is that of a state machine, was created. Using this simulator multiple datasets containing several combinations of risk genes were generated. The system tested using the datasets thus obtained was found to be 95% accurate when the risk magnitudes obtained by the system was compared against the ground truth values given in RAVariome database for the same set of genes chosen. Hence by ensuring early diagnosis, the system will assist doctors to formulate effective treatment plans and thus prevent joint deterioration and permanent functional disability.
风湿性关节炎(RA)是一种慢性自身免疫性炎症性疾病,主要累及周围滑膜关节。RA在临床前阶段的诊断是最重要的,因为如果早期治疗,它可以防止进行性和不可逆的关节损伤。由于RA是一种遗传性疾病,通过基因组序列分析进行诊断已被证明是实现上述目标的合适解决方案[2]。近年来,数字信号处理(DSP)在生物信息学中的应用受到了广泛的关注,利用现有的信号处理算法开发了计算效率高的基因组序列分析方法。在本文中,我们开发了一个利用信号处理技术预测RA未来发生概率的软件模块。这是通过查阅医学文献来确定导致疾病的基因,并随后通过标准的开放获取基因数据库GenBank获得这些基因的核苷酸序列来完成的。然后将核苷酸映射到复平面上的单位圆上,以便互补碱基对彼此是复共轭,并且核苷酸的大小在单位处归一化。然后在测试的染色体序列中搜索风险基因模式。交叉相关是一种信号处理算法,用于识别染色体序列中存在的风险基因。相互关系的使用不仅可以识别突变序列,而且可以将时间复杂度降低到0 [Nlog2(N)]。采用统计学方法计算RA发生概率的相对遗传风险评分和总体遗传风险评分。为了测试该系统,我们创建了一个基因组序列模拟器,其底层架构是状态机。使用该模拟器生成了包含多种风险基因组合的多个数据集。当将系统获得的风险等级与RAVariome数据库中给定的同一组所选基因的基础真值进行比较时,使用由此获得的数据集测试的系统发现准确率为95%。因此,通过确保早期诊断,该系统将帮助医生制定有效的治疗方案,从而防止关节恶化和永久性功能残疾。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信