{"title":"A Novel Evolutionary Biclustering Approach using MapReduce(EBC-MR)","authors":"R. Rathipriya","doi":"10.4018/IJKDB.2016010103","DOIUrl":"https://doi.org/10.4018/IJKDB.2016010103","url":null,"abstract":"A novel biclustering approach is proposed in this paper, which can be used to cluster data like web data, gene expression data into local pattern using MapReduce framework. The proposed biclustering approach extracts the highly coherent bicluster using a correlation measure called Average Correlation Value measure. Furthermore, MapReduce based genetic algorithm is firstly used to the biclustering of web data. This method can avoid local convergence in the optimization algorithms mostly. The MSWeb dataset and MSNBC web usage data set are used to test the performance of new MapReduce based Evolutionary biclustering algorithm. The experimental study is carried out for comparison of proposed algorithm with traditional genetic algorithm in biclustering. The results reveal that novel proposed approach preforms better than existing evolutionary biclustering approach.","PeriodicalId":160270,"journal":{"name":"Int. J. Knowl. Discov. Bioinform.","volume":"85 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125859995","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
S. Fairley, J. McClure, Neil Hanlon, Robert W. Irving, M. McBride, A. Dominiczak, Ela Pustulka-Hunt
{"title":"Mapping Affymetrix Microarray Probes to the Rat Genome via a Persistent Index","authors":"S. Fairley, J. McClure, Neil Hanlon, Robert W. Irving, M. McBride, A. Dominiczak, Ela Pustulka-Hunt","doi":"10.4018/JKDB.2010100204","DOIUrl":"https://doi.org/10.4018/JKDB.2010100204","url":null,"abstract":"A probe mapping technique using a novel implementation of a persistent q-gram index was developed. It guarantees to find all matches that meet certain definitions. These include exact matching of the central 19 bases of 25 base probes, matching the central 19 bases with at most one or three mismatches and exact matching of any 16 bases. In comparison with BLAST and BLAT, the new methods were either significantly faster or identified matches missed by the heuristics. The 16 bp method was used to map the 342,410 perfect match probes from the Affymetrix GeneChip Rat Genome 230 2.0 Array to the genome. When compared with the mapping from Ensembl, the new mapping included over seven million novel matches, providing additional evidence for researchers wishing to further investigate the sources of signals measured in microarray experiments. The results demonstrate the practicality of the index, which could support other q-gram based algorithms.","PeriodicalId":160270,"journal":{"name":"Int. J. Knowl. Discov. Bioinform.","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131988641","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Charles D. Hammack, George Perry, R. LeBaron, G. Villareal, C. Phelix
{"title":"Low Dose Pioglitazone Attenuates Oxidative Damage in Early Alzheimer's Disease by Binding mitoNEET: Transcriptome-To-Reactome™ Biosimulation of Neurons","authors":"Charles D. Hammack, George Perry, R. LeBaron, G. Villareal, C. Phelix","doi":"10.4018/IJKDB.2015010103","DOIUrl":"https://doi.org/10.4018/IJKDB.2015010103","url":null,"abstract":"Oxidative damage OD is considered to be a central component in the progression of Alzheimer's disease AD. 8-hydroxyguanosine 8-OHG, a readily oxidized ribonucleic acid found in AD, was used as a biomarker to investigate the role of OD in the progression of the disease. A disruption in two critical Thioredoxin-Dependent Peroxiredoxin System components, peroxiredoxin-3 Prx-3 and thioredoxin Trx, may serve as a source of the increased accumulation of OD observed in AD. We demonstrate that OD, in the form of 8-OHG, was quantitatively most significant during the earliest stage of AD [F 3, 25 = 5.08, p <.01]. A drastic decline in mitochondrial protein levels of Prx-3 [F 3, 25 = 8.74, p. < 01] and Trx [F 3, 25 = 4.33, p. < 05] were also observed across the progression of the disease. We then tested the efficacy of pioglitazone, a thiazolidinedione class drug aimed to delay onset of AD by acting on mitoNEET. Our results showed a significant reduction in the oxidized variant of mitoNEET within the incipient population when a 0.8mg dose was simulated in silico p = 0.0242; a. < 05.","PeriodicalId":160270,"journal":{"name":"Int. J. Knowl. Discov. Bioinform.","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134208735","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Prediction of Protein-Protein Interactions Between Human Host and Two Mycobacterial Organisms","authors":"O. Krishnadev, S. Bisht, N. Srinivasan","doi":"10.4018/jkdb.2010100201","DOIUrl":"https://doi.org/10.4018/jkdb.2010100201","url":null,"abstract":"The genomes of many human pathogens have been sequenced but the protein-protein interactions across a pathogen and human are still poorly understood. The authors apply a simple homology-based method to predict protein-protein interactions between human host and two mycobacterial organisms viz., M.tuberculosis and M.leprae. They focused on secreted proteins of pathogens and cellular membrane proteins to restrict to uncovering biologically significant and feasible interactions. Predicted interactions include five mycobacterial proteins of yet unknown function, thus suggesting a role for these proteins in pathogenesis. The authors predict interaction partners for secreted mycobacterial antigens such as MPT70, serine proteases and other proteins interacting with human proteins, such as toll-like receptors, ras signalling proteins and immune maintenance proteins, that are implicated in pathogenesis. These results suggest that the list of predicted interactions is suitable for further analysis and forms a useful step in the understanding of pathogenesis of these mycobacterial organisms.","PeriodicalId":160270,"journal":{"name":"Int. J. Knowl. Discov. Bioinform.","volume":"222 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115347297","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Distributed Scalar Controller Selection Scheme for Redundant Data Elimination in Sensor Networks","authors":"S. Priyadarshini, S. Panigrahi","doi":"10.4018/IJKDB.2017010107","DOIUrl":"https://doi.org/10.4018/IJKDB.2017010107","url":null,"abstract":"This research presents a novel distributed strategy for actuation of reduced number of cameras motivated by scalar controller selection for eliminating the amount of redundant data transmission taking place in any geographic zone under speculation. The proposed framework is based on dividing the monitored region into a number of virtual compartments. A scalar controller is selected in each of the compartments, which is chosen in such a manner that its distance is the minimum among all the scalars from the central point of the concerned compartment. Further, all the cameras are arranged in a predetermined circular fashion. Whenever, an event takes place, the scalar controllers inform their respective cameras regarding its occurrence. The cameras collaboratively exchange information among themselves for deciding which among them are to be actuated. The least camera activation, enhanced coverage ratio, minimized redundancy ratio, reduced energy and power expenditure obtained from the experimental outcomes affirm the effectiveness of our proposed method over other approaches.","PeriodicalId":160270,"journal":{"name":"Int. J. Knowl. Discov. Bioinform.","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130249864","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Characterization and Classification of Local Protein Surfaces Using Self-Organizing Map","authors":"Lee Sael, D. Kihara","doi":"10.4018/jkdb.2010100203","DOIUrl":"https://doi.org/10.4018/jkdb.2010100203","url":null,"abstract":"Annotating protein structures is an urgent task as increasing number of protein structures of unknown function is being solved. To achieve this goal, it is critical to establish computational methods for characterizing and classifying protein local structures. The authors analyzed the similarity of local surface patches from 609 representative proteins considering shape and the electrostatic potential, which are represented by the 3D Zernike descriptors. Classification of local patches is done with the emergent self-organizing map (ESOM). They mapped patches at ligand binding-sites to investigate how they distribute and cluster among the ESOM map. They obtained 30-50 clusters of local surfaces of different characteristics, which will be useful for annotating surface of proteins.","PeriodicalId":160270,"journal":{"name":"Int. J. Knowl. Discov. Bioinform.","volume":"30 9","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120933378","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tsuyoshi Kato, Kinya Okada, H. Kashima, Masashi Sugiyama
{"title":"A Transfer Learning Approach and Selective Integration of Multiple Types of Assays for Biological Network Inference","authors":"Tsuyoshi Kato, Kinya Okada, H. Kashima, Masashi Sugiyama","doi":"10.4018/jkdb.2010100205","DOIUrl":"https://doi.org/10.4018/jkdb.2010100205","url":null,"abstract":"Inferring the relationship among proteins is a central issue of computational biology and a diversity of biological assays are utilized to predict the relationship. However, as experiments are usually expensive to perform, automatic data selection is employed to reduce the data collection cost. Although data useful for link prediction are different in each local sub-network, existing methods cannot select different data for different processes. This article presents a new algorithm for inferring biological networks from multiple types of assays. The proposed algorithm is based on transfer learning and can exploit local information effectively. Each assay is automatically weighted through learning and the weights can be adaptively different in each local part. The authors’ algorithm was favorably examined on two kinds of biological networks: a metabolic network and a protein interaction network. A statistical test confirmed that the weight that our algorithm assigned to each assay was meaningful.","PeriodicalId":160270,"journal":{"name":"Int. J. Knowl. Discov. Bioinform.","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114140107","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Figure Based Biomedical Document Retrieval System using Structural Image Features","authors":"Harikrishna G. N. Rai, K. Deepak, P. R. Krishna","doi":"10.4018/jkdb.2012010103","DOIUrl":"https://doi.org/10.4018/jkdb.2012010103","url":null,"abstract":"Multi-modal and Unstructured nature of documents make their retrieval from healthcare document repositories a challenging task. Text based retrieval is the conventional approach used for solving this problem. In this paper, the authors explore an alternate avenue of using embedded figures for the retrieval task. Usually, context of a document is directly reflected in the associated figures, therefore embedded text within these figures along with image features have been used for similarity based retrieval of figures. The present work demonstrates that image features describing the structural properties of figures are sufficient for the figure retrieval task. First, the authors analyze the problem of figure retrieval from biomedical literature and identify significant classes of figures. Second, they use edge information as a means to discriminate between structural properties of each figure category. Finally, the authors present a methodology using a novel feature descriptor namely Fourier Edge Orientation Autocorrelogram FEOAC to describe structural properties of figures and build an effective Biomedical document retrieval system. The experimental results demonstrate the better retrieval performance and overall improvement of FEOAC for figure retrieval task, especially when most of the edge information is retained. Apart from invariance to scale, rotation and non-uniform illumination, the proposed feature descriptor is shown to be relatively robust to noisy edges.","PeriodicalId":160270,"journal":{"name":"Int. J. Knowl. Discov. Bioinform.","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131952313","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
R. Alizadehsani, Mohammad Javad Hosseini, Reihane Boghrati, Asma Ghandeharioun, F. Khozeimeh, Z. Sani
{"title":"Exerting Cost-Sensitive and Feature Creation Algorithms for Coronary Artery Disease Diagnosis","authors":"R. Alizadehsani, Mohammad Javad Hosseini, Reihane Boghrati, Asma Ghandeharioun, F. Khozeimeh, Z. Sani","doi":"10.4018/jkdb.2012010104","DOIUrl":"https://doi.org/10.4018/jkdb.2012010104","url":null,"abstract":"One of the main causes of death the world over is the family of cardiovascular diseases, of which coronary artery disease CAD is a major type. Angiography is the principal diagnostic modality for the stenosis of heart arteries; however, it leads to high complications and costs. The present study conducted data-mining algorithms on the Z-Alizadeh Sani dataset, so as to investigate rule based and feature based classifiers and their comparison, and the reason for the effectiveness of a preprocessing algorithm on a dataset. Misclassification of diseased patients has more side effects than that of healthy ones. To this end, this paper employs 10-fold cross-validation on cost-sensitive algorithms along with base classifiers of Naive Bayes, Sequential Minimal Optimization SMO, K-Nearest Neighbors KNN, Support Vector Machine SVM, and C4.5 and the results show that the SMO algorithm yielded very high sensitivity 97.22% and accuracy 92.09% rates.","PeriodicalId":160270,"journal":{"name":"Int. J. Knowl. Discov. Bioinform.","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131757969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"K-NN Based Outlier Detection Technique on Intrusion Dataset","authors":"S. Sahu, S. K. Jena, Manish Verma","doi":"10.4018/IJKDB.2017010105","DOIUrl":"https://doi.org/10.4018/IJKDB.2017010105","url":null,"abstract":"Outliers in the database are the objects that deviate from the rest of the dataset by some measure. The Nearest Neighbor Outlier Factor is considering to measure the degree of outlier-ness of the object in the dataset. Unlike the other methods like Local Outlier Factor, this approach shows the interest of a point from both neighbors and reverse neighbors, and after that, an object comes into consideration. We have observed that in GBBK algorithm that based on K-NN, used quick sort to find k nearest neighbors that take O N log N time. However, in proposed method, the time required for searching on K times which complete in O KN time to find k nearest neighbors k < < log N. As a result, the proposed method improves the time complexity. The NSL-KDD and Fisher iris dataset is used, and experimental results compared with the GBBK method. The result is same in both the methods, but the proposed method takes less time for computation.","PeriodicalId":160270,"journal":{"name":"Int. J. Knowl. Discov. Bioinform.","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134147706","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}