Hela Ltifi, Mounir Ben Ayed, Ghada Trabelsi, A. Alimi
{"title":"Perspective Wall Technique for Visualizing and Interpreting Medical Data","authors":"Hela Ltifi, Mounir Ben Ayed, Ghada Trabelsi, A. Alimi","doi":"10.4018/jkdb.2012040104","DOIUrl":"https://doi.org/10.4018/jkdb.2012040104","url":null,"abstract":"Increasing the improvement of confidence and comprehensibility of medical data as well as the possibility of using the human capacities in medical pattern recognition is a significant interest for the coming years. In this context, we have created a visual knowledge discovery from databases application. It has been developed to efficiently and accurately understand a large collection of fixed and temporal patients' data in the Intensive Care Unit in order to prevent the nosocomial infection occurrence. It is based on data visualization technique which is the perspective wall. Its application is a good example of the usefulness of data visualization techniques in the medical domain.","PeriodicalId":160270,"journal":{"name":"Int. J. Knowl. Discov. Bioinform.","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114891740","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yan-Hui Li, Jian-Hui Li, Xin Song, Kai Feng, Y. Zhou
{"title":"Predicting Aging-Genes in Drosophila Melanogaster by Integrating Network Topological Features and Functional Categories","authors":"Yan-Hui Li, Jian-Hui Li, Xin Song, Kai Feng, Y. Zhou","doi":"10.4018/jkdb.2012040102","DOIUrl":"https://doi.org/10.4018/jkdb.2012040102","url":null,"abstract":"An important task of aging research is to find genes that regulate lifespan. Wet-lab identification of aging genes is tedious and labor-intensive activity. Developing an algorithm to predict aging genes will be greatly helpful. In this paper, we systematically analyzed topological features of proteins encoded by Drosophila melanogaster aging genes versus those encoded by non-aging genes in protein-protein interaction PPI network and found that aging genes are characterized by several network topological features such as higher in degrees. And aging genes tend to be enriched in certain functions were also found. Based on these features, an algorithm was developed to detect aging genes genome wide. With a posterior probability score describing possible involvement in aging no less than 1, 1014 novel aging genes were predicted by decision trees. Evidence supporting our prediction can be found.","PeriodicalId":160270,"journal":{"name":"Int. J. Knowl. Discov. Bioinform.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129622077","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
K. Kanagarajadurai, S. Kalaimathy, Paramasivam Nagarajan, R. Sowdhamini
{"title":"PASS2: A Database of Structure-Based Sequence Alignments of Protein Structural Domain Superfamilies","authors":"K. Kanagarajadurai, S. Kalaimathy, Paramasivam Nagarajan, R. Sowdhamini","doi":"10.4018/jkdb.2011100104","DOIUrl":"https://doi.org/10.4018/jkdb.2011100104","url":null,"abstract":"A detailed comparison of protein domains that belong to families and superfamilies shows that structure is better conserved than sequence during evolutionary divergence. Sequence alignments, guided by structural features, permit a better sampling of the protein sequence space and effective construction of libraries for fold recognition. Sequence alignments are useful evolutionary models in defining structure-function relationships for protein superfamilies. The PASS2 database, maintained by the authors, presents alignments of proteins related at the superfamily level and characterised by low sequence similarity. The number of new superfamilies increased to 47% compared with the previous PASS2 version, which shows the crucial importance of updating the PASS2 database. In the current release of the PASS2 database, they align protein superfamilies using a structural alignment protocol. The authors also introduce two alignment assessment methods that depend on the average structural deviations of domains and the extent of conserved secondary structures. They also integrate new and important structural and sequence features at the superfamily level into the database. These features are conserved-unconserved blocks in proteins, spatial distribution of sequences using principal component analysis and a statistical view for each superfamily. The authors suggest that highly structurally deviant superfamily members could be removed as outliers, so that such extreme distant relationships will not obscure the alignment. They report a nearly-automated, updated version of the superfamily alignment database, consisting of 1776 superfamilies and 9536 protein domains, that is in direct correspondence with the SCOP (1.73) database.","PeriodicalId":160270,"journal":{"name":"Int. J. Knowl. Discov. Bioinform.","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115320019","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Knowledge Discovery and Multimodal Inputs for Driving an Intelligent Wheelchair","authors":"B. Faria, Luis Paulo Reis, N. Lau","doi":"10.4018/JKDB.2011100102","DOIUrl":"https://doi.org/10.4018/JKDB.2011100102","url":null,"abstract":"Cerebral Palsy is defined as a group of permanent disorders in the development of movement and posture. The motor disorders in cerebral palsy are associated with deficits of perception, cognition, communication, and behaviour, which can affect autonomy and independence. The interface between the user and an intelligent wheelchair can be done with several input devices such as joysticks, microphones, and brain computer interfaces (BCI). BCI enables interaction between users and hardware systems through the recognition of brainwave activity. The current BCI systems have very low accuracy on the recognition of facial expressions and thoughts, making it difficult to use these devices to enable safe and robust commands of complex devices like an Intelligent Wheelchair. This paper presents an approach to expand the use of a brain computer interface for driving an intelligent wheelchair by patients suffering from cerebral palsy. The ability with the joystick, head movements, and voice inputs were tested, and the best possibility for driving the wheelchair is given to a specific user. Experiments were performed using 30 individuals suffering from IV and V degrees of cerebral palsy on the Gross Motor Function (GMF) measure. The results show that the pre-processing and variable selection methods are effective to improve the results of a commercial BCI product by 57%. With the developed system, it is also possible for users to perform a circuit in a simulated environment using just facial expressions and thoughts.","PeriodicalId":160270,"journal":{"name":"Int. J. Knowl. Discov. Bioinform.","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121268575","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Identification Methods of G Protein-Coupled Receptors","authors":"Meriem Zekri, K. Alem, L. Souici-Meslati","doi":"10.4018/jkdb.2011100103","DOIUrl":"https://doi.org/10.4018/jkdb.2011100103","url":null,"abstract":"The G protein-coupled receptors (GPCRs) include one of the largest and most important families of multifunctional proteins known to molecular biology. They play a key role in cell signaling networks that regulate many physiological processes, such as vision, smell, taste, neurotransmission, secretion, immune responses, metabolism, and cell growth. These proteins are thus very important for understanding human physiology and they are involved in several diseases. Therefore, many efforts in pharmaceutical research are to understand their structures and functions, which is not an easy task, because although thousands GPCR sequences are known, many of them remain orphans. To remedy this, many methods have been developed using methods such as statistics, machine learning algorithms, and bio-inspired approaches. In this article, the authors review the approaches used to develop algorithms for classification GPCRs by trying to highlight the strengths and weaknesses of these different approaches and providing a comparison of their performances.","PeriodicalId":160270,"journal":{"name":"Int. J. Knowl. Discov. Bioinform.","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134069822","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Trend Analysis of Length of Stay Data via Phase-Type Models","authors":"T. Le, C. Kwoh, K. Lee, Eng Soon Teo","doi":"10.4018/jkdb.2011070103","DOIUrl":"https://doi.org/10.4018/jkdb.2011070103","url":null,"abstract":"The populations in many developed countries throughout the world are aging rapidly and the number of geriatric patients is expected to rise steeply in those countries. This will exert greater pressures on the management of hospital resources as a result. Hospital length of stay (LOS) is an important indicator of hospital activity and management because of its direct relation to resource consumption. Planning of hospital resources according to identified trends of LOS is, thus, an effective way to meet such future needs. In this paper, the authors propose a method to analyze the temporal trends of LOS based on the Coxian phase-type distributions, a special type of continuous-time Markov process. By fitting and regressing the probabilities of discharge from each phase of the distribution on time, the authors have found a growing trend in the proportion of long-staying patients in their sample of stroke patients from a general hospital in Singapore. The authors compare the yearly, quarterly and monthly trends over the same period to see the common pattern. The datasets were also robustified by bootstrapping to aid the analysis.","PeriodicalId":160270,"journal":{"name":"Int. J. Knowl. Discov. Bioinform.","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129684213","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Célia Talma Gonçalves, Rui Camacho, Eugénio C. Oliveira
{"title":"BioTextRetriever: A Tool to Retrieve Relevant Papers","authors":"Célia Talma Gonçalves, Rui Camacho, Eugénio C. Oliveira","doi":"10.4018/jkdb.2011070102","DOIUrl":"https://doi.org/10.4018/jkdb.2011070102","url":null,"abstract":"Whenever new sequences of DNA or proteins have been decoded it is almost compulsory to look at similar sequences and papers describing those sequences in order to both collect relevant information concerning the function and activity of the new sequences and/or know what is known already about similar sequences. In current web sites and data bases of sequences there are, usually, a set of curated paper references linked to each sequence. Those links are a good starting point to look for relevant information related to a set of sequences. One way to implement such approach is to do a blast with the new decoded sequences, and collect similar sequences. Then one looks at the papers linked with the similar sequences. Most often the number of retrieved papers is small and one has to search large data bases for relevant papers. This paper proposes a process of generating a classifier based on the initially set of relevant papers. First, the authors collect similar sequences using an alignment algorithm like Blast. Then, the authors use the enlarges set of papers to construct a classifier. Finally a classifier is used to automatically enlarge the set of relevant papers by searching the MEDLINE using the automatically constructed classifier.","PeriodicalId":160270,"journal":{"name":"Int. J. Knowl. Discov. Bioinform.","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125382886","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yi Mao, Yixin Chen, Gregory Hackmann, Minmin Chen, Chenyang Lu, M. Kollef, T. Bailey
{"title":"Early Deterioration Warning for Hospitalized Patients by Mining Clinical Data","authors":"Yi Mao, Yixin Chen, Gregory Hackmann, Minmin Chen, Chenyang Lu, M. Kollef, T. Bailey","doi":"10.4018/jkdb.2011070101","DOIUrl":"https://doi.org/10.4018/jkdb.2011070101","url":null,"abstract":"Data mining on medical data has great potential to improve the treatment quality of hospitals and increase the survival rate of patients. Every year, 4-17% of patients undergo cardiopulmonary or respiratory arrest while in hospitals. Clinical study has found early detection and intervention to be essential for preventing clinical deterioration in patients at general hospital units. This paper proposes an early warning system (EWS) designed to identify the signs of clinical deterioration and provide early warning for serious clinical events. The EWS is designed to provide reliable early alarms for patients at the general hospital wards (GHWs). The main task of EWS is a challenging classification problem on high-dimensional stream data with irregular, multi-scale data gaps, measurement errors, outliers, and class imbalance. This paper proposes a novel data mining framework for analyzing such medical data streams. The authors assess the feasibility of the proposed EWS approach through retrospective study that includes data from 41,503 visits at a major hospital. Finally, the system is applied in a clinical trial at a major hospital and obtains promising results. This project is an example of multidisciplinary cyber-physical systems involving researchers in clinical science, data mining, and nursing staff.","PeriodicalId":160270,"journal":{"name":"Int. J. Knowl. Discov. Bioinform.","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130883322","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Medical Survival Analysis Through Transduction of Semi-Supervised Regression Targets","authors":"F. Khan, Qiuhua Liu","doi":"10.4018/jkdb.2011070104","DOIUrl":"https://doi.org/10.4018/jkdb.2011070104","url":null,"abstract":"A crucial challenge in predictive modeling for survival analysis applications such as medical prognosis is the accounting of censored observations in the data. While these time-to-event predictions inherently represent a regression problem, traditional regression approaches are challenged by the censored characteristics of the data. In such problems the true target times of a majority of instances are unknown; what is known is a censored target representing some indeterminate time before the true target time. While censored samples can be considered as semi-supervised targets, the current limited efforts in semi-supervised regression do not take into account the partial nature of unsupervised information; samples are treated as either fully labeled or unlabelled. This paper presents a novel semi-supervised learning approach where the true target times are approximated from the censored times through transduction. The method can be employed to transform traditional regression methods for survival analysis, or can be employed to enhance existing state-of-the-art survival analysis methods for improved predictive performance. The proposed approach represents one of the first applications of semi-supervised regression to survival analysis and yields a significant improvement in performance over the state-of-the-art in prostate and breast cancer prognosis applications.","PeriodicalId":160270,"journal":{"name":"Int. J. Knowl. Discov. Bioinform.","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125302435","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Association Rule Mining Based HotSpot Analysis on SEER Lung Cancer Data","authors":"Ankit Agrawal, A. Choudhary","doi":"10.4018/jkdb.2011040103","DOIUrl":"https://doi.org/10.4018/jkdb.2011040103","url":null,"abstract":"The authors analyze the lung cancer data available from the SEER program with the aim of identifying hotspots using association rule mining techniques. A subset of 13 patient attributes from the SEER data were recently linked with the survival outcome using prediction models, which is used in this study for segmentation. The goal here is to identify characteristics of patient segments where average survival is significantly higher/lower than average survival across the entire dataset. Automated association rule mining techniques resulted in hundreds of rules, from which many redundant rules were manually removed based on domain knowledge. Further, association rule mining based hotspot analysis was also conducted for conditional survival patient data, i.e., in cases where patients have already survived for a year after diagnosis. The resulting rules conform with existing biomedical knowledge and provide interesting insights into lung cancer survival.","PeriodicalId":160270,"journal":{"name":"Int. J. Knowl. Discov. Bioinform.","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116481907","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}