S. M. Basha, D. Rajput, N. Iyengar, Ronnie D. Caytiles
{"title":"A Novel Approach to Perform Analysis and Prediction on Breast Cancer Dataset using R","authors":"S. M. Basha, D. Rajput, N. Iyengar, Ronnie D. Caytiles","doi":"10.14257/IJGDC.2018.11.2.05","DOIUrl":null,"url":null,"abstract":"Screening shows impact on cancer mortality rate by decreasing the number of advanced cancers with poor diagnosis, while cancer treatment works through decreasing the case-fatality rate. The prediction of breast cancer survivability has been a challenging research problem for many researchers. The objective of this research work is to propose a Novel model that can analysis the Breast cancer data and do efficient prediction. The contributions made in this paper are as follows, we collected three different the dataset from UCI Machine Learning repositories. We propose an approach, where a detailed comparison made between feature selection algorithms. Trained the datasets using Decision Tree, Random Forest and Support vector machine (SVM) machine learning algorithms. An attempt made to understand the impact of model selection metric in predicting different classes of Brest cancer. The results indicated that the Random forest is the best predictor wit 0.98 accuracy on the holdout sample, SVM came out to be the second with 0.97 accuracy and the Decision Tree came out with 0.96 to be the worst of the four condition tree with 0.95 accuracy. Finally performed prediction using Neural Network with three hidden layers and measured the efficiency, using Root Mean Square Error (RMSE) along with its variations.","PeriodicalId":46000,"journal":{"name":"International Journal of Grid and Distributed Computing","volume":"11 1","pages":"41-54"},"PeriodicalIF":0.0000,"publicationDate":"2018-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Grid and Distributed Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.14257/IJGDC.2018.11.2.05","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 15
Abstract
Screening shows impact on cancer mortality rate by decreasing the number of advanced cancers with poor diagnosis, while cancer treatment works through decreasing the case-fatality rate. The prediction of breast cancer survivability has been a challenging research problem for many researchers. The objective of this research work is to propose a Novel model that can analysis the Breast cancer data and do efficient prediction. The contributions made in this paper are as follows, we collected three different the dataset from UCI Machine Learning repositories. We propose an approach, where a detailed comparison made between feature selection algorithms. Trained the datasets using Decision Tree, Random Forest and Support vector machine (SVM) machine learning algorithms. An attempt made to understand the impact of model selection metric in predicting different classes of Brest cancer. The results indicated that the Random forest is the best predictor wit 0.98 accuracy on the holdout sample, SVM came out to be the second with 0.97 accuracy and the Decision Tree came out with 0.96 to be the worst of the four condition tree with 0.95 accuracy. Finally performed prediction using Neural Network with three hidden layers and measured the efficiency, using Root Mean Square Error (RMSE) along with its variations.
期刊介绍:
IJGDC aims to facilitate and support research related to control and automation technology and its applications. Our Journal provides a chance for academic and industry professionals to discuss recent progress in the area of control and automation. To bridge the gap of users who do not have access to major databases where one should pay for every downloaded article; this online publication platform is open to all readers as part of our commitment to global scientific society. Journal Topics: -Architectures and Fabrics -Autonomic and Adaptive Systems -Cluster and Grid Integration -Creation and Management of Virtual Enterprises and Organizations -Dependable and Survivable Distributed Systems -Distributed and Large-Scale Data Access and Management -Distributed Multimedia Systems -Distributed Trust Management -eScience and eBusiness Applications -Fuzzy Algorithm -Grid Economy and Business Models -Histogram Methodology -Image or Speech Filtering -Image or Speech Recognition -Information Services -Large-Scale Group Communication -Metadata, Ontologies, and Provenance -Middleware and Toolkits -Monitoring, Management and Organization Tools -Networking and Security -Novel Distributed Applications -Performance Measurement and Modeling -Pervasive Computing -Problem Solving Environments -Programming Models, Tools and Environments -QoS and resource management -Real-time and Embedded Systems -Security and Trust in Grid and Distributed Systems -Sensor Networks -Utility Computing on Global Grids -Web Services and Service-Oriented Architecture -Wireless and Mobile Ad Hoc Networks -Workflow and Multi-agent Systems