{"title":"Using Weka API for creating a custom classification application","authors":"R. Robu, Paul Arseni-Ailoi, D. Ungureanu-Anghel","doi":"10.1109/SACI58269.2023.10158580","DOIUrl":null,"url":null,"abstract":"In real world applications, classification models can be built by repeatedly going through the steps of data preprocessing, building classification models with different dedicated algorithms, testing the resulted models, until building a good enough classification model is achieved. The model built in this way can be used in order to perform predictions on new data, with a degree of trust resulted by testing the model. All the operations described above can be done with Weka, which is a very powerful machine learning tool. The authors consider that after going through the previous steps, for many real world applications, it would be beneficial to go through a supplementary step, of developing a custom application for data classification, specific to the realized study. The proposed application will use the Weka API, which will allow it to rebuild the selected classification model, save and load it in/from a binary file and then make predictions with the help of a user interface perfectly adapted to the data set used. The proposed application makes it easier to make predictions for a final beneficiary. The paper presents how such a custom classification application can be developed, using the Weka API and Java programming language. In order to validate the proposed solution, the authors have built a custom application for the classification of data of patients who may have heart diseases, starting from the Cleveland heart disease data set, obtained from the UCI Machine Learning Repository. The proposed application allows saving and loading the classification model through serialization and deserialization, so that building it in order to make predictions is necessary only once.","PeriodicalId":339156,"journal":{"name":"2023 IEEE 17th International Symposium on Applied Computational Intelligence and Informatics (SACI)","volume":"90 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE 17th International Symposium on Applied Computational Intelligence and Informatics (SACI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SACI58269.2023.10158580","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In real world applications, classification models can be built by repeatedly going through the steps of data preprocessing, building classification models with different dedicated algorithms, testing the resulted models, until building a good enough classification model is achieved. The model built in this way can be used in order to perform predictions on new data, with a degree of trust resulted by testing the model. All the operations described above can be done with Weka, which is a very powerful machine learning tool. The authors consider that after going through the previous steps, for many real world applications, it would be beneficial to go through a supplementary step, of developing a custom application for data classification, specific to the realized study. The proposed application will use the Weka API, which will allow it to rebuild the selected classification model, save and load it in/from a binary file and then make predictions with the help of a user interface perfectly adapted to the data set used. The proposed application makes it easier to make predictions for a final beneficiary. The paper presents how such a custom classification application can be developed, using the Weka API and Java programming language. In order to validate the proposed solution, the authors have built a custom application for the classification of data of patients who may have heart diseases, starting from the Cleveland heart disease data set, obtained from the UCI Machine Learning Repository. The proposed application allows saving and loading the classification model through serialization and deserialization, so that building it in order to make predictions is necessary only once.