Ahmed Banimustafa, S. Almatarneh, Olla Bulkrock, G. Samara, Mohammad Aljaidi
{"title":"预测众筹成功的数据科学方法","authors":"Ahmed Banimustafa, S. Almatarneh, Olla Bulkrock, G. Samara, Mohammad Aljaidi","doi":"10.1109/EICEEAI56378.2022.10050465","DOIUrl":null,"url":null,"abstract":"Crowdfunding is important for backing innovative projects and new startup businesses. However, success in achieving the target fundraising is a big challenge, and it depends on many complex factors. This work uses data science to predict the success of crowdfunding pledges using a historical dataset that was scrapped from the Kickstarter website. The dataset was subject to intensive data wrangling, exploration, and engineering procedures. Three machine learning models were constructed in this study using: (1) Random Forests (RF), (3) K-Nearest Neighbor (KNN), and Support Vector Machine (SVM) algorithms. The models were trained using a separate portion representing two-thirds of the dataset, while the remaining third was used for evaluation. The KNN model achieved the best performance with a classification accuracy of 97.9% and an AUC of 98.3%. Random Forests was the second-best model, with a classification accuracy of 94.9% and an AUC of 98.9%. The Precision, Recall, F1, and AUC metrics also confirmed the validity of the reported results, while the confusion matrix and the calibration curve confirmed the robustness of the constructed models.","PeriodicalId":426838,"journal":{"name":"2022 International Engineering Conference on Electrical, Energy, and Artificial Intelligence (EICEEAI)","volume":"177 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Data Science Approach for Predicting Crowdfunding Success\",\"authors\":\"Ahmed Banimustafa, S. Almatarneh, Olla Bulkrock, G. Samara, Mohammad Aljaidi\",\"doi\":\"10.1109/EICEEAI56378.2022.10050465\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Crowdfunding is important for backing innovative projects and new startup businesses. However, success in achieving the target fundraising is a big challenge, and it depends on many complex factors. This work uses data science to predict the success of crowdfunding pledges using a historical dataset that was scrapped from the Kickstarter website. The dataset was subject to intensive data wrangling, exploration, and engineering procedures. Three machine learning models were constructed in this study using: (1) Random Forests (RF), (3) K-Nearest Neighbor (KNN), and Support Vector Machine (SVM) algorithms. The models were trained using a separate portion representing two-thirds of the dataset, while the remaining third was used for evaluation. The KNN model achieved the best performance with a classification accuracy of 97.9% and an AUC of 98.3%. Random Forests was the second-best model, with a classification accuracy of 94.9% and an AUC of 98.9%. The Precision, Recall, F1, and AUC metrics also confirmed the validity of the reported results, while the confusion matrix and the calibration curve confirmed the robustness of the constructed models.\",\"PeriodicalId\":426838,\"journal\":{\"name\":\"2022 International Engineering Conference on Electrical, Energy, and Artificial Intelligence (EICEEAI)\",\"volume\":\"177 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Engineering Conference on Electrical, Energy, and Artificial Intelligence (EICEEAI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/EICEEAI56378.2022.10050465\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Engineering Conference on Electrical, Energy, and Artificial Intelligence (EICEEAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EICEEAI56378.2022.10050465","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Data Science Approach for Predicting Crowdfunding Success
Crowdfunding is important for backing innovative projects and new startup businesses. However, success in achieving the target fundraising is a big challenge, and it depends on many complex factors. This work uses data science to predict the success of crowdfunding pledges using a historical dataset that was scrapped from the Kickstarter website. The dataset was subject to intensive data wrangling, exploration, and engineering procedures. Three machine learning models were constructed in this study using: (1) Random Forests (RF), (3) K-Nearest Neighbor (KNN), and Support Vector Machine (SVM) algorithms. The models were trained using a separate portion representing two-thirds of the dataset, while the remaining third was used for evaluation. The KNN model achieved the best performance with a classification accuracy of 97.9% and an AUC of 98.3%. Random Forests was the second-best model, with a classification accuracy of 94.9% and an AUC of 98.9%. The Precision, Recall, F1, and AUC metrics also confirmed the validity of the reported results, while the confusion matrix and the calibration curve confirmed the robustness of the constructed models.