{"title":"Research on Pre-release Prediction Model of Chinese High Quality Films","authors":"Yong Huang, Haoyu Wang, Liangliang Zhao, Feng Wang, Weijing Huang, Jinjiang Yan","doi":"10.1109/IHMSC52134.2021.00021","DOIUrl":null,"url":null,"abstract":"By analyzing the big data of Chinese-language films, we can predict high-quality films that are both “good and popular” before they are released. Firstly, collect data of 1876 Chinese-language films released in 2009-2017, and then select indicators that affect box office and score from four aspects: film people, plot, release time and synthesize, propose a prerelease prediction model. Finally, the model is trained by using 6 methods such as Random Forest to predict the “high quality or not” of the films. The results show that the best prediction performance comes from Random Forest, with an accuracy rate of 78.29% and an AUC value of 0.831. The model can provide decision-making reference for investors to invest in high quality films and bring more high quality films to the market.","PeriodicalId":380011,"journal":{"name":"2021 13th International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 13th International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IHMSC52134.2021.00021","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
By analyzing the big data of Chinese-language films, we can predict high-quality films that are both “good and popular” before they are released. Firstly, collect data of 1876 Chinese-language films released in 2009-2017, and then select indicators that affect box office and score from four aspects: film people, plot, release time and synthesize, propose a prerelease prediction model. Finally, the model is trained by using 6 methods such as Random Forest to predict the “high quality or not” of the films. The results show that the best prediction performance comes from Random Forest, with an accuracy rate of 78.29% and an AUC value of 0.831. The model can provide decision-making reference for investors to invest in high quality films and bring more high quality films to the market.