{"title":"使用Selenium支持向量机和非参数LDA对Facebook作为流量更新的众包平台进行探索和分析","authors":"Leodivino Lawas, Ken Gorro, Elmo Ranolo, A. Ilano","doi":"10.1145/3512576.3512617","DOIUrl":null,"url":null,"abstract":"Traffic is a major problem in the Philippines. Facebook is one of the social media platforms that is commonly used by Filipinos. Machine learning is a field of computer science that allows computers to perform tasks like human beings. In this study, the proponents explored Facebook as a source of traffic updates and as a source of traffic information. In this paper, as a partial result, a machine learning model was created to classify Facebook posts as related to traffic. To gather Facebook posts, a total of 1000 respondents were asked for consent to scrape their public post using the username link and selenium. The Support vector machine model was trained with 3000 Facebook posts. The SVM model was only trained to 3 classes {Road accident, Road activities and Other}. The SVM model was evaluated using 10-cross fold validation. The result shows that the accuracy is 76% and the recall is 69%. To analyze the narrative of the corpus, the Hierarchical Dirichlet Process model was created with the log-likelihood of -4.06 with 10 topic models. The following are the narratives of the corpus: {Traffic Management, Immediate Emergency Response, Seeking help, Busses causes majority of accidents.}","PeriodicalId":278114,"journal":{"name":"Proceedings of the 2021 9th International Conference on Information Technology: IoT and Smart City","volume":"211 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Exploring and Analyzing Facebook as crowdsourcing platform for traffic updates using Selenium Support Vector Machine and Non-parametric LDA\",\"authors\":\"Leodivino Lawas, Ken Gorro, Elmo Ranolo, A. Ilano\",\"doi\":\"10.1145/3512576.3512617\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Traffic is a major problem in the Philippines. Facebook is one of the social media platforms that is commonly used by Filipinos. Machine learning is a field of computer science that allows computers to perform tasks like human beings. In this study, the proponents explored Facebook as a source of traffic updates and as a source of traffic information. In this paper, as a partial result, a machine learning model was created to classify Facebook posts as related to traffic. To gather Facebook posts, a total of 1000 respondents were asked for consent to scrape their public post using the username link and selenium. The Support vector machine model was trained with 3000 Facebook posts. The SVM model was only trained to 3 classes {Road accident, Road activities and Other}. The SVM model was evaluated using 10-cross fold validation. The result shows that the accuracy is 76% and the recall is 69%. To analyze the narrative of the corpus, the Hierarchical Dirichlet Process model was created with the log-likelihood of -4.06 with 10 topic models. The following are the narratives of the corpus: {Traffic Management, Immediate Emergency Response, Seeking help, Busses causes majority of accidents.}\",\"PeriodicalId\":278114,\"journal\":{\"name\":\"Proceedings of the 2021 9th International Conference on Information Technology: IoT and Smart City\",\"volume\":\"211 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2021 9th International Conference on Information Technology: IoT and Smart City\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3512576.3512617\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2021 9th International Conference on Information Technology: IoT and Smart City","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3512576.3512617","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Exploring and Analyzing Facebook as crowdsourcing platform for traffic updates using Selenium Support Vector Machine and Non-parametric LDA
Traffic is a major problem in the Philippines. Facebook is one of the social media platforms that is commonly used by Filipinos. Machine learning is a field of computer science that allows computers to perform tasks like human beings. In this study, the proponents explored Facebook as a source of traffic updates and as a source of traffic information. In this paper, as a partial result, a machine learning model was created to classify Facebook posts as related to traffic. To gather Facebook posts, a total of 1000 respondents were asked for consent to scrape their public post using the username link and selenium. The Support vector machine model was trained with 3000 Facebook posts. The SVM model was only trained to 3 classes {Road accident, Road activities and Other}. The SVM model was evaluated using 10-cross fold validation. The result shows that the accuracy is 76% and the recall is 69%. To analyze the narrative of the corpus, the Hierarchical Dirichlet Process model was created with the log-likelihood of -4.06 with 10 topic models. The following are the narratives of the corpus: {Traffic Management, Immediate Emergency Response, Seeking help, Busses causes majority of accidents.}