{"title":"Stock market analysis from Twitter and news based on streaming big data infrastructure","authors":"C. Lee, Incheon Paik","doi":"10.1109/ICAWST.2017.8256469","DOIUrl":null,"url":null,"abstract":"Due to the rapid development of the web, services of social media and Internet of Things (IoT) are producing a huge volume of data in every second. This data is not only large, but also grows quickly and is difficult to analyze. Most of traditional big data framework can't process such data in real-time. For processing the data in real-time, many companies and researchers have started to develop new big data frameworks. The Apache Spark, Apache Flink and Apache Storm have been introduced for real-time data processing. With the new processing frameworks, it has become more efficient to analyze the streaming data. Stock market analysis is a hot issued domain to analyze the big streaming data. In this paper, we build a real-time processing system to analyze tweets for finding correlation with the stock market. System configuration, performance of our system is explained. With 77% accuracy of Twitter data classification, we got 80% of separation of increase/decrease of stock value.","PeriodicalId":378618,"journal":{"name":"2017 IEEE 8th International Conference on Awareness Science and Technology (iCAST)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE 8th International Conference on Awareness Science and Technology (iCAST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAWST.2017.8256469","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 10
Abstract
With the rapid development of the web, social media services and the Internet of Things (IoT) now produce a huge volume of data every second. This data is not only large but also grows quickly and is difficult to analyze. Most traditional big data frameworks cannot process such data in real time. To address this, many companies and researchers have started to develop new big data frameworks; Apache Spark, Apache Flink, and Apache Storm have been introduced for real-time data processing. With these frameworks, analyzing streaming data has become more efficient. Stock market analysis is an actively studied domain for big streaming data analysis. In this paper, we build a real-time processing system that analyzes tweets to find correlations with the stock market. We describe the system configuration and the performance of our system. With 77% accuracy in Twitter data classification, we obtained 80% separation between increases and decreases in stock value.