Machine Learning Approach to Analyze the Sentiment of Airline Passengers’ Tweets
Authors: Shengyang Wu, Yi Gao
Journal: Transportation Research Record
Publication date: 2023-06-03
DOI: https://doi.org/10.1177/03611981231172948
As one of the most extensive social networking services, Twitter has more than 300 million active users as of 2022. Among its many functions, Twitter is now one of the go-to platforms for consumers to share their opinions about products or experiences, including flight services provided by commercial airlines. Using a machine learning approach, this study aimed to measure customer satisfaction by analyzing sentiments of tweets that mention airlines. Relevant tweets were retrieved from Twitter’s application programming interface and processed through tokenization and vectorization. After that, these processed vectors were passed into a pretrained machine learning classifier to predict the sentiments. In addition to sentiment analysis, we also performed a lexical analysis on the collected tweets to model keyword frequencies, which provided meaningful context to facilitate interpretation of the sentiments. We then applied time series methods such as Bollinger Bands to detect abnormalities in the sentiment data. Using historical records from January to July 2022, our approach was proven capable of capturing sudden and significant changes in passenger sentiments through the analysis of breakout points on the Bollinger upper and lower bounds. The methodology devised for this study has the potential to be developed into an application that could help airlines, along with other customer-facing businesses, efficiently detect abrupt changes in customer sentiments and consequently take appropriate mitigatory measures.
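The Bollinger Band breakout step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the window length, the band multiplier, or the form of the sentiment series, so everything here is an assumption made for illustration: the function name `bollinger_breakouts`, the 7-point trailing window, the multiplier `k = 2.0`, and the made-up daily mean sentiment scores. It is not the authors' implementation.

```python
from statistics import mean, pstdev


def bollinger_breakouts(scores, window=7, k=2.0):
    """Flag points that fall outside their Bollinger Bands.

    For each index i with a full trailing window (the current point
    included), the middle band is the window mean and the upper/lower
    bands sit k population standard deviations above/below it.
    Returns a list of (index, value, lower, upper) tuples.
    `window` and `k` are illustrative defaults, not values from the paper.
    """
    breakouts = []
    for i in range(window - 1, len(scores)):
        recent = scores[i - window + 1 : i + 1]  # trailing window ending at i
        middle = mean(recent)
        spread = k * pstdev(recent)
        lower, upper = middle - spread, middle + spread
        if scores[i] < lower or scores[i] > upper:
            breakouts.append((i, scores[i], lower, upper))
    return breakouts


# Hypothetical daily mean sentiment scores: stable, then one sharp drop.
daily_sentiment = [0.10, 0.12] * 5 + [-0.60] + [0.10, 0.12] * 2

for index, value, lower, upper in bollinger_breakouts(daily_sentiment):
    print(f"day {index}: score {value:.2f} outside [{lower:.2f}, {upper:.2f}]")
```

In this toy series only the sharp drop crosses the lower band. A larger window or a larger `k` makes the detector less sensitive, so both would need tuning against real sentiment data.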
About the journal:
Transportation Research Record: Journal of the Transportation Research Board is one of the most cited and prolific transportation journals in the world, offering unparalleled depth and breadth in the coverage of transportation-related topics. The TRR publishes approximately 70 issues annually of outstanding, peer-reviewed papers presenting research findings in policy, planning, administration, economics and financing, operations, construction, design, maintenance, safety, and more, for all modes of transportation. This site provides electronic access to a full compilation of papers since the 1996 series.