Shruti Daggumati, Igor Soares, Jieting Wu, D. Cao, Hongfeng Yu, Jun Wang
{"title":"Tweether:一个可视化工具,显示天气与tweet的相关性","authors":"Shruti Daggumati, Igor Soares, Jieting Wu, D. Cao, Hongfeng Yu, Jun Wang","doi":"10.2352/ISSN.2470-1173.2016.1.VDA-497","DOIUrl":null,"url":null,"abstract":"As the generation of social media, we can instantly express how our day is going; however, unknowingly the weather can play a key role in how we are feeling. The weather may dictate our lives regardless of what may be happening. The relationship between weather and mood has been immensely studied to show that the weather does play a major factor regarding our emotions. However, how we visualize the relationship and influence between weather and human emotions remains an interesting question. Based on the natural correlation between weather and mood, we propose Tweether, a real-time weather and tweet visualization tool, to see how Twitter users feel regarding the weather they experience. Our visualization displays a current reflection of emotions in a set of select geographic regions and also predicts possible emotions in these regions in response to the weather forecast. The visualization uses multiple layers to show the connection between geolocations, weather, and emotions. By aggregating multiple users with emotions, we create an aesthetic design in a 3D manner that is relatively free of visual clutter and it is simple to understand the relationships between weather and emotions. Introduction Weather affects our daily lives, from what we wear, what activities we do, what type of transportation we use, what we eat, or even how we feel. With the increasing accuracy of weather forecasts, people can gain an idea on the type of weather they can expect for upcoming days. Activities are usually planned according to the weather outside (e.g., weddings) and alternative plans must be made in case of inclement weather. How people dress is also affected by weather; when the temperature drops people need to wear coats to stay warm. The economy is also greatly affected by the weather. Certain weather conditions can lower crop yield and cause higher prices in stores. Disastrous weather phenomena such as hurricanes, tornadoes, or even floods can cause devastation in communities resulting in homelessness, death, and destruction. Inclement weather can also cause delays in transportation on roads or via flights. We can also choose to ride our bike to work instead of driving the car if the temperature is warm enough. One thing that is an effect of all these items is how we feel: • Are you sad that you cannot enjoy the outdoors due to rain? • Do you love that it’s raining so you can bundle up and read your favorite book? • Do you love the snow because it’s close to Christmas? • Do you hate the winter because you want it to be spring? These feelings are all brought out by the weather outside. One person can feel positive about a certain type of weather and one person can feel negative. Categorizing similar feelings from a large number of people may reveal some useful patterns that can help decision makers or shareholders make more appropriate decisions (e.g., carnival or sport game arrangements) with respect to weather conditions. Social media researchers can also gain benefits from these patterns by getting insights into how population behaviors may change with weather variation. However, rare visualization research exits to show the correlation between human sentiments and weather conditions. Furthermore, it remains a challenging task to depict massive and complex relationships among many objects. In this work, we present a novel tool, named Tweether, a visualization of real-time Twitter and weather data to show the feelings of current users and how their emotions could fluctuate regarding the weather. We also develop a prediction model that predicts the emotions with respect to the weather forecast. Our work showcases a 3D map which highlights select clusters of weather. The correlation of tweets to weather is represented by a graph. We use texture-based edge bundling to visualize the graph for reducing visual clutter. Introducing a clear relationship between weather and tweets, the design presents a natural manner of representing correlation and revealing high-level patterns. By using Tweether, the end-users, such as decision makers and social media researchers, can explore if the classified weather has any correlation to the majority of the population. Tweether can be used to examine if the majority of the population follows certain patterns, and how the overall sentiment changes in regards to weather. In addition, the prediction model predicts the future feelings of given weather so that public preferences could be foreseen for certain decision making. Related Work Visualizations correlating sentiment and weather are highly sparse, and the presence of live visualizations is also non-existent. However, there are studies showing the two portions of this work. Clustering of weather data has been done many times in the past. There are also a vast number of visualizations which indicate the sentiment of different locations. Psychological Studies The natural correlation of weather with emotion has been studied profusely [5, 11, 15, 17]. With various factors among the different research, the conclusions attained were broad. In general, humidity, sunshine, and temperature have the greatest effect on mood [11]. In some research on weather and mood, correlations have been debunked. There is no consistency due to seasons and time spent outside [5]. Having certain emotions regarding the season has strong links to seasonal affective disorder (SAD), where people are depressed in regards to changes of the seasons, which usually occurs during the winter. However, most psychologists believe that the weather has an impact on psychological intentions [15]. When observing serotonin levels in regards to sunshine there were strong relationships to being happy [17]. It has been found that weather may not play a big role in the positive attitude, but the negative attitude can have a correlation with weather [11]. Social Media, Weather, and Emotions Few works present the use of Twitter data for their social media feed and some form of weather data. In comparison to our work, the following research used past data and developed a 2D graph implementation for visualization. Work has been done using two to four years of Twitter data and correlating it with meteorological data from NOAA [8, 18]. Using urban areas in the United States as the area of interest, the tweets are passed to a sentiment analyzer that has a multilevel process. The researchers first determine keywords which are identified from public events (e.g., entertainment or natural disasters), then identify the mood state, and finally assign sentiment scores. To correlate the weather with the tweets they use a Generalized Mixed Model to display the non-linear relationship between emotion and weather. Using multiple variables for weather (temperature, temperature change, precipitation, snow depth, wind speed, solar energy, and hail), they determine the connection to hostility-anger, depression-dejection, fatigue-inertia, and sleepiness-freshness. Their results indicate that the warmer temperatures create an angrier atmosphere, lower depression, and less sleepiness, and they determine the influence of temperature to mood is trivial. Their visualization is limited to graphs [23]. Other than using urban areas in USA the researchers see relationships between temperature, humidity, and atmospheric pressure for tweets in the United States and weather data from Weather Underground. Using Linguistic Inquiry and Word Count for sentiment classification they see a pattern with temperature and emotion of every state in the United States. Using regression analysis, they find that the warmer states have a happier mood than the colder states. Their visualization is limited to a bar chart. These works are limited in visualization and usability study. Using past data is useful for our training and testing model; however, having a live view of what Twitter users feel is what we aim for in this work. Sentiment Analysis in Social Media Sentiment analysis has been studied vastly. There are various methods to detect the sentiment of a sentence. However, in regards to tweets sentences may be incomplete, because a tweet is limited to 140 characters and the need to express oneself is limited to short meaningful phrases. We find abbreviations, neologisms, acronyms, hashtags, emotions, and URL’s throughout most tweets. Certain features need to be extracted and some need to be filtered out. Filtering of URL’s, usernames, and Twitter special words may be needed in certain scenarios [20]. Stop words (i.e., a, an, the, etc.) are also removed due to not adding any extra sentiment information. For classifying the tweets, a number of different methods are used, and the most prevalent one is the Naive Bayes classifier [20, 22]. Emotions are used as basis for sentiment classification for classifying tweets as positive or negative for the training purposes [20, 1]. Visualizations Clustering Although clustering of data has been extensively studied, it remains a non-trivial task to deal with temporal and time series data. Weather data needs to be clustered based on values, proximities, and changes throughout the given time span. There have been various visualization techniques for time series data. Visualizing time series data using spirals for large data sets can better identify periodic structures in data [27]. Using wavelet to transform data along a multi-resolution temporal representation to find clusters with similar trends is a useful method for exploring data in a time series fashion [28]. Applying smooth data histograms for visualizing clusters in self-organizing maps is a simple method for 2D data sets [21]. The mass majority of clustering visualizations uses k-means clustering on the basis of their algorithms [27, 28]. We choose to follow this pattern as well, as k-means is a widely used algorithm that gives reasonably good results [27, 13]. Bundling The correlation between multiple entities (e.g., various weather and","PeriodicalId":89305,"journal":{"name":"Visualization and data analysis","volume":"57 1","pages":"1-10"},"PeriodicalIF":0.0000,"publicationDate":"2016-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Tweether: A Visualization Tool Displaying Correlation of Weather to Tweets\",\"authors\":\"Shruti Daggumati, Igor Soares, Jieting Wu, D. Cao, Hongfeng Yu, Jun Wang\",\"doi\":\"10.2352/ISSN.2470-1173.2016.1.VDA-497\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"As the generation of social media, we can instantly express how our day is going; however, unknowingly the weather can play a key role in how we are feeling. The weather may dictate our lives regardless of what may be happening. The relationship between weather and mood has been immensely studied to show that the weather does play a major factor regarding our emotions. However, how we visualize the relationship and influence between weather and human emotions remains an interesting question. Based on the natural correlation between weather and mood, we propose Tweether, a real-time weather and tweet visualization tool, to see how Twitter users feel regarding the weather they experience. Our visualization displays a current reflection of emotions in a set of select geographic regions and also predicts possible emotions in these regions in response to the weather forecast. The visualization uses multiple layers to show the connection between geolocations, weather, and emotions. By aggregating multiple users with emotions, we create an aesthetic design in a 3D manner that is relatively free of visual clutter and it is simple to understand the relationships between weather and emotions. Introduction Weather affects our daily lives, from what we wear, what activities we do, what type of transportation we use, what we eat, or even how we feel. With the increasing accuracy of weather forecasts, people can gain an idea on the type of weather they can expect for upcoming days. Activities are usually planned according to the weather outside (e.g., weddings) and alternative plans must be made in case of inclement weather. How people dress is also affected by weather; when the temperature drops people need to wear coats to stay warm. The economy is also greatly affected by the weather. Certain weather conditions can lower crop yield and cause higher prices in stores. Disastrous weather phenomena such as hurricanes, tornadoes, or even floods can cause devastation in communities resulting in homelessness, death, and destruction. Inclement weather can also cause delays in transportation on roads or via flights. We can also choose to ride our bike to work instead of driving the car if the temperature is warm enough. One thing that is an effect of all these items is how we feel: • Are you sad that you cannot enjoy the outdoors due to rain? • Do you love that it’s raining so you can bundle up and read your favorite book? • Do you love the snow because it’s close to Christmas? • Do you hate the winter because you want it to be spring? These feelings are all brought out by the weather outside. One person can feel positive about a certain type of weather and one person can feel negative. Categorizing similar feelings from a large number of people may reveal some useful patterns that can help decision makers or shareholders make more appropriate decisions (e.g., carnival or sport game arrangements) with respect to weather conditions. Social media researchers can also gain benefits from these patterns by getting insights into how population behaviors may change with weather variation. However, rare visualization research exits to show the correlation between human sentiments and weather conditions. Furthermore, it remains a challenging task to depict massive and complex relationships among many objects. In this work, we present a novel tool, named Tweether, a visualization of real-time Twitter and weather data to show the feelings of current users and how their emotions could fluctuate regarding the weather. We also develop a prediction model that predicts the emotions with respect to the weather forecast. Our work showcases a 3D map which highlights select clusters of weather. The correlation of tweets to weather is represented by a graph. We use texture-based edge bundling to visualize the graph for reducing visual clutter. Introducing a clear relationship between weather and tweets, the design presents a natural manner of representing correlation and revealing high-level patterns. By using Tweether, the end-users, such as decision makers and social media researchers, can explore if the classified weather has any correlation to the majority of the population. Tweether can be used to examine if the majority of the population follows certain patterns, and how the overall sentiment changes in regards to weather. In addition, the prediction model predicts the future feelings of given weather so that public preferences could be foreseen for certain decision making. Related Work Visualizations correlating sentiment and weather are highly sparse, and the presence of live visualizations is also non-existent. However, there are studies showing the two portions of this work. Clustering of weather data has been done many times in the past. There are also a vast number of visualizations which indicate the sentiment of different locations. Psychological Studies The natural correlation of weather with emotion has been studied profusely [5, 11, 15, 17]. With various factors among the different research, the conclusions attained were broad. In general, humidity, sunshine, and temperature have the greatest effect on mood [11]. In some research on weather and mood, correlations have been debunked. There is no consistency due to seasons and time spent outside [5]. Having certain emotions regarding the season has strong links to seasonal affective disorder (SAD), where people are depressed in regards to changes of the seasons, which usually occurs during the winter. However, most psychologists believe that the weather has an impact on psychological intentions [15]. When observing serotonin levels in regards to sunshine there were strong relationships to being happy [17]. It has been found that weather may not play a big role in the positive attitude, but the negative attitude can have a correlation with weather [11]. Social Media, Weather, and Emotions Few works present the use of Twitter data for their social media feed and some form of weather data. In comparison to our work, the following research used past data and developed a 2D graph implementation for visualization. Work has been done using two to four years of Twitter data and correlating it with meteorological data from NOAA [8, 18]. Using urban areas in the United States as the area of interest, the tweets are passed to a sentiment analyzer that has a multilevel process. The researchers first determine keywords which are identified from public events (e.g., entertainment or natural disasters), then identify the mood state, and finally assign sentiment scores. To correlate the weather with the tweets they use a Generalized Mixed Model to display the non-linear relationship between emotion and weather. Using multiple variables for weather (temperature, temperature change, precipitation, snow depth, wind speed, solar energy, and hail), they determine the connection to hostility-anger, depression-dejection, fatigue-inertia, and sleepiness-freshness. Their results indicate that the warmer temperatures create an angrier atmosphere, lower depression, and less sleepiness, and they determine the influence of temperature to mood is trivial. Their visualization is limited to graphs [23]. Other than using urban areas in USA the researchers see relationships between temperature, humidity, and atmospheric pressure for tweets in the United States and weather data from Weather Underground. Using Linguistic Inquiry and Word Count for sentiment classification they see a pattern with temperature and emotion of every state in the United States. Using regression analysis, they find that the warmer states have a happier mood than the colder states. Their visualization is limited to a bar chart. These works are limited in visualization and usability study. Using past data is useful for our training and testing model; however, having a live view of what Twitter users feel is what we aim for in this work. Sentiment Analysis in Social Media Sentiment analysis has been studied vastly. There are various methods to detect the sentiment of a sentence. However, in regards to tweets sentences may be incomplete, because a tweet is limited to 140 characters and the need to express oneself is limited to short meaningful phrases. We find abbreviations, neologisms, acronyms, hashtags, emotions, and URL’s throughout most tweets. Certain features need to be extracted and some need to be filtered out. Filtering of URL’s, usernames, and Twitter special words may be needed in certain scenarios [20]. Stop words (i.e., a, an, the, etc.) are also removed due to not adding any extra sentiment information. For classifying the tweets, a number of different methods are used, and the most prevalent one is the Naive Bayes classifier [20, 22]. Emotions are used as basis for sentiment classification for classifying tweets as positive or negative for the training purposes [20, 1]. Visualizations Clustering Although clustering of data has been extensively studied, it remains a non-trivial task to deal with temporal and time series data. Weather data needs to be clustered based on values, proximities, and changes throughout the given time span. There have been various visualization techniques for time series data. Visualizing time series data using spirals for large data sets can better identify periodic structures in data [27]. Using wavelet to transform data along a multi-resolution temporal representation to find clusters with similar trends is a useful method for exploring data in a time series fashion [28]. Applying smooth data histograms for visualizing clusters in self-organizing maps is a simple method for 2D data sets [21]. The mass majority of clustering visualizations uses k-means clustering on the basis of their algorithms [27, 28]. We choose to follow this pattern as well, as k-means is a widely used algorithm that gives reasonably good results [27, 13]. Bundling The correlation between multiple entities (e.g., various weather and\",\"PeriodicalId\":89305,\"journal\":{\"name\":\"Visualization and data analysis\",\"volume\":\"57 1\",\"pages\":\"1-10\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Visualization and data analysis\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2352/ISSN.2470-1173.2016.1.VDA-497\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Visualization and data analysis","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2352/ISSN.2470-1173.2016.1.VDA-497","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
人们对天气与情绪之间的自然关联进行了大量研究[5,11,15,17]。由于不同的研究中存在不同的因素,得出的结论是广泛的。一般来说,湿度、阳光和温度对情绪的影响最大。在一些关于天气和情绪的研究中,相关性被揭穿了。由于季节和在户外度过的时间,没有一致性。对季节有特定的情绪与季节性情感障碍(SAD)有很强的联系,人们对季节的变化感到沮丧,通常发生在冬天。然而,大多数心理学家认为天气对心理意向有影响。当观察与阳光有关的血清素水平时,发现与快乐有很强的关系。研究发现,天气对积极态度的影响可能不大,但消极态度可能与天气有关。社交媒体、天气和情绪很少有作品将Twitter数据用于其社交媒体feed和某种形式的天气数据。与我们的工作相比,下面的研究使用了过去的数据,并开发了一个2D图形实现可视化。这项工作使用了两到四年的Twitter数据,并将其与NOAA的气象数据相关联[8,18]。使用美国的城市地区作为感兴趣的区域,tweet被传递给具有多层过程的情感分析器。研究人员首先确定从公共事件(例如,娱乐或自然灾害)中识别的关键词,然后确定情绪状态,最后分配情绪分数。为了将天气和推文联系起来,他们使用了一个广义混合模型来显示情绪和天气之间的非线性关系。使用天气的多个变量(温度、温度变化、降水、雪深、风速、太阳能和冰雹),他们确定了敌意-愤怒、抑郁-沮丧、疲劳-惯性和嗜睡-新鲜感之间的联系。他们的研究结果表明,温度越高,气氛越愤怒,抑郁情绪越低,困倦程度越低,他们认为温度对情绪的影响是微不足道的。它们的可视化仅限于图形b[23]。除了使用美国的城市地区,研究人员还看到了美国推特上的温度、湿度和大气压与地下天气数据之间的关系。使用语言调查和单词计数进行情绪分类,他们看到了美国每个州的温度和情绪模式。通过回归分析,他们发现温暖的国家比寒冷的国家有更快乐的情绪。他们的可视化仅限于条形图。这些工作在可视化和可用性研究方面是有限的。使用过去的数据对我们的训练和测试模型是有用的;然而,实时了解Twitter用户的感受是我们在这项工作中的目标。情感分析在社交媒体中的应用已经得到了广泛的研究。检测句子情感的方法有很多种。然而,对于tweets来说,句子可能是不完整的,因为tweet的长度限制在140个字符,表达自己的需要也仅限于有意义的短语。我们在大多数推文中发现了缩写、新词、首字母缩略词、标签、情感和URL。有些特征需要提取,有些特征需要过滤掉。在某些情况下,可能需要过滤URL、用户名和Twitter特殊单词[20]。停止词(即,a, an, the等)也被删除,因为没有添加任何额外的情感信息。对于推文的分类,使用了许多不同的方法,最流行的是朴素贝叶斯分类器[20,22]。情感是情感分类的基础,用于将tweet分类为积极或消极,以达到训练目的[20,1]。虽然数据的聚类已经得到了广泛的研究,但处理时间序列和时间序列数据仍然是一项非常重要的任务。天气数据需要根据给定时间范围内的值、接近度和变化进行聚类。对于时间序列数据已经有了各种各样的可视化技术。对大数据集使用螺旋来可视化时间序列数据可以更好地识别数据[27]中的周期性结构。使用小波变换数据沿着多分辨率时间表示来寻找具有相似趋势的聚类是一种探索时间序列时尚[28]数据的有用方法。应用平滑数据直方图对自组织映射中的聚类进行可视化是一种简单的二维数据集可视化方法。绝大多数聚类可视化在其算法的基础上使用k-means聚类[27,28]。 我们也选择遵循这种模式,因为k-means是一种广泛使用的算法,可以给出相当好的结果[27,13]。多个实体(例如,各种天气和天气)之间的关联
Tweether: A Visualization Tool Displaying Correlation of Weather to Tweets
As the generation of social media, we can instantly express how our day is going; however, unknowingly the weather can play a key role in how we are feeling. The weather may dictate our lives regardless of what may be happening. The relationship between weather and mood has been immensely studied to show that the weather does play a major factor regarding our emotions. However, how we visualize the relationship and influence between weather and human emotions remains an interesting question. Based on the natural correlation between weather and mood, we propose Tweether, a real-time weather and tweet visualization tool, to see how Twitter users feel regarding the weather they experience. Our visualization displays a current reflection of emotions in a set of select geographic regions and also predicts possible emotions in these regions in response to the weather forecast. The visualization uses multiple layers to show the connection between geolocations, weather, and emotions. By aggregating multiple users with emotions, we create an aesthetic design in a 3D manner that is relatively free of visual clutter and it is simple to understand the relationships between weather and emotions. Introduction Weather affects our daily lives, from what we wear, what activities we do, what type of transportation we use, what we eat, or even how we feel. With the increasing accuracy of weather forecasts, people can gain an idea on the type of weather they can expect for upcoming days. Activities are usually planned according to the weather outside (e.g., weddings) and alternative plans must be made in case of inclement weather. How people dress is also affected by weather; when the temperature drops people need to wear coats to stay warm. The economy is also greatly affected by the weather. Certain weather conditions can lower crop yield and cause higher prices in stores. Disastrous weather phenomena such as hurricanes, tornadoes, or even floods can cause devastation in communities resulting in homelessness, death, and destruction. Inclement weather can also cause delays in transportation on roads or via flights. We can also choose to ride our bike to work instead of driving the car if the temperature is warm enough. One thing that is an effect of all these items is how we feel: • Are you sad that you cannot enjoy the outdoors due to rain? • Do you love that it’s raining so you can bundle up and read your favorite book? • Do you love the snow because it’s close to Christmas? • Do you hate the winter because you want it to be spring? These feelings are all brought out by the weather outside. One person can feel positive about a certain type of weather and one person can feel negative. Categorizing similar feelings from a large number of people may reveal some useful patterns that can help decision makers or shareholders make more appropriate decisions (e.g., carnival or sport game arrangements) with respect to weather conditions. Social media researchers can also gain benefits from these patterns by getting insights into how population behaviors may change with weather variation. However, rare visualization research exits to show the correlation between human sentiments and weather conditions. Furthermore, it remains a challenging task to depict massive and complex relationships among many objects. In this work, we present a novel tool, named Tweether, a visualization of real-time Twitter and weather data to show the feelings of current users and how their emotions could fluctuate regarding the weather. We also develop a prediction model that predicts the emotions with respect to the weather forecast. Our work showcases a 3D map which highlights select clusters of weather. The correlation of tweets to weather is represented by a graph. We use texture-based edge bundling to visualize the graph for reducing visual clutter. Introducing a clear relationship between weather and tweets, the design presents a natural manner of representing correlation and revealing high-level patterns. By using Tweether, the end-users, such as decision makers and social media researchers, can explore if the classified weather has any correlation to the majority of the population. Tweether can be used to examine if the majority of the population follows certain patterns, and how the overall sentiment changes in regards to weather. In addition, the prediction model predicts the future feelings of given weather so that public preferences could be foreseen for certain decision making. Related Work Visualizations correlating sentiment and weather are highly sparse, and the presence of live visualizations is also non-existent. However, there are studies showing the two portions of this work. Clustering of weather data has been done many times in the past. There are also a vast number of visualizations which indicate the sentiment of different locations. Psychological Studies The natural correlation of weather with emotion has been studied profusely [5, 11, 15, 17]. With various factors among the different research, the conclusions attained were broad. In general, humidity, sunshine, and temperature have the greatest effect on mood [11]. In some research on weather and mood, correlations have been debunked. There is no consistency due to seasons and time spent outside [5]. Having certain emotions regarding the season has strong links to seasonal affective disorder (SAD), where people are depressed in regards to changes of the seasons, which usually occurs during the winter. However, most psychologists believe that the weather has an impact on psychological intentions [15]. When observing serotonin levels in regards to sunshine there were strong relationships to being happy [17]. It has been found that weather may not play a big role in the positive attitude, but the negative attitude can have a correlation with weather [11]. Social Media, Weather, and Emotions Few works present the use of Twitter data for their social media feed and some form of weather data. In comparison to our work, the following research used past data and developed a 2D graph implementation for visualization. Work has been done using two to four years of Twitter data and correlating it with meteorological data from NOAA [8, 18]. Using urban areas in the United States as the area of interest, the tweets are passed to a sentiment analyzer that has a multilevel process. The researchers first determine keywords which are identified from public events (e.g., entertainment or natural disasters), then identify the mood state, and finally assign sentiment scores. To correlate the weather with the tweets they use a Generalized Mixed Model to display the non-linear relationship between emotion and weather. Using multiple variables for weather (temperature, temperature change, precipitation, snow depth, wind speed, solar energy, and hail), they determine the connection to hostility-anger, depression-dejection, fatigue-inertia, and sleepiness-freshness. Their results indicate that the warmer temperatures create an angrier atmosphere, lower depression, and less sleepiness, and they determine the influence of temperature to mood is trivial. Their visualization is limited to graphs [23]. Other than using urban areas in USA the researchers see relationships between temperature, humidity, and atmospheric pressure for tweets in the United States and weather data from Weather Underground. Using Linguistic Inquiry and Word Count for sentiment classification they see a pattern with temperature and emotion of every state in the United States. Using regression analysis, they find that the warmer states have a happier mood than the colder states. Their visualization is limited to a bar chart. These works are limited in visualization and usability study. Using past data is useful for our training and testing model; however, having a live view of what Twitter users feel is what we aim for in this work. Sentiment Analysis in Social Media Sentiment analysis has been studied vastly. There are various methods to detect the sentiment of a sentence. However, in regards to tweets sentences may be incomplete, because a tweet is limited to 140 characters and the need to express oneself is limited to short meaningful phrases. We find abbreviations, neologisms, acronyms, hashtags, emotions, and URL’s throughout most tweets. Certain features need to be extracted and some need to be filtered out. Filtering of URL’s, usernames, and Twitter special words may be needed in certain scenarios [20]. Stop words (i.e., a, an, the, etc.) are also removed due to not adding any extra sentiment information. For classifying the tweets, a number of different methods are used, and the most prevalent one is the Naive Bayes classifier [20, 22]. Emotions are used as basis for sentiment classification for classifying tweets as positive or negative for the training purposes [20, 1]. Visualizations Clustering Although clustering of data has been extensively studied, it remains a non-trivial task to deal with temporal and time series data. Weather data needs to be clustered based on values, proximities, and changes throughout the given time span. There have been various visualization techniques for time series data. Visualizing time series data using spirals for large data sets can better identify periodic structures in data [27]. Using wavelet to transform data along a multi-resolution temporal representation to find clusters with similar trends is a useful method for exploring data in a time series fashion [28]. Applying smooth data histograms for visualizing clusters in self-organizing maps is a simple method for 2D data sets [21]. The mass majority of clustering visualizations uses k-means clustering on the basis of their algorithms [27, 28]. We choose to follow this pattern as well, as k-means is a widely used algorithm that gives reasonably good results [27, 13]. Bundling The correlation between multiple entities (e.g., various weather and