{"id":1396,"date":"2020-12-15T09:42:00","date_gmt":"2020-12-15T04:12:00","guid":{"rendered":"https:\/\/www.mygreatlearning.com\/blog\/tapping-twitter-sentiments-a-complete-case-study-on-chennai-floods\/"},"modified":"2025-01-31T00:22:26","modified_gmt":"2025-01-30T18:52:26","slug":"tapping-twitter-sentiments-a-complete-case-study-on-chennai-floods","status":"publish","type":"post","link":"https:\/\/www.mygreatlearning.com\/blog\/tapping-twitter-sentiments-a-complete-case-study-on-chennai-floods\/","title":{"rendered":"Tapping Twitter Sentiments: A Complete Case-Study on 2015 Chennai Floods"},"content":{"rendered":"\n<div class=\"inherit-container-width wp-block-group is-layout-constrained wp-block-group-is-layout-constrained\">\n<h2 class=\"wp-block-heading\" id=\"introduction\"><strong>Introduction<\/strong><\/h2>\n\n\n\n<p>We did this case study as a part of our capstone project at Great Lakes Institute of Management, Chennai. After we presented this study, we got an overwhelming response from our professors &amp; mentors. Later, they encouraged us to share our work to help others learn something new.<\/p>\n\n\n\n<p>So, using this case study we\u2019ve analyzed the crisis communication on Twitter happened during Chennai Floods. Also, we\u2019ve discovered patterns &amp; themes of communication, the way in which the platform was used to share information and how it shaped response to the crisis. After successfully finishing our study, the following objectives were achieved:<\/p>\n\n\n\n<p>Topic analysis of social media interactions to understand the different subjects of interactions, information sharing and grouping geographies into riskGrouping similar messages together with emphasis on predominant themes (rescue, food, supplies, ambulance calls)<br>This study is done on a set of social interactions limited to the first two days of December 2015, as these were the worst days of the crisis. The analysis was restricted to a set of 6000 tweets, for want of computing power. The tweets were extracted by looking for the hashtag #ChennaiFloods.<\/p>\n\n\n\n<p>Among analytical approach &amp; tools used, Topic Analysis of tweets is done using Latent Dirichlet Allocation. K-Means &amp; Hierarchical Clustering is employed on the themes of tweets. Tableau is used to create word clouds. Microsoft Excel and Word is used to perform spellchecks on extracted words.<\/p>\n\n\n\n<p>The findings of this study explores the viability of analyzing Twitter communication real-time with an aim of providing timely assistance to the affected areas.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"table-of-contents\"><strong>Table of Contents<\/strong><\/h2>\n\n\n\n<p><strong>Chennai Floods \u2013 Quick Story<\/strong><br><strong>Exploratory Data Analysis<\/strong><br><strong>2.1 \u2013 The Typical Tweet<\/strong><br><strong>2.2 \u2013 Data Extraction<\/strong><br><strong>2.3 \u2013 Data Preparation &amp; Exploration<\/strong><br><strong>2.4 \u2013 Text Preparation<\/strong><br><strong>2.5 \u2013 Word Frequencies and Associations<\/strong><br><strong>Clustering and Topic Modeling<\/strong><br><strong>3.1 \u2013 Hierarchical Clustering<\/strong><br><strong>3.1.1 \u2013 Interpretation<\/strong><br><strong>3.2 \u2013 k-Means Clustering<\/strong><br><strong>3.2.1 \u2013 Choosing k<\/strong><br><strong>3.3 \u2013 Topic Modeling<\/strong><br><strong>3.3.1 \u2013 Latent Dirichlet Association<\/strong><br><strong>Conclusion<\/strong><br><strong>4.1 \u2013 Limitations of this study<\/strong><br><strong>4.2 \u2013 Challenges in Real Time Analysis of Tweets<\/strong><br><strong>4.3 \u2013 Applications and Scope of Further Work<\/strong><\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"our-learning-journey\"><strong>Our learning &amp; Journey<\/strong><\/h2>\n\n\n\n<p><strong><span id=\"Chennai_Floods_Quick_Story\">1. Chennai Floods \u2013 Quick Story<\/span><\/strong><br>In November\u2013December 2015, the annual northeast monsoon generated heavy rainfall in theCoromandel Coast region of the South Indian States of Tamil Nadu and Andhra Pradesh, the union territory of Pondicherry with the city of Chennai particularly hard-hit.<br>Termed 2015 South Indian floods as the \u2018black swan\u2019 caused an enormous destruction. It is estimated that around 500 people lost their lives and over 18 lakh (1.8 million) people were displaced.<br><br>With approximations of damages and losses ranging from \u20b950000 crore (US$7 billion) to \u20b9100000 crore (US$15 billion), the floods were the costliest to have occurred in 2015, and were among the costliest natural disasters of the year. The flooding has been attributed to the El Ni\u00f1o phenomenon.<br>In the city of Chennai, the impact of the heavy rainfall was compounded by the overfull water reservoirs, which soon started to spill over into the city. Whole neighborhoods were inundated, with loss of power, communication and access to essentials. <br><br>The National Disaster Response Force (NDRF), the Indian Army and disaster management units were pressed into service. Rescue and rehabilitation efforts spanned well over 2 weeks.<br>What emerged conspicuously during this unprecedented crisis was the coordination of relief efforts over social media platforms such as Twitter, Facebook and WhatsApp. Volunteers, NGOs and other rescue parties were observed to be sending out alerts, requests and sharing useful information related to the flooding on social media. Others used social media to check up on their loved ones, share information, express opinions and sending out requests for help.<\/p>\n\n\n\n<p><br><strong>2. Exploratory Data Analysis<\/strong><br><strong>2.1 The typical tweet<\/strong><\/p>\n\n\n\n<p>A tweet is a social media message posted on Twitter.com. It is restricted to 140 characters. Though most tweets contain mostly text, it is possible to embed URLs, pictures, videos, vines and GIFs.<br>Tweets contain components called hashtags, which are words that capture the subject of the tweet. They are prefixed by the \u2018#\u2019 character. They can also convey emotion (#sarcasm) or an event (#IndiaVsPakistan), or popular catchphrases in pop culture (#ICantBreathe).<br>The usernames or handles of those who post are recognized by the \u2018@\u2019 symbol. A user can direct a message to another user by adding the handle with the \u2018@\u2019 symbol.<br>A retweet (\u2018rt\u2019 for short) is a tweet by a user X that has been shared by user Y to all of Y\u2019s followers. Thus, retweets are a way of measuring how popular the tweet is.<br>A user can \u2018favorite\u2019 a tweet; this is analogous to a \u2018Like\u2019 on Facebook.<\/p>\n\n\n\n<p>Also, uncover <a href=\"https:\/\/www.mygreatlearning.com\/blog\/transportation-problem-explained\/\">how Excel Data Analytics can help solve the Transportation Problem<\/a><\/p>\n\n\n\n<p><br><strong>2.2 Data Extraction<\/strong><br>Twitter has an official API called OAuth, a token-based authentication system that indexes tweets that match a given search string and writes the output to a file. While this service is free and convenient to perform a quick and efficient extraction of tweets, it has a crucial limitation: it can retrieve tweets only from the previous week.<br>Hence, a GitHub repository on the code-sharing portal was utilized to search for and extract tweets. The search term was the hashtag #ChennaiFloods. The python language was used (in an Ubuntu Linux environment) to perform the extraction; the extracted tweets were written to a comma-separated value file.<\/p>\n\n\n\n<p>Check out the<a href=\"https:\/\/www.mygreatlearning.com\/academy\/learn-for-free\/courses\/sentiment-analysis-using-python\" target=\"_blank\" rel=\"noreferrer noopener\"> Sentiment analysis using python<\/a> course to enhance your knowledge.<\/p>\n\n\n\n<p><strong>2.3 Data Preparation &amp; Exploration<\/strong><br>The information gathered contained hashtags and mentions of the users who tweeted about this event. These were separated in R using regular expressions.<br>The following wordcloud shows the hashtags used in the tweets.<br><img decoding=\"async\" width=\"781\" height=\"490\" class=\"alignnone size-full wp-image-1397\" src=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/Screenshot_441.png\" alt=\"Screenshot_44\"><br>From the hashtags used, the following themes are evident, other than the disaster itself:<br>Sympathy <strong>(#PrayForChennai)<\/strong><br>Requests for help <strong>(#savechennai, #ChennaiRainsHelp)<\/strong><br>Information on further weather forecasts <strong>(#chennaiweather)<\/strong><br>Information on specific areas in Chennai <strong>(#airport, #Chromepet)<\/strong><br>Cautionary messages <strong>(#ExerciseCaution)<\/strong><br>Various hashtags for the same topic are observable. This would make it challenging to separate all tweets on the subject.<br>The below chart shows the top 10 active Twitter users that participated in relief efforts. Some handles belong to prominent celebrities.<br>(It must be noted that these facts are true only with respect to the tweets collected; in reality, if all the tweets on the subject were analyzed, it is likely the no. of retweets, tweets &amp; favorites per user may change.)<br><img decoding=\"async\" width=\"601\" height=\"407\" class=\"alignnone size-full wp-image-1398\" src=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/Screenshot_451.png\" alt=\"Screenshot_45\"><br>Retweets are also likely to be heavily favorited.<br><img decoding=\"async\" width=\"605\" height=\"341\" class=\"alignnone size-full wp-image-1400\" src=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/Screenshot_471.png\" alt=\"Screenshot_47\"><br><strong>2.4 Text Preparation<\/strong><br>The tweets were parsed into a corpus for text analysis. The following steps were executed to clean the corpus and prepare it for further analysis. Only the text portion of the tweet (the actual message) was considered.<br>Removing numbers: TweetIDs are number generated by Twitter to identify each tweet. Numbers as such don\u2019t serve any purpose for text analysis and hence they are discarded.<br>Removing URLs &amp; links: Many tweets contained links to webpages and videos elsewhere on the Internet. These were removed with regular expressions.<br>Removing stopwords: Stopwords are words in English that are commonly used in every sentence, but have no analytical significance. Examples are \u2018is\u2019, \u2018but\u2019, \u2018shall\u2019, \u2018by\u2019 etc. These words were removed by matching the corpus with the stopwords list in the tm package of R. Expletives were also removed.<br>Removing non-English words: The corpus generated after performing the last 3 steps were broken into their constituent words and all English words and words less than 4 characters long were removed. What remained was the list of non-English words, words that mentioned areas in Chennai, words that are actually Tamil words written in English and misspellings of normal English words.<br>The non-English words that denoted the names of localities were used to form a wordcloud.<br><img decoding=\"async\" width=\"690\" height=\"351\" class=\"alignnone size-full wp-image-1401\" src=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/Screenshot_481.png\" alt=\"Screenshot_48\"><\/p>\n\n\n\n<p><strong>Stemming words:<\/strong> In text analysis, stemming is \u2018the process of reducing inflected (or sometimes derived) words to their word stem, base or root form\u2014generally a written word form\u2019. Stemming is done to reduce inflectional forms and sometimes derivationally related-forms of a word to a common base form. Many methods exist to stem words in a corpus.<br><br><strong>Suffix-dropping algorithms:<\/strong> The last parts of all the words get truncated. For example, words like \u2018programming\u2019, \u2018programmer\u2019, \u2019programmed\u2019, \u2018programmable\u2019 can all be stemmed to the root \u2018program\u2019. On the other hand, \u2018rescuing\u2019, \u2018rescue\u2019, \u2018rescued\u2019 are stemmed to form \u2018rescu\u2019, which is not a word or a root. This method was chosen for this study for simplicity.<br><br><strong>Lemmatisation algorithms:<\/strong> Each word is the determination of the lemma for a word in the corpus. This is done with the understanding of the context, part of speech and the lexicon for the language. For example, \u2018better\u2019 is related to \u2018good\u2019, \u2018running\u2019 is related to \u2018walk\u2019 and so on.<br><br><strong>n-gram analysis:<\/strong> Each word is broken into a part of its whole by \u2018n\u2019 characters, and the one that makes most sense is retained. For example, for n=1 (unigram), the letters \u2018f\u2019, \u2019l\u2019, \u2019o\u2019, \u2019o\u2019, \u2019d\u2019 are individually parsed from \u2018flood\u2019. For a higher n (say n=5), \u2018flood\u2019 is retained from \u2018flooding\u2019, although at n=4, \u2018ding\u2019 can also be construed as a word.<br><br><strong>Removing punctuation: <\/strong>Punctuation marks make no impact to the analysis of text and are hence removed.<br><br><strong>Stripping whitespace:<\/strong> Words that have extra whitespaces at the beginning, middle or end are subjected to a regular expression that removes the whitespace and retains only the words themselves.<br><br><strong>Checking for impure characters:<\/strong> A check on the corpus after the modifications made thus far revealed that some URLs were left behind, due to the removal of whitespaces, numbers and punctuations. Regular expressions were used to remove them.<br><br><strong>2.5 Word Frequencies and Associations<\/strong><br>After the necessary cleaning, another wordcloud was plotted to understand the most frequently used terms. The following observations are made:<br>A few simple words repeat more often than others: \u2018people\u2019, \u2018stay\u2019, \u2019safe\u2019, \u2018food\u2019 etc. These are immediate reactions and responses to the crisis.<br>Some infrequent terms are \u2018street\u2019, \u2018nagar\u2019, \u2018mudichur\u2019, \u2018road\u2019 etc. that give information about the situation in a few areas.<br>Words like \u2018pray\u2019, \u2018hope\u2019 and \u2018proud\u2019 are also used, in messages that mostly convey sympathy or hope.<br>There were tweets about news reports covering the crisis, with words like \u2018channel\u2019, \u2018media\u2019 etc.<br>Words like \u2018help\u2019, \u2018please\u2019, \u2018food\u2019 etc. were used in tweets that requested for volunteers to participate in rescue and rehabilitation efforts.<br><img decoding=\"async\" width=\"655\" height=\"376\" class=\"alignnone size-full wp-image-1402\" src=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/Screenshot_491.png\" alt=\"Screenshot_49\"><br><img decoding=\"async\" width=\"554\" height=\"527\" class=\" size-full wp-image-1403 alignnone\" src=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/Screenshot_50.png\" alt=\"Screenshot_50\"><\/p>\n\n\n<figure class=\"wp-block-image aligncenter td-caption-align-https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2016\/07\/Screenshot_513-1.png zoomable\" data-full=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2016\/07\/Screenshot_513-1.png\"><img decoding=\"async\" src=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/Screenshot_513.png\" alt=\"Screenshot_51\" class=\"wp-image-1404\"><\/figure>\n\n\n<figure class=\"wp-block-image aligncenter td-caption-align-https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2016\/07\/Screenshot_531-1.png zoomable\" data-full=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2016\/07\/Screenshot_531-1.png\"><img decoding=\"async\" src=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/Screenshot_531.png\" alt=\"Screenshot_53\" class=\"wp-image-1406\"><\/figure>\n\n\n<figure class=\"wp-block-image aligncenter td-caption-align-https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2016\/07\/Screenshot_541-1.png zoomable\" data-full=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2016\/07\/Screenshot_541-1.png\"><img decoding=\"async\" src=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/Screenshot_541.png\" alt=\"Screenshot_54\" class=\"wp-image-1407\"><\/figure>\n\n\n<figure class=\"wp-block-image aligncenter td-caption-align-https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2016\/07\/Screenshot_551-1.png zoomable\" data-full=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2016\/07\/Screenshot_551-1.png\"><img decoding=\"async\" src=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/Screenshot_551.png\" alt=\"Screenshot_55\" class=\"wp-image-1408\"><\/figure>\n\n\n\n<p>A few words are associated strongly with the most frequent words than others. The below table describes the most common word associations.<br>It was observed that the names of many localities feature in the associations list. These localities were mentioned with a lot less frequency than those in Figure 4. Considering these associations are from a list of 6303 tweets, more associations and localities are likely to emerge if the entire set of tweets on the crisis is considered.<br><img decoding=\"async\" width=\"746\" height=\"395\" class=\"alignnone size-full wp-image-1409\" src=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/Screenshot_561.png\" alt=\"Screenshot_56\"><\/p>\n\n\n<figure class=\"wp-block-image aligncenter td-caption-align-https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2016\/07\/Screenshot_571-1.png zoomable\" data-full=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2016\/07\/Screenshot_571-1.png\"><img decoding=\"async\" src=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/Screenshot_571.png\" alt=\"Screenshot_57\" class=\"wp-image-1410\"><\/figure>\n\n\n\n<p><strong>3. Clustering and Topic Modeling<\/strong><br>When a crisis like the Chennai floods strikes, a large number of similar tweets get generated that it becomes challenging to make meaningful interpretations from the huge volumes of data that need to be processed.<br>One solution is to cluster similar tweets together after performing the necessary EDA operations that it becomes easier to manage the flow of information.<br><br>Check out our blog on <a href=\"https:\/\/www.mygreatlearning.com\/blog\/introduction-to-spectral-clustering\/\">Introduction to Spectral Clustering<\/a> to learn more about this advanced clustering technique.<br><br><strong>3.1 Hierarchical Clustering<\/strong><br>Hierarchical clustering attempts to build different levels of clusters. Strategies for hierarchical clustering fall into two types (Wikipedia, n.d.):<br><strong>Agglomerative:<\/strong> where we start out with each document in its own cluster. The algorithm iteratively merges documents or clusters that are closest to each other until the entire corpus forms a single cluster. Each merge happens at a different (increasing) distance.<br><strong>Divisive:<\/strong> where we start out with the entire set of documents in a single cluster. At each step the algorithm splits the cluster recursively until each document is in its own cluster. This is basically the inverse of an agglomerative strategy.<br>The results of hierarchical clustering are usually presented in a dendrogram.<br>The R function, hclust() was used to perform hierarchical clustering. It uses the agglomerative method. The following steps explain hierarchical clustering in simple terms:<br>Assign each document to its own (single member) cluster<br>Find the pair of clusters that are closest to each other and merge them, leaving us with one less cluster<br>Compute distances between the new cluster and each of the old clusters<br>Repeat steps 2 and 3 until you have a single cluster containing all documents<br>To perform this operation, the corpus was converted into a matrix with each tweet (or \u2018document\u2019) given an ID. Extremely sparse rows, i.e. rows with elements that are part of less than 2% of the entire corpus were removed. Ward\u2019s method for hierarchical clustering was used.<br>The dendrogram output is to be interpreted as follows:<br>Farther the nodes, greater is the dissimilarity and more robust is that<br>The closer the node, the weaker is the<br>The height of each node in the plot is proportional to the value of the intergroup dissimilarity between its two<br><\/p>\n\n\n\n<p>Also don\u2019t miss out on valuable resources! <br>Learn where to find<a href=\"https:\/\/www.mygreatlearning.com\/blog\/free-download-datasets\/\"> Free Data Sets for Analytics\/Data Science Project <\/a>in latest blog post<\/p>\n\n\n<figure class=\"wp-block-image td-caption-align-https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2016\/07\/Screenshot_581-1.png zoomable\" data-full=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2016\/07\/Screenshot_581-1.png\"><img decoding=\"async\" src=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/Screenshot_581.png\" alt=\"Screenshot_58\" class=\"wp-image-1411\"><\/figure>\n\n\n<figure class=\"wp-block-image aligncenter td-caption-align-https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2016\/07\/Screenshot_614-1.png zoomable\" data-full=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2016\/07\/Screenshot_614-1.png\"><img decoding=\"async\" src=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/Screenshot_614.png\" alt=\"Screenshot_61\" class=\"wp-image-1414\"><\/figure>\n\n\n\n<p class=\"has-black-color has-text-color\"><a href=\"#2_._2_Data_Extraction\"><strong>3.1.1 Interpretation<\/strong><br>The following distinct clusters of tweets are observable from the dendrogram:<br>Tweets that talk about general information about affected individuals, areas and news about the crisis.<br>Tweets that talk about food, supplies and rescue efforts.<br>Tweets that describe the weather, forecasts of rain and further developments.<br>Tweets that caution people on risky areas and share information on relief efforts.<br>It is also seen that there is a significant similarity between clusters of tweets; this is expected as the terms used across tweets are more or less similar.<br>No locality or specific names are mentioned as the clustering was performed on a matrix that did not contain such rarely-occurring terms.<br><\/a><br><a href=\"#2_._2_Data_Extraction\"><strong>3.2 K-Means Clustering<\/strong><br>As opposed hierarchical clustering, where one does not arrive at the number of clusters until after the dendrogram, in K-means, the number of clusters is decided beforehand. The algorithm then generates k document clusters in a way that ensures the within-cluster distances from each cluster member to the centroid (or geometric mean) of the cluster is minimized.<br>A simplified description of the algorithm is as follows:<br>Assign the documents randomly to k bins<br>Compute the location of the centroid of each<br>Compute the distance between each document and each centroid<br>Assign each document to the bin corresponding to the centroid closest to<br>Stop if no document is moved to a new bin, else go to step<br><\/a><br><a href=\"#2_._2_Data_Extraction\"><strong>3.2.1 Choosing k<\/strong><br>The most significant factor of employing k-means clustering is choosing the no. of clusters, \u2018k\u2019. The \u2018elbow method\u2019, wherein the SUM of Squared Error (SSE, the sum of the squared distance between each member of the cluster and its centroid) decreases abruptly at that value that is theoretically the optimal value of k, is widely applied to arrive at k.<br>When k is plotted against the SSE, it will be seen that the error decreases as k gets larger; this is because when the number of clusters increases, they become smaller, and hence the distortion is also smaller.<br><img decoding=\"async\" width=\"753\" height=\"349\" class=\"alignnone size-full wp-image-1412\" src=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/Screenshot_591.png\" alt=\"Screenshot_59\"><br>Here, the optimal value for k is shown to be 3, as that is where the SSE decreases abruptly. With k=3, the matrix of tweets was clustered using k-means.<br><img decoding=\"async\" width=\"745\" height=\"392\" class=\"alignnone size-full wp-image-1413\" src=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/Screenshot_60.png\" alt=\"Screenshot_60\"><br>The plot clearly shows that there is only marginal dissimilarity with a corpus at 98% sparsity. This is evident from the top 10 words in each of the three clusters.<br><strong>cluster 1:<\/strong> rain need status flood helplin number contact stay safe atus<br><strong>cluster 2:<\/strong> food need contact avail peopl near water area call status<br><strong>cluster 3:<\/strong> people safe stay flood rain need near road contact media<br><\/a><br><a href=\"#2_._2_Data_Extraction\">With respect to clustering, subject matter and corpus knowledge is the best way to understand cluster themes. With the insights gleaned thus far, it is reasonable to assume the following:<br><strong>Cluster 1<\/strong> contains news updates and cautionary<br><strong>Cluster 2<\/strong> contains messages about requests for help and volunteers<br><strong>Cluster 3<\/strong> contains messages about area-specific updates and some cautionary<br><\/a><br><a href=\"#2_._2_Data_Extraction\"><strong>3.3 Topic Modeling<\/strong><br>Another technique that is employed to deduce the themes of text is topic modeling.<br>A topic model is a type of statistical model for discovering the abstract \u201ctopics\u201d that occur in a collection of documents (tweets in this case). Intuitively, given that a document is about a particular topic, one would expect particular words to appear in the document more or less frequently: in this case, \u2018help\u2019 is quite common to almost every tweet.<br>A document typically concerns multiple topics in different proportions; thus, in a document that is 10% about subject A and 90% about subject B, there would probably be about 9 times more words about \u2018B\u2019 than words about \u2018A\u2019 (Wikipedia, n.d.).<br>Topic modeling has implementations in various algorithms, but the most common algorithm in use is Latent Dirichlet Allocation (LDA).<br><\/a><br><a href=\"#2_._2_Data_Extraction\"><strong>3.3.1 Latent Dirichlet Allocation<\/strong><br>Latent Dirichlet Allocation (LDA) is a statistical model that allows sets of observations to be explained by unobserved groups that explain why some parts of the data are similar. For example, if observations are words collected into documents, it posits that each document is a mixture of a small number of topics and that each word\u2019s creation is attributable to one of the document\u2019s topics.<br>LDA allows the possibility of a document to arise from a combination of topics. For example, the following tweet may be classified as (say) 90% information &amp; 10% sympathy\/hope.<br><br>LDA was performed with an objective to discover 6 topics. The output gives us following set of words for each topic.<br><img decoding=\"async\" width=\"548\" height=\"134\" class=\"alignnone size-full wp-image-1415\" src=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/Screenshot_621.png\" alt=\"Screenshot_62\"><br>Topics 1 &amp; 3 are quite similar; this is in agreement with the results of the K-means exercise.<br><img decoding=\"async\" width=\"746\" height=\"388\" class=\"alignnone size-full wp-image-1416\" src=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/Screenshot_631.png\" alt=\"Screenshot_63\"><br>This plot graphs the probability associated with the top 100 words for each topic, sorted from most to least likely. Here too, the lines almost overlap, indicating the content similarity in the tweets.<br><img decoding=\"async\" width=\"750\" height=\"398\" class=\"alignnone size-full wp-image-1417\" src=\"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/Screenshot_641.png\" alt=\"Screenshot_64\"><br><strong>4. Conclusion<\/strong><br>It is clear that the general sentiment across the tweets render the tweets quite similar. It has been demonstrated that crucial information like the worst-hit areas can be identified by analyzing tweets and performing basic text analytics.<br>It is clear that the power of social media can be harnessed to great effect in times of crisis. This has not escaped Twitter\u2019s notice; they have initiated the practice of creating hashtags specific to individual crises to index tweets easily. Facebook launches \u2018Mark Safe\u2019 feature to those who have listed a crisis-hit location as their place of residence.<br>The government agencies, NDRF and other relief agencies would do well to develop analytics capabilities focused on mining Twitter for real-time, tangible updates to take meaningful action.<\/a><br><\/p>\n\n\n\n<p class=\"has-black-color has-text-color\">You can also explore real-world solutions with IT analytics through our blog on the <a href=\"https:\/\/www.mygreatlearning.com\/blog\/how-it-is-using-analytics-to-solve-industry-problems\/\">Top 5 Examples of How IT Uses Analytics to Solve Industry Problems<\/a>.<\/p>\n\n\n\n<p class=\"has-black-color has-text-color\"><a href=\"#2_._2_Data_Extraction\"><strong>4.1 Limitations of this study<\/strong><br>The study considers only 6000 tweets of the whole set of tweets that would have been sent on the subject.<br>The study also did not consider captions of pictures, news reports, and other social media reports which could have generated additional insights.<br>There exist other topic models and black box techniques for similar analysis that have a record of better performance. These have not been performed as they are beyond the scope of this exercise.<\/a><br><a href=\"#2_._2_Data_Extraction\"><br><strong>4.2 Challenges to Real-Time Analysis of Tweets<\/strong><br>The following points highlight a few challenges that will be faced by any researcher trying to solve the same problem.<br>Retweets contain information that many users find relevant. The subjectivity of this relevance to the crisis at hand is difficult, if not impossible to measure.<br>This problem is compounded if the tweets contain no hashtags. In the dataset generated for this analysis, 1399 tweets (22%) had no hashtags. These tweets may also be highly relevant to the crisis but may be missed due to the lack of hashtags.<br>Twitter has a 140 character-limit on all tweets (not including pictures and videos). This leads users to truncate or shorten words to forms that is easily interpretable to humans, but is challenging for a machine. Eg: \u2018afcted ppl\u2019 is easily understandable to mean \u2018affected people\u2019 for a human, but not for a program. One way to solve this problem is to maintain a dictionary of such terms and match them in real-time.<\/a><br><a href=\"#2_._2_Data_Extraction\"><br><strong>4.3 Applications &amp; Scope for Further Work<\/strong><br>As mentioned in the introductory chapter, this is an active field. The power of social media will continue to be researched and newer applications will continue to be built to harness its power.<br>One area is quashing rumors. During the Chennai floods, quite a number of false \u2018news reports\u2019 and \u2018alerts\u2019 circulated on Facebook, Twitter and the mobile messaging application WhatsApp. Machine learning can be employed to check the veracity of social media by comparing contents from actual news reports and<\/a><br><a href=\"#2_._2_Data_Extraction\"><br>During the 2011 Australia floods, the civic authorities were among the most prolific users of Twitter in disseminating safety tips, general information and coordinating volunteer relief work. Every civic authority would do well to develop a framework and system to manage crises also through social media. This covers all disasters, both natural (earthquakes, floods, hurricanes) and man-made (terror strikes, shootouts).<br>Media outlets and government agencies can work together in planning for incidents that are expected by creating distinct identifiers and hashtags for each scenario and making the public aware of the<\/a><br><a href=\"#2_._2_Data_Extraction\"><br>Disasters may strike at any time. While it may not be possible to prevent them, it is prudent to be prepared for unfortunate eventualities. Having a dedicated social network analysis platform to analyze information in real-time will definitely aid in this endeavor.<\/a><\/p>\n\n\n\n<p class=\"has-black-color has-text-color\"><em><strong>This article was contributed by Anandhi Venkat, Ashvin Kumar Elangovan, Vignesh Kumar and Vijay Somanath and the mentor Jatinder Bedi and was done as part of their capstone project. They were part of<a href=\"https:\/\/www.mygreatlearning.com\/pg-program-data-science-and-business-analytics-course\"> the <\/a><a href=\"https:\/\/www.mygreatlearning.com\/pg-program-data-science-and-business-analytics-course\" target=\"_blank\" rel=\"noreferrer noopener\">Great Learning PGP Data Science Course<\/a><a href=\"https:\/\/www.mygreatlearning.com\/pg-program-data-science-and-business-analytics-course\"> <\/a>in Chennai and finished their curriculum recently.<\/strong><\/em><\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"must-read\">Must Read <\/h2>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a href=\"https:\/\/www.mygreatlearning.com\/blog\/what-is-computer-vision-the-basics\/\">What is Computer Vision? Know Computer Vision Basic to Advanced &amp; How Does it Work?<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/www.mygreatlearning.com\/blog\/types-of-neural-networks\/\">Types of Neural Networks and Definition of Neural Network<\/a><\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"hear-from-our-successful-learners\"><strong>Hear from Our Successful Learners<\/strong><\/h2>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a href=\"https:\/\/www.mygreatlearning.com\/blog\/a-great-opportunity-to-explore-and-implement-concepts\/\">A great opportunity to explore and implement concepts \u2013 Omkar Sahasrabudhe, PGP AIML<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/www.mygreatlearning.com\/blog\/it-helped-express-my-understanding-of-new-techniques\/\">Great Learning helped express my understanding of new techniques \u2013 Capt Karthik, PGP AIFL<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/www.mygreatlearning.com\/blog\/mentoring-sessions-strengthen-the-understanding-of-concepts\/\">Structured Self-learning videos and mentoring sessions strengthen the understanding of concepts \u2013 Anand Venkateshwaran, PGP AIML<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/www.mygreatlearning.com\/blog\/videos-sessions-are-of-great-use-to-understand-the-concepts\/\">Great Learning videos &amp; sessions are of great use to understand the concepts in-depth \u2013 Lenin Christopher, PGP AIML<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/www.mygreatlearning.com\/blog\/utilizing-the-skills-acquired-to-address-one-of-the-clients-demands\/\">Utilizing the skills acquired at Great Learning to address one of the client\u2019s demands \u2013 Daniel Raju, PGP AIML<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/www.mygreatlearning.com\/blog\/planned-to-implement-data-driven-approaches-with-skills-gained\/\">I planned to implement data-driven approaches with the skills I gained during this course \u2013 Vanam Sravankrishna, PGP AIML<\/a><\/li>\n<\/ul>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>Introduction We did this case study as a part of our capstone project at Great Lakes Institute of Management, Chennai. After we presented this study, we got an overwhelming response from our professors &amp; mentors. Later, they encouraged us to share our work to help others learn something new. So, using this case study we\u2019ve [&hellip;]<\/p>\n","protected":false},"author":41,"featured_media":12841,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"_uag_custom_page_level_css":"","site-sidebar-layout":"default","site-content-layout":"","ast-site-content-layout":"default","site-content-style":"default","site-sidebar-style":"default","ast-global-header-display":"","ast-banner-title-visibility":"","ast-main-header-display":"","ast-hfb-above-header-display":"","ast-hfb-below-header-display":"","ast-hfb-mobile-header-display":"","site-post-title":"","ast-breadcrumbs-content":"","ast-featured-img":"","footer-sml-layout":"","ast-disable-related-posts":"","theme-transparent-header-meta":"","adv-header-id-meta":"","stick-header-meta":"","header-above-stick-meta":"","header-main-stick-meta":"","header-below-stick-meta":"","astra-migrate-meta-layouts":"set","ast-page-background-enabled":"default","ast-page-background-meta":{"desktop":{"background-color":"var(--ast-global-color-4)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"ast-content-background-meta":{"desktop":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"footnotes":""},"categories":[2],"tags":[],"content_type":[],"class_list":["post-1396","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-artificial-intelligence"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v27.3 (Yoast SEO v27.3) - https:\/\/yoast.com\/product\/yoast-seo-premium-wordpress\/ -->\n<title>Complete data analysis on 2015 Chennai floods | Great Learning<\/title>\n<meta name=\"description\" content=\"Case study as a part of capstone project at Great Lakes Institute of Management, Chennai prepared a data analytics report.\" \/>\n<meta name=\"robots\" content=\"noindex, follow\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Tapping Twitter Sentiments: A Complete Case-Study on 2015 Chennai Floods\" \/>\n<meta property=\"og:description\" content=\"Great Learning blog is a wonderful platform to know all the aspects of Data Science and Business Analyst Certification and how these Big Data Courses are transforming the industries.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.mygreatlearning.com\/blog\/tapping-twitter-sentiments-a-complete-case-study-on-chennai-floods\/\" \/>\n<meta property=\"og:site_name\" content=\"Great Learning Blog: Free Resources what Matters to shape your Career!\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/GreatLearningOfficial\/\" \/>\n<meta property=\"article:published_time\" content=\"2020-12-15T04:12:00+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-01-30T18:52:26+00:00\" \/>\n<meta property=\"og:image\" content=\"http:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2016\/07\/iStock-480553737.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1254\" \/>\n\t<meta property=\"og:image:height\" content=\"836\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Great Learning Editorial Team\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@https:\/\/twitter.com\/Great_Learning\" \/>\n<meta name=\"twitter:site\" content=\"@Great_Learning\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Great Learning Editorial Team\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"21 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/tapping-twitter-sentiments-a-complete-case-study-on-chennai-floods\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/tapping-twitter-sentiments-a-complete-case-study-on-chennai-floods\\\/\"},\"author\":{\"name\":\"Great Learning Editorial Team\",\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/#\\\/schema\\\/person\\\/6f993d1be4c584a335951e836f2656ad\"},\"headline\":\"Tapping Twitter Sentiments: A Complete Case-Study on 2015 Chennai Floods\",\"datePublished\":\"2020-12-15T04:12:00+00:00\",\"dateModified\":\"2025-01-30T18:52:26+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/tapping-twitter-sentiments-a-complete-case-study-on-chennai-floods\\\/\"},\"wordCount\":3679,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/tapping-twitter-sentiments-a-complete-case-study-on-chennai-floods\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/wp-content\\\/uploads\\\/2016\\\/07\\\/iStock-480553737.jpg\",\"articleSection\":[\"AI and Machine Learning\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/tapping-twitter-sentiments-a-complete-case-study-on-chennai-floods\\\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/tapping-twitter-sentiments-a-complete-case-study-on-chennai-floods\\\/\",\"url\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/tapping-twitter-sentiments-a-complete-case-study-on-chennai-floods\\\/\",\"name\":\"Complete data analysis on 2015 Chennai floods | Great Learning\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/tapping-twitter-sentiments-a-complete-case-study-on-chennai-floods\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/tapping-twitter-sentiments-a-complete-case-study-on-chennai-floods\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/wp-content\\\/uploads\\\/2016\\\/07\\\/iStock-480553737.jpg\",\"datePublished\":\"2020-12-15T04:12:00+00:00\",\"dateModified\":\"2025-01-30T18:52:26+00:00\",\"description\":\"Case study as a part of capstone project at Great Lakes Institute of Management, Chennai prepared a data analytics report.\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/tapping-twitter-sentiments-a-complete-case-study-on-chennai-floods\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/tapping-twitter-sentiments-a-complete-case-study-on-chennai-floods\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/tapping-twitter-sentiments-a-complete-case-study-on-chennai-floods\\\/#primaryimage\",\"url\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/wp-content\\\/uploads\\\/2016\\\/07\\\/iStock-480553737.jpg\",\"contentUrl\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/wp-content\\\/uploads\\\/2016\\\/07\\\/iStock-480553737.jpg\",\"width\":1254,\"height\":836,\"caption\":\"tapping twitter\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/tapping-twitter-sentiments-a-complete-case-study-on-chennai-floods\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Blog\",\"item\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"AI and Machine Learning\",\"item\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/artificial-intelligence\\\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Tapping Twitter Sentiments: A Complete Case-Study on 2015 Chennai Floods\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/#website\",\"url\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/\",\"name\":\"Great Learning Blog\",\"description\":\"Learn, Upskill &amp; Career Development Guide and Resources\",\"publisher\":{\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/#organization\"},\"alternateName\":\"Great Learning\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/#organization\",\"name\":\"Great Learning\",\"url\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/wp-content\\\/uploads\\\/2022\\\/06\\\/GL-Logo.jpg\",\"contentUrl\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/wp-content\\\/uploads\\\/2022\\\/06\\\/GL-Logo.jpg\",\"width\":900,\"height\":900,\"caption\":\"Great Learning\"},\"image\":{\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/www.facebook.com\\\/GreatLearningOfficial\\\/\",\"https:\\\/\\\/x.com\\\/Great_Learning\",\"https:\\\/\\\/www.instagram.com\\\/greatlearningofficial\\\/\",\"https:\\\/\\\/www.linkedin.com\\\/school\\\/great-learning\\\/\",\"https:\\\/\\\/in.pinterest.com\\\/greatlearning12\\\/\",\"https:\\\/\\\/www.youtube.com\\\/user\\\/beaconelearning\\\/\"],\"description\":\"Great Learning is a leading global ed-tech company for professional training and higher education. It offers comprehensive, industry-relevant, hands-on learning programs across various business, technology, and interdisciplinary domains driving the digital economy. These programs are developed and offered in collaboration with the world's foremost academic institutions.\",\"email\":\"info@mygreatlearning.com\",\"legalName\":\"Great Learning Education Services Pvt. Ltd\",\"foundingDate\":\"2013-11-29\",\"numberOfEmployees\":{\"@type\":\"QuantitativeValue\",\"minValue\":\"1001\",\"maxValue\":\"5000\"}},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/#\\\/schema\\\/person\\\/6f993d1be4c584a335951e836f2656ad\",\"name\":\"Great Learning Editorial Team\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/wp-content\\\/uploads\\\/2022\\\/02\\\/unnamed.webp\",\"url\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/wp-content\\\/uploads\\\/2022\\\/02\\\/unnamed.webp\",\"contentUrl\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/wp-content\\\/uploads\\\/2022\\\/02\\\/unnamed.webp\",\"caption\":\"Great Learning Editorial Team\"},\"description\":\"The Great Learning Editorial Staff includes a dynamic team of subject matter experts, instructors, and education professionals who combine their deep industry knowledge with innovative teaching methods. Their mission is to provide learners with the skills and insights needed to excel in their careers, whether through upskilling, reskilling, or transitioning into new fields.\",\"sameAs\":[\"https:\\\/\\\/www.mygreatlearning.com\\\/\",\"https:\\\/\\\/in.linkedin.com\\\/school\\\/great-learning\\\/\",\"https:\\\/\\\/x.com\\\/https:\\\/\\\/twitter.com\\\/Great_Learning\",\"https:\\\/\\\/www.youtube.com\\\/channel\\\/UCObs0kLIrDjX2LLSybqNaEA\"],\"award\":[\"Best EdTech Company of the Year 2024\",\"Education Economictimes Outstanding Education\\\/Edtech Solution Provider of the Year 2024\",\"Leading E-learning Platform 2024\"],\"url\":\"https:\\\/\\\/www.mygreatlearning.com\\\/blog\\\/author\\\/greatlearning\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"Complete data analysis on 2015 Chennai floods | Great Learning","description":"Case study as a part of capstone project at Great Lakes Institute of Management, Chennai prepared a data analytics report.","robots":{"index":"noindex","follow":"follow"},"og_locale":"en_US","og_type":"article","og_title":"Tapping Twitter Sentiments: A Complete Case-Study on 2015 Chennai Floods","og_description":"Great Learning blog is a wonderful platform to know all the aspects of Data Science and Business Analyst Certification and how these Big Data Courses are transforming the industries.","og_url":"https:\/\/www.mygreatlearning.com\/blog\/tapping-twitter-sentiments-a-complete-case-study-on-chennai-floods\/","og_site_name":"Great Learning Blog: Free Resources what Matters to shape your Career!","article_publisher":"https:\/\/www.facebook.com\/GreatLearningOfficial\/","article_published_time":"2020-12-15T04:12:00+00:00","article_modified_time":"2025-01-30T18:52:26+00:00","og_image":[{"width":1254,"height":836,"url":"http:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2016\/07\/iStock-480553737.jpg","type":"image\/jpeg"}],"author":"Great Learning Editorial Team","twitter_card":"summary_large_image","twitter_creator":"@https:\/\/twitter.com\/Great_Learning","twitter_site":"@Great_Learning","twitter_misc":{"Written by":"Great Learning Editorial Team","Est. reading time":"21 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.mygreatlearning.com\/blog\/tapping-twitter-sentiments-a-complete-case-study-on-chennai-floods\/#article","isPartOf":{"@id":"https:\/\/www.mygreatlearning.com\/blog\/tapping-twitter-sentiments-a-complete-case-study-on-chennai-floods\/"},"author":{"name":"Great Learning Editorial Team","@id":"https:\/\/www.mygreatlearning.com\/blog\/#\/schema\/person\/6f993d1be4c584a335951e836f2656ad"},"headline":"Tapping Twitter Sentiments: A Complete Case-Study on 2015 Chennai Floods","datePublished":"2020-12-15T04:12:00+00:00","dateModified":"2025-01-30T18:52:26+00:00","mainEntityOfPage":{"@id":"https:\/\/www.mygreatlearning.com\/blog\/tapping-twitter-sentiments-a-complete-case-study-on-chennai-floods\/"},"wordCount":3679,"commentCount":0,"publisher":{"@id":"https:\/\/www.mygreatlearning.com\/blog\/#organization"},"image":{"@id":"https:\/\/www.mygreatlearning.com\/blog\/tapping-twitter-sentiments-a-complete-case-study-on-chennai-floods\/#primaryimage"},"thumbnailUrl":"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2016\/07\/iStock-480553737.jpg","articleSection":["AI and Machine Learning"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.mygreatlearning.com\/blog\/tapping-twitter-sentiments-a-complete-case-study-on-chennai-floods\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.mygreatlearning.com\/blog\/tapping-twitter-sentiments-a-complete-case-study-on-chennai-floods\/","url":"https:\/\/www.mygreatlearning.com\/blog\/tapping-twitter-sentiments-a-complete-case-study-on-chennai-floods\/","name":"Complete data analysis on 2015 Chennai floods | Great Learning","isPartOf":{"@id":"https:\/\/www.mygreatlearning.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.mygreatlearning.com\/blog\/tapping-twitter-sentiments-a-complete-case-study-on-chennai-floods\/#primaryimage"},"image":{"@id":"https:\/\/www.mygreatlearning.com\/blog\/tapping-twitter-sentiments-a-complete-case-study-on-chennai-floods\/#primaryimage"},"thumbnailUrl":"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2016\/07\/iStock-480553737.jpg","datePublished":"2020-12-15T04:12:00+00:00","dateModified":"2025-01-30T18:52:26+00:00","description":"Case study as a part of capstone project at Great Lakes Institute of Management, Chennai prepared a data analytics report.","breadcrumb":{"@id":"https:\/\/www.mygreatlearning.com\/blog\/tapping-twitter-sentiments-a-complete-case-study-on-chennai-floods\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.mygreatlearning.com\/blog\/tapping-twitter-sentiments-a-complete-case-study-on-chennai-floods\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.mygreatlearning.com\/blog\/tapping-twitter-sentiments-a-complete-case-study-on-chennai-floods\/#primaryimage","url":"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2016\/07\/iStock-480553737.jpg","contentUrl":"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2016\/07\/iStock-480553737.jpg","width":1254,"height":836,"caption":"tapping twitter"},{"@type":"BreadcrumbList","@id":"https:\/\/www.mygreatlearning.com\/blog\/tapping-twitter-sentiments-a-complete-case-study-on-chennai-floods\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Blog","item":"https:\/\/www.mygreatlearning.com\/blog\/"},{"@type":"ListItem","position":2,"name":"AI and Machine Learning","item":"https:\/\/www.mygreatlearning.com\/blog\/artificial-intelligence\/"},{"@type":"ListItem","position":3,"name":"Tapping Twitter Sentiments: A Complete Case-Study on 2015 Chennai Floods"}]},{"@type":"WebSite","@id":"https:\/\/www.mygreatlearning.com\/blog\/#website","url":"https:\/\/www.mygreatlearning.com\/blog\/","name":"Great Learning Blog","description":"Learn, Upskill &amp; Career Development Guide and Resources","publisher":{"@id":"https:\/\/www.mygreatlearning.com\/blog\/#organization"},"alternateName":"Great Learning","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.mygreatlearning.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.mygreatlearning.com\/blog\/#organization","name":"Great Learning","url":"https:\/\/www.mygreatlearning.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.mygreatlearning.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2022\/06\/GL-Logo.jpg","contentUrl":"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2022\/06\/GL-Logo.jpg","width":900,"height":900,"caption":"Great Learning"},"image":{"@id":"https:\/\/www.mygreatlearning.com\/blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/GreatLearningOfficial\/","https:\/\/x.com\/Great_Learning","https:\/\/www.instagram.com\/greatlearningofficial\/","https:\/\/www.linkedin.com\/school\/great-learning\/","https:\/\/in.pinterest.com\/greatlearning12\/","https:\/\/www.youtube.com\/user\/beaconelearning\/"],"description":"Great Learning is a leading global ed-tech company for professional training and higher education. It offers comprehensive, industry-relevant, hands-on learning programs across various business, technology, and interdisciplinary domains driving the digital economy. These programs are developed and offered in collaboration with the world's foremost academic institutions.","email":"info@mygreatlearning.com","legalName":"Great Learning Education Services Pvt. Ltd","foundingDate":"2013-11-29","numberOfEmployees":{"@type":"QuantitativeValue","minValue":"1001","maxValue":"5000"}},{"@type":"Person","@id":"https:\/\/www.mygreatlearning.com\/blog\/#\/schema\/person\/6f993d1be4c584a335951e836f2656ad","name":"Great Learning Editorial Team","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2022\/02\/unnamed.webp","url":"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2022\/02\/unnamed.webp","contentUrl":"https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2022\/02\/unnamed.webp","caption":"Great Learning Editorial Team"},"description":"The Great Learning Editorial Staff includes a dynamic team of subject matter experts, instructors, and education professionals who combine their deep industry knowledge with innovative teaching methods. Their mission is to provide learners with the skills and insights needed to excel in their careers, whether through upskilling, reskilling, or transitioning into new fields.","sameAs":["https:\/\/www.mygreatlearning.com\/","https:\/\/in.linkedin.com\/school\/great-learning\/","https:\/\/x.com\/https:\/\/twitter.com\/Great_Learning","https:\/\/www.youtube.com\/channel\/UCObs0kLIrDjX2LLSybqNaEA"],"award":["Best EdTech Company of the Year 2024","Education Economictimes Outstanding Education\/Edtech Solution Provider of the Year 2024","Leading E-learning Platform 2024"],"url":"https:\/\/www.mygreatlearning.com\/blog\/author\/greatlearning\/"}]}},"uagb_featured_image_src":{"full":["https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2016\/07\/iStock-480553737.jpg",1254,836,false],"thumbnail":["https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2016\/07\/iStock-480553737-150x150.jpg",150,150,true],"medium":["https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2016\/07\/iStock-480553737-300x200.jpg",300,200,true],"medium_large":["https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2016\/07\/iStock-480553737-768x512.jpg",768,512,true],"large":["https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2016\/07\/iStock-480553737-1024x683.jpg",1024,683,true],"1536x1536":["https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2016\/07\/iStock-480553737.jpg",1254,836,false],"2048x2048":["https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2016\/07\/iStock-480553737.jpg",1254,836,false],"web-stories-poster-portrait":["https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2016\/07\/iStock-480553737.jpg",640,427,false],"web-stories-publisher-logo":["https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2016\/07\/iStock-480553737.jpg",96,64,false],"web-stories-thumbnail":["https:\/\/www.mygreatlearning.com\/blog\/wp-content\/uploads\/2016\/07\/iStock-480553737.jpg",150,100,false]},"uagb_author_info":{"display_name":"Great Learning Editorial Team","author_link":"https:\/\/www.mygreatlearning.com\/blog\/author\/greatlearning\/"},"uagb_comment_info":0,"uagb_excerpt":"Introduction We did this case study as a part of our capstone project at Great Lakes Institute of Management, Chennai. After we presented this study, we got an overwhelming response from our professors &amp; mentors. Later, they encouraged us to share our work to help others learn something new. So, using this case study we\u2019ve&hellip;","_links":{"self":[{"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/posts\/1396","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/users\/41"}],"replies":[{"embeddable":true,"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/comments?post=1396"}],"version-history":[{"count":19,"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/posts\/1396\/revisions"}],"predecessor-version":[{"id":109578,"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/posts\/1396\/revisions\/109578"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/media\/12841"}],"wp:attachment":[{"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/media?parent=1396"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/categories?post=1396"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/tags?post=1396"},{"taxonomy":"content_type","embeddable":true,"href":"https:\/\/www.mygreatlearning.com\/blog\/wp-json\/wp\/v2\/content_type?post=1396"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}