12/08/2021
Climate affects nearly every aspect of our lives, from our food sources to our transport infrastructure, from what clothes we wear, to where we go on holiday. It has a huge effect on our livelihoods, our health, and our future. In this project, we attempt to understand the popular sentiment among public based on their overall opinion. We would also explore and identify the frequently used terms related to global warming, for example “climate change”, “environment”, “temperature” etc. In summary, we would extract tweets using Twitter API, classify opinions into categories, create function to calculate score and visualize data to organize results to express views or emotions.
Sentiment analysis gives us insights into the things that automate mining of attitudes, opinions, views and emotions from text, speech, tweets and database sources. In this project, we use tweets extracted using Twitter API, store tweets as text data, classify opinions in text into categories like positive, or negative or neutral, create a function to calculate the score of each type of opinion in the text and try to explore and visualize as much as we can, using R libraries. Tweets can be imported into R using Twitter API, then the text data has to be cleaned before analysis, for example removing emoticons, removing URLs, etc. Along the way, we use sophisticated AI-related techniques (deep learning) like H20 or Keras.
Pulling Tweets into R (Linking Twitter’s API):
Text cleaning is one of the text mining processing to clean the words or other component that is hard to analyze or figure the meaning of the text. Text data or sentence data contains white spaces, punctuations, stop words etc. These characters do not inform much information and are hard to process for the sentiment analysis. For example, English stop words like “the”, “is” etc. do not tell you much information about the sentiment of the text, entities mentioned in the text, or relationships between those entities
Twitter data is a powerful source of information on a wide list of topics. More than half of the tweets examined support for a negative sentiment associated with Global Warming. Futhermore, climate change is now one of the two most important issues in politics. In this project, We used this data to find trends related to specific topics, measure popular sentiment, obtain feedback on past decisions and also help make future decisions. We further trained and evaluated our model in Keras with different settings and evaluation measures.
Sentiment Analysis has many different applications in several fields: