For this assignment a sentiment analysis on Argan Oil and Lard will be done. The reason for this is that within my final project research on different types of Oils and Fats I found Argan Oil as one of the highest priced Oils, while Lard was priced at the lower end. Interestingly enough a reason for the wide pricing margin could not be found based on data involving the make up of the oils or fats. Therefore, the goal here is to find further understanding of how Oils and Fats are priced when it comes to a sentiment analysis.
Data was scraped from Twitter using a developer account. A Twitter token was used to scrape the tweets from twitter. Once scraped and written into a csv, the data was then uploaded into OneDrive and at that point could be utilized. 500 tweets on Argan Oil and 500 tweets on Lard were gathered. These tweets were then combined into one set for ease of use here when doing the sentiment analysis.
Considering that both Argan Oil and Lard are used in everyday life by a wide variety of people I thought it would be interesting to see the way that people reacted when tweeting about them. With Argan Oil being priced higher than lard, generally speaking, and not much in the way of chemical makeup to explain why it deserves this higher price I would expect a high number of positive words vs. negative in favor of Argan Oil.
I do not think it will have a large impact just because people use these items at anytime of day meaning that spikes in positive could come anytime someone decides to use the oil or fat.
As I expected spikes in positivity did not occur surrounding a specific hour of the day. However, looking at the graph it is interesting to think about Lard tweets being far more frequent than Argan Oil tweets, as Lard tweets only went back a day while Argan Oil went back several. To me this means that Lard is used more frequently and starts to make sense of the price difference. Argan Oil may have to have high prices to turn a profit.
I would assume there are a few common words that people say in tweets that lead to higher sentiment scores, these could be words describing benefits or just the opposite showing what is driving them down.
Unfortunately I could not think of the function to allow the top 20 or so words to pulled out, however, if that function had been applied I would have been able to decipher further reasoning as to why the different words helped contribute to the overall sentiment on another level. As you can see the words that play a big role do exist as they look like they far exceed the rest.
After understanding more about how people perceive these products and utilize them on a daily basis it is my conclusion that Argan Oil is held in a more positive light due to the versatility of the product itself. When I was cleaning the data I found numerous examples of it being used in beauty products as well as in cooking, meaning that people seek it for many different reasons, all in the end contributing to its overall value.