The paper’s aim is to modelling the sentiment analysist for one of the oldest Indonesian Aircraft, Sriwijaya Airlines. Sriwijaya Air founded in 2003 by Chandra Lie, Hendry Lie, Andi Halim and Fandy Lingga. In their first year, Sriwijaya Air grow rapidly, then today, Sriwijaya Air category’s is in Medium Service Airline.
One reason why I want to do the Sentiment Analysis in the aircraft, because in 9th January 2021, Sriwijaya Air SJ-182 (Boeing Classic 737) found missing after 4 minutes takeoff. The tracking shows that the plane was at an altitude of 250ft and at that point contact with the plane was lost, while the pilot had not declared any sort of emergency. So I want to know more on how their performance in the past.
Writer: Laura Florencia (4309985)
Data Science and Business Analytics
University of Warsaw
We do the data cleaning at the beginning of this project. The cleaning such as:
> any unneccesary symbol
> change the text into lower caps so that the machine can read it as the same characters
> remove the pucntuation
> remove any numbers
> add extra stopwords
> remove stopwords from corpus
> remove stopwords
> remove extra white space
> remove URL
> replace words because maybe some typo in the dataset
For the data cleaning, it’s actually we can decide which cleaning method that we aim to use or not.
In the paper, we put the inspect(docs) down below as a comment, because the result is about 400 lines long from the original dataset result that should be inspected before classified.
Before we start our Sentiment Analysis, lets first take a look at all of the responses in general.
Now let’s try and group phrases into a Positive, Neutral or Negative sentiment. We have identified 7161 positive and 3587negative words for our analysis.
Now, let’s take the same sentiments and further classify each word. So here we are not only do one time observation, but we do the second filtering layer using Formal Concept Analysis.
Formal concept and formal context are two basic notions for Formal Concept Analysis (FCA). To generate a list of formal concepts, we need to have a formal context. Context is represented using a triad (O, A, R) where O is a set of objects, A is a set of elements and R is the binary relation between O and A.
In FCA, we are know about the formal concepts determination which requires a formal context respectively. It should have a set of objects, attributes and a degree of memberships of each object. The process obtaining formal concepts depends on a sequence of steps, such as:
1. Tokenization and stop word removal (we did it in the first steps)
2. Lemmatization which was applied in order to bring the objects into its root forms
3. POS tagging, which performed out of various tagged words
4. Aspect extraction
5. Matrix generation
We follow the 5 steps of FCA to identify the opinions level that expressed on available aspects in the sentence.
Actually in this observation, we cannot do fully performed the stopwords removal, there is a word included in positive categories which is month name (August). We review the tokenized on the basis of the appearance of a list of puctuation marks, white spaces and other punctuations. Those punctuations marks were retained in order to decided the number of sentences. The next step is lematizations and lematized words has passed and in input for the next layar of observation. And at the end, aspect-level sentiment score were computed in order to make the category of sentence.
The categories generated a list that generated the sentiment score. The association rules generates the output containing document conclusions and sentiment score. It appear together in a list of the document.
The suggestion for Sriwijaya Air is to make a tool for complaint handling and representation to decrease the negative score. For now, the positive sentiment occurs more compared to the negative one. I want to know more about the sentiments in the Q1 and Q2 of 2021 after the accident. In the future, I will try to find out more about how Indonesian says about it. At the end, I support Sriwijaya Air. Don’t stop flying.
[1. Sriwijaya Air] (https://en.wikipedia.org/wiki/Sriwijaya_Air)
[2. Interpret the Sentiment Analysis] (https://monkeylearn.com/sentiment-analysis/)
[3. Sriwijaya Air Goes Missing] (https://onemileatatime.com/sriwijaya-air-737-crash-indonesia/)
[4. Corpus] (https://www.rdocumentation.org/packages/tm/versions/0.7-8/topics/Corpus) [5. Fuzzy Format Concept Analysis Based Opinion Mining for CRM in Financial Services - a paper from Kumar Ravi, Vadlamani Ravi and P. Sree Rama Krishna Prasad, 2017]