Exploring Emotions in Agritourism:
A Sentiment Analysis of Online Reviews in Campania, Italy
Autor: Luisa Fernanda Arenas Estevez
Tutor: Teresa Del Giudice | CoTutor: Roberta Garibaldi
December - 2024
Exploring Emotions in Agritourism:
A Sentiment Analysis of Online Reviews in Campania, Italy
Autor: Luisa Fernanda Arenas Estevez
Tutor: Teresa Del Giudice | CoTutor: Roberta Garibaldi
December - 2024
1. Introduction
Context:
Rural tourism has emerged as a key driver of local economic development and the revitalization of rural areas facing demographic and economic challenges. A distinctive model within rural tourism in Italy is Agritourism, a policy aimed at diversifying the agricultural economy. This form of tourism is exclusively carried out on agricultural holdings.
Problem:
Although agritourism is important for diversifying rural economies and supporting local development, there is limited understanding of how tourists perceive these experiences and the emotional value they derive from them. Understanding these perceptions is critical for improving services and evaluating the effectiveness of this public policy.
Justification::
In the context of a postmodern consumer, decision-making is influenced not only by functional and economic factors but also by emotions and meaningful experiences. Understanding these emotional drivers is essential to fully grasp consumer behavior. Analyzing online reviews allows us to capture these emotions and perceptions, providing valuable insights to enhance agritourism services and promote sustainable rural tourism.
To systematically analyze the experiences of agritourism consumers in the Campania region using Sentiment Analysis (SA) on online reviews.
Specific Objectives:
Guiding Questions:
2) Framework:
The Role of Emotions in Tourism
Emotions as Key Drivers:
Challenges in Measuring Emotions:
User-Generated Content (UGC):
Sentiment Analysis in Tourism Research:
Applications in Italy:
Evaluation of tourist satisfaction: Examples: Albergo Diffuso (Vallone & Veglio, 2019), Wine tours in Tuscany (Barbierato et al., 2022).
Analysis of destination reputation: Mount Etna (Graziano & Albanese, 2020), Uffizi Gallery (Collini et al., 2023).
Assessment of critical events (e.g., COVID-19 impact) (Peña et al., 2021).
Opportunities for Agritourism:
3) Materials and Methodology:
3.1 Data Collection
3.2 Methodology
3.1 Data Collection
3.2 Analysis Methodology
3.2.1 Data Preprocessing
Key Steps:
Example:
Original Review: E un posto bellissimo elegante rilassante Per una giornata all insegna del relax non c è di meglio
Clean Review: bellissimo elegante rilassante insegna relax meglio
The final result of this process was a set of unique elements or words \(𝑊 ={𝑤_1,𝑤_2,…,𝑤_𝑚}\)
where \(𝑤_𝑗∈ 𝑊\) corresponds to each unique word after text cleansing, and \(𝑚= 17122\) is the total number of unique words.
and the set of reviews \(𝑑\), where \(𝑑= {𝑑_1,_2,…,𝑑_𝑛}\) and \(n=4047\)
3.2.2 Frequency and Topic Analysis
DTM organizes the frequency of words across documents is defined as a matrix \(M\) with dimensions \(n \times m\):
\[ M = \begin{bmatrix} m_{1,1} & \cdots & m_{1,m} \\ \vdots & \ddots & \vdots \\ m_{n,1} & \cdots & m_{n,m} \end{bmatrix} \]
Where \(M_{ij}\) indicates the frequency of word \(w_j\) in review \(d_i\).
Example of a Mini DTM
Words | Review_1 | Review_2 | Review_3 |
---|---|---|---|
ottimo | 2 | 1 | 0 |
cibo | 1 | 2 | 1 |
qualità | 0 | 1 | 2 |
Topic Distribution per Document (\(\Gamma_{ik}\)):
\[ \Gamma_{ik} = \{\Gamma_{i1}, \Gamma_{i2}, \dots, \Gamma_{nq} \} \]
where \(k\) represents the topics, and \(\Gamma_{ik}\): is the probability that the review \(d_i\) belongs to topic \(k = {1,..𝑞}\).
Word Distribution per Topic (\(\phi_{jk}\)):
\[ \phi_{jk} = \{\phi_{j1}, \phi_{j2}, \dots, \phi_{mq} \} \]
where \(\phi_{jk}\) is the probability that the word \(w_j\) is associated with topic \(k\).
3.2.3 Sentiment Analysis (SA)
Sentiment Analysis identifies and quantifies the emotions expressed in textual data.
Tools: Syuzhet Package & NRC EmoLex Lexicon (Plutchik’s emotions).
Emotions Analyzed
Emotion | Meaning |
---|---|
Joy | Happiness, pleasure, satisfaction. |
Sadness | Loss, disappointment, unmet expectations. |
Anger | Frustration, injustice, hostility. |
Fear | Perception of insecurity or threat. |
Trust | Safety, acceptance, positive connections. |
Disgust | Rejection of unpleasant aspects (e.g., hygiene, quality). |
Surprise | Reactions to unexpected events (positive or negative). |
Anticipation | Expectations, curiosity, excitement for future events. |
Polarity Classification : Words categorized into : Positive: - Negative:
4) Results
The analysis was based on a total of 4,047 reviews collected from agritourisms in the Campania region, distributed across the five provinces of the region. These reviews cover a total of 251 communes
Province | Reviews | Communes |
---|---|---|
Salerno | 1,321 | 79 |
Avellino | 783 | 63 |
Benevento | 754 | 35 |
Napoli | 719 | 35 |
Caserta | 470 | 39 |
Total | 4,047 | 251 |
4.1 Frequency Analysis
Topic Identification
Topic 1. Gastronomy: This topic specifically addresses culinary experience, a recurring theme in the reviews. Words such as qualità, locale, menù, piatti, vino, and antipasti highlight aspects such as menu diversity and food quality.
Topic 2. Environment: This topic emphasizes visitors’ connection to nature and the relaxing environment offered by agritourisms. Associated with terms such as natura, verde, immerso and relax.
Topic 3. Authenticity: This topic highlights the authenticity and quality of local products, as well as personalized interactions with the owners. Words such as prodotti, sapori, genuini, and proprietari emphasize the relevance of local foods, as well as the hosts’ hospitality.
Topic 4. Service: This topic is dominated by terms related to hospitality and service quality. Words such as personale, gentile, accogliente and grazie reflect visitors’ appreciation for the friendly treatment and welcoming atmosphere.
Topic 5. Facilities: This topic focuses on the physical facilities of agritourisms. Terms such as struttura, piscina, camere, bagno, and parcheggio highlight the amenities available, including accommodation spaces.
Topic Identification by Province
4.2 Sentiment Analysis
The words were classified into the eight basic emotions proposed by Plutchik. The results reveal that words related to trust are the most predominant (72.55%), followed by joy (50.32%) and anticipation (35.06%). In contrast, words linked to anger (8.89%) and disgust (8.70%) are the least frequent.
Distribution of Sentiments by Province
When analyzing the distribution of emotions by province, a general homogeneity in proportions is observed
Province | Anger | Anticipation | Disgust | Fear | Joy | Sadness | Surprise | Trust |
---|---|---|---|---|---|---|---|---|
Avellino | 0.098 | 0.346 | 0.101 | 0.116 | 0.486 | 0.148 | 0.162 | 0.729 |
Benevento | 0.079 | 0.338 | 0.090 | 0.106 | 0.488 | 0.120 | 0.160 | 0.746 |
Caserta | 0.086 | 0.355 | 0.090 | 0.101 | 0.505 | 0.120 | 0.155 | 0.754 |
Napoli | 0.079 | 0.357 | 0.073 | 0.094 | 0.513 | 0.129 | 0.159 | 0.706 |
Salerno | 0.096 | 0.354 | 0.085 | 0.119 | 0.514 | 0.148 | 0.150 | 0.713 |
Campania | 0.089 | 0.351 | 0.087 | 0.109 | 0.503 | 0.136 | 0.156 | 0.726 |
Wordcloud by Emotion
Polarity
the sentiment analysis evaluated the valence within each review, categorizing words into two broad categories: positive and negative. The overall results show that 83.76% of the words analyzed were associated with a positive valence, while 16.24% corresponded to a negative valence
Province | Pos | Neg |
---|---|---|
Avellino | 83.1 | 16.9 |
Benevento | 84.0 | 16.0 |
Caserta | 84.8 | 15.2 |
Napoli | 84.6 | 15.4 |
Salerno | 83.1 | 16.9 |
Campania | 83.8 | 16.2 |
Conclusions
Key Insights:
Practical Implications:
Limitations:
Future Directions:
Muchas Gracias | Grazie mille | Thank you