Introduction
This project aims to summarize
measurements of sentiments, emotions, and reception and model potential
levels of user-generated content on Twitter about the LIV Golf
professional golf series between November 8th, 2022, and November 30th,
2022. Since the creation of a well-funded competing golf league to the
PGA Tour, the golfing world has been split between two camps, those
supportive of the newly established professional golf league and those
strongly opposed to the league. While the reasons for resistance to the
disruptive golf league vary, the 645 Twitter observations creating the
dataset used in this study were gathered exclusively in the English
language, represent a sampling of users distributed globally, and aim to
represent users’ opinions on both sides of the contentious topic. This
study takes place after the first year of the competing golf league and
captures conversations about the golf league on the Twitter platform for
analysis. Through exploratory and descriptive analytics conducted in
“R,” the research will better understand the LIV Golf brand perception,
followers of LIV Golf, and the reception of the competing golf league
into the sporting world. At the time of this paper’s writing, there are
many editorial articles about the LIV Golf vs. PGA Tour debate and the
future of golf globally; however, this study is the only known analysis
of sentiment, opinion, and prediction of LIV Golf through Twitter
analysis.
The LIV Golf field is currently comprised of a global
representation of players of varying abilities at different points in
their sporting careers. Notable participants in the golf league are
Major Championship winners and relatively well-known names in the
golfing world, like Sergio Garcia, Dustin Johnson, Brooks Koepka, Phil
Mickelson, Bryson DeChambeau, and the current number one player in the
world, Cam Smith. LIV Golf has spent exorbitant money to create the
league and to gain the consideration of these accomplished players to
depart from their current careers on the PGA Tour. It is reported that
the total salaries of the players, as mentioned earlier, are estimated
at greater than 750 million US dollars. While only some participants on
the 54-player roster have received the same compensation as some of the
most popular players in the sport, the contract is guaranteed income and
is not dependent on performance. Additionally, the total purse of all
LIV Golf events is 225 million dollars (Camenker, 2022). Even when
excluding the lesser-known players and operating expenses, over one
billion dollars has been spent by organizers to create the roster and
establish LIV Golf.
The sovereign wealth fund of Saudi Arabia finances LIV Golf. A
sovereign wealth fund is a state-owned investment that invests in real
and financial assets such as stocks, bonds, real estate, private equity
funds, and hedge funds. A persistent narrative in media reporting on the
challenging golf series is that creating the golf league is an effort by
the Saudi Arabian government to conduct large-scale sports washing. The
term sports washing describes efforts by individuals, corporations, or
governments leveraging the appeal of sports to improve sentiment toward
their socially unacceptable practices (Wojtowicz, 2022). The most famous
example of sports washing occurred in the 1936 Olympic Games hosted by
Germany and the Nazi regime. The German government made a significant
effort to convey a modern, civil, and ethical society to the rest of the
world. Other instances of sports washing occurred in the Moscow Olympics
in 1980, Beijing Olympics in 2008, Sochi Olympics in 2014, The World Cup
in Italy in 1934, The World Cup in Argentina in 1978, and The World Cup
in Qatar in 2022. Each event allows a global observer to become less
focused on social issues that would tarnish their reputation and become
more focused on the positives of sport.
This study is not the first research project that has sought to
extrapolate conclusions of a population based on a sample created by the
opinions found on Twitter. Studies from the Pew Research Center have
found that 80% of the American population accesses the internet daily
and approximately 15% of those users frequent Twitter (Smith, 2013). The
age, gender, and income levels have been established to be well
distributed which aid in eliminating specific sociological factors
(Smith, 2013). Through accessing Twitter data about a topic, broad
generalizations and conclusions can be formed. Unfortunately, because
downloads represent all social groups, it can be difficult to extract
meaningful conclusions about any social group. For that reason, the
target in this study does not focus on age, gender, or income level and
focuses conclusions exclusively on the entire population.
Like other studies, specific focus was placed on removing spam,
advertisements, and bots from the data set. Previous studies have found
that as much as 10% of the data on Twitter is spam (Chowdury, 2012). The
prevalence of spam can be problematic from a research perspective
because of how it can lead to faulty conclusions and inaccurate
analysis. To create accuracy of analysis, this study took painstaking
effort and exhausted all efforts to remove advertisements, bots, spam,
and other irrelevant data for the study.
During this study, established Natural Language Processing (NLP)
methods have been used at the word, and word pair level to extract
meaning and opinions from Twitter observations. Studies by Argawal, Xie,
Vovsha, Rambow, and Possenneau in 2011 conduct further analysis into
emoticons to deduce meaning from Tweets that have increased accuracy
over the state-of-the art methods used in this study. Unfortunately, the
research in this study does not leverage those methods but continues
with the established state-of-the-art baseline methods.