Travel industry is a highly competitive and seasonal industry. It’s subject to many external factors, ranging from location, season, weather, economy etc. Therefore, it’s crucial for the players in this industry to know their business well and try to use history as a guide to future operation. The best way to understand their own history is to look deep into past transaction data and slice/dice customer data, we well as pay attention to customer behavior data in real time. To start with, a deep understanding of what happened in the past could be a nice first step. By analyzing booking records, the business can take a deep look at
Business can adjust their business strategies accordingly. For example, their inventory strategy, pricing strategy and promotion strategy. The analysis below strives to analyze the data from 7 different angles to discover the areas of interest and deep dive in select topics.
The analysis of this project is based on a dataset of 119390 hotel booking records from 2015 to 2017.
The first chart renders the booking trend by arrival date. Confirmed bookings and cancelled bookings are separated so that the audience can clearly see the trend. Since both confirmed bookings and cancelled bookings could have business implications (e.g., inventory, cancellation policy etc.), it’s important to monitor the trend regularly and make necessary business adjustments.
We can see that
As we can see in the first chart, booking cancellation is pretty significant across the year and it varies a lot. As a business, we want to understand which market segment caused the most cancellations so that we could revisit GTM strategies as well as pricing strategies. We can easily observe from the chart below that Online TA (Travel Agency)’s cancellation proportion more than doubled in two years timeframe (from 29% in July 2015 all the way to 82% in August 2017). Further analysis needs to be done against this market segement to find out the cause.
The chart below depicts which Distribution Channel/Market Segment combinations have the highest average daily price (over two years time frame). As we can see, the following combinations are hero
Starting from 2016, Aviation has becoming a strong performing market segment
You can interact with the chart below by zoom in/out or mouse over the dots. Give it a try.
Price is not the only KPI, but volume as well. We want to know which market segment/distribution channel excel on both fronts. In the diagram below, segment/channel combination is ranked by price on the left, volume on the right. You can tell that even though some pairs’ price is ideal, but their transaction volume is quite low (e.g., Offline TA/TO | GDS). Some pair ranks pretty high in both measures (e.g., Direct|Direct). The third category would be fair on price but leading in volume (Online TA | TA/TO ). Based on this finding, business can optimize channel strategy and give more emphasis to leading pairs.
You can interact with the chart below by zoom in/out or mouse over the dots. Give it a try.
From business’s point of view, maximizing revenue is a goal. Since hotel inventory is fairly fixed, how to give more inventory to those performing companies is a key business question. The first chart below is a scatter plot of # of booking and average daily price. We can see that the variation is quite big. We form the hypothesis that a good performing company should satisfy the following criteria:
We use the criteria to single out some companies, highlighted by red circle in the chart below.
Using the criteria, we picked ten companies. The chart below depicts their average adr over time. It’s quite obvious that most companies’ adr fluctuate over time. However, company #40’s performance is very stable. Based on the finding, the business can come up with a strategy to maximize the revenue by giving different companies different inventory or different incentives.
From business’s perpective, it’s also interesting to find out if there’s any correlation among different parameters, even though correlation doesn’t imply causation. The chart below depicts the correlation among all numeric fields in the booking data. As you can tell, there’s no strong correlation (correlation coefficient greater than 70%) between any two variables. The only visible positive correlation is between weekday stays and weekend stays. While arrival week number seems to have noticable negative correlation with arrival year. However, it was caused by the starting and end point of the dataset itself, thus no business significance.
We want to know how advanced do people book their hotel prior to their arrival. The first chart shows the average lead time (the difference between booking date and actual arrival date, for confirmed booking only) comparison between global and top 6 countries (in terms of # of bookings).
** Observations:**
The spread of lead time varies a lot country by country. Again, you can tell from the chart below that lead time for people with Germany and Britain origin varies a lot on any given month, comparing with that of other countries. Peole from Spain tends to have shorter lead time and narrower lead time span, indicating that they tend to have more ad hoc travels. Business can create different marketing strategies based on this finding to change people’s behavior based on thier origin.
The analysis above touched on trending analysis, behavior analysis and performance analysis. More in-depth analysis could be done towards finding out more insights. For example, if the dataset starts in January and ends in December in all three years, then accumulated booking can be compared across years to get a sense of the economy. If in certain year travel industry gets hit by whatever factor, the accumulated booking by months can be different, even though the month by month pattern is the same. Those more advanced analysis is outside the scope of the project.