For the 2017 season, the ATP has unveiled a new set of graphics they use during the telecast of a match. There are surprisingly many graphcis that pop up from time to time and they all have a cohesive theme. For example, the score is always shown in a corner of the screen, there’s a summary graphic at the end of each set, there’s a summary graphic at the end of each match, and they sprinkle in interesting little tid-bits as they become relevant throughout the match – things like each players first serve percent so far in this set, the average duration of each player’s service game, etc.

That they all match is cool. But they are generally worse than the graphics used in years past. I find the ‘match summary’ graphic particularly bad:

The highest-level information is the score (6-1, 7-5) is up in the top left corner. Probably could be a little easier to find. But that’s juse the score – the goal of this graphic is to provide insight into why the winner won and the loser lost.

There are six more pieces of information: 1. First serve percent in 2. First serve percent won 3. Second serve percent won 4. Count of aces 5. Count of break points and break points won per set and total 6. Count of double faults

It’s a little confusing that part of this information is in units of percent and part of it is in units of points. And the total number of points each player served isn’t shown, which makes it impossible to convert between the two units.

From more of a “design”" standpoint, the circular bar plots compress the information into a very thin strip and then curve it around a circle. The good part about this design is that certain breakpoints like 25%, 50%, 75% and 100% are easy to identify (right, bottom, left, and top). Simply showing the numberals of the counts seems a little lazy.

Probably the biggest problem is that yelow and white aren’t that different of colors and the association between color and player is hard (for me) to remember. One signal that the color scheme isn’t working is that the word “Nadal” appears five times, and the same for “Fognini”.

I’m not the only one who doesn’t like this graphic. So I decided to try to do better.

First Idea – One Mosaic Plot per Player

My first idea was to give the top banner to overall information, then under that the winner gets the left side of the screen and the loser gets the right side. This impairs comparisons between the players, which is a major negative, but I am going to try it out to start.

Here’s an example:

library(tidyverse)
library(ggmosaic)
library(gridExtra)
library(grid)
nadal <- data_frame(serve = factor(x = rep(x = c('first serve', 'second serve', 'double fault'), times = c(70, 30, 0)),
                                     levels = c('double fault', 'second serve', 'first serve')),
                    outcome = factor(x = c(rep(x = c('ace', 'win', 'loss'), times = c(10, 48, 12)),
                                           rep(x = c('win', 'loss'), times = c(24, 6))),
                                     levels = c('ace', 'win', 'loss')))
fognini <- data_frame(serve = factor(x = rep(x = c('first', 'second', 'double fault'), times = c(49, 41, 10)),
                                     levels = c('double fault', 'second', 'first')),
                      outcome = factor(x = c(rep(x = c('ace', 'win', 'loss'), times = c(5, 30, 14)),
                                             rep(x = c('win', 'loss'), times = c(13, 28)),
                                             rep(x = 'loss', times = 10)),
                                       levels = c('ace', 'win', 'loss')))
p1 <- nadal %>%
    ggplot(mapping = aes(x = product(outcome, serve), fill = outcome)) +
    geom_mosaic() +
    coord_flip() +
    scale_fill_manual(breaks = c('ace', 'win', 'loss'), values = c('darkblue', 'blue', 'red')) +
    theme_minimal() +
    theme(legend.position = c(1.2, 1.0),
          legend.title = element_blank(),
          axis.text.x = element_blank(),
          axis.ticks = element_blank(),
          axis.title.y = element_blank(),
          panel.grid = element_blank(),
          plot.title = element_text(hjust = 0.5)) +
    ggtitle('Nadal')
p2 <- fognini %>%
    ggplot(mapping = aes(x = product(outcome, serve), fill = outcome)) +
    geom_mosaic() +
    coord_flip() +
    scale_fill_manual(breaks = c('ace', 'win', 'loss'), values = c('darkblue', 'blue', 'red')) +
    theme_minimal() +
    theme(legend.position = 'none',
          axis.text.x = element_blank(),
          axis.ticks = element_blank(),
          axis.title.y = element_blank(),
          panel.grid = element_blank(),
          plot.title = element_text(hjust = 0.5)) +
    ggtitle('Fognini')
t <- textGrob(label = 'Nadal   6 7       Start time: March 18, 2017 11:45 AM \nFognini 1 5       Duration:   1hr 48min', 
              just = 'left',
              gp = gpar(fontfamily = 'mono'))
grid.arrange(layout_matrix = matrix(data = c(1, NA, 2, 3), nrow = 2, byrow = TRUE),
             grobs = list(t, p1, p2),
             heights = c(1, 5))

You can see that I used a fixed-width font for the top bar. That’s not because I think that looks good – it’s because I want things to line up and I can’t figure out how to get tabs to render right in a textGrob. I made some assumptions about the number of points each player served to get an equivalence between the aces/doubles and the percents.

I would love to get rid of the little spaces that seem to appear for unobserved combinations (e.g. neither player had a second serve ace, but the second serve row has a thin dark blue line on the right side).

I would love to get rid of the word “double fault” on the left side of Nadal’s plot, since he didn’t double fault at all, but I don’t know how to do that.

I would love to move the players’ names a bit closer to their plots, but I don’t know how to do that.

I would love to indicate ‘ace’ with the same blue that I used for ‘win’, but with some extra layer of hash marks or something to make it clear that aces are conceptually a subset a wins.

I would love to work in information on break points, but I’m not sure how to do it.

It might be nice to scale the size of each player’s plot according to the number of points he served, but I’m not sure if that’s a good idea.

Second Idea

Sankey plot.

---
title: "Improving the ATP Match Summary Graphic"
author: Robert W. Corty
output: 
    html_notebook:
        code_folding: hide
---

For the 2017 season, the ATP has unveiled a new set of graphics they use during the telecast of a match.
There are surprisingly many graphcis that pop up from time to time and they all have a cohesive theme.
For example, the score is always shown in a corner of the screen, there's a summary graphic at the end of each set, there's a summary graphic at the end of each match, and they sprinkle in interesting little tid-bits as they become relevant throughout the match -- things like each players first serve percent so far in this set, the average duration of each player's service game, etc.

That they all match is cool.
But they are generally worse than the graphics used in years past.
I find the 'match summary' graphic particularly bad:

![](example.jpg)

The highest-level information is the score (6-1, 7-5) is up in the top left corner.
Probably could be a little easier to find.
But that's juse the score -- the goal of this graphic is to provide insight into why the winner won and the loser lost.

There are six more pieces of information:
1.  First serve percent in
2.  First serve percent won
3.  Second serve percent won
4.  Count of aces
5.  Count of break points and break points won per set and total
6.  Count of double faults

It's a little confusing that part of this information is in units of percent and part of it is in units of points.
And the total number of points each player served isn't shown, which makes it impossible to convert between the two units.

From more of a "design"" standpoint, the circular bar plots compress the information into a very thin strip and then curve it around a circle.
The good part about this design is that certain breakpoints like 25%, 50%, 75% and 100% are easy to identify (right, bottom, left, and top).
Simply showing the numberals of the counts seems a little lazy.

Probably the biggest problem is that yelow and white aren't that different of colors and the association between color and player is hard (for me) to remember.
One signal that the color scheme isn't working is that the word "Nadal" appears five times, and the same for "Fognini".

[I'm not the only one who doesn't like this graphic.](http://www.menstennisforums.com/2-general-messages/879777-do-you-like-new-match-stats-graphics.html)
So I decided to try to do better.



# First Idea -- One Mosaic Plot per Player

My first idea was to give the top banner to overall information, then under that the winner gets the left side of the screen and the loser gets the right side.
This impairs comparisons between the players, which is a major negative, but I am going to try it out to start.

Here's an example:

```{r, echo=TRUE}
library(tidyverse)
library(ggmosaic)
library(gridExtra)
library(grid)

nadal <- data_frame(serve = factor(x = rep(x = c('first serve', 'second serve', 'double fault'), times = c(70, 30, 0)),
                                     levels = c('double fault', 'second serve', 'first serve')),
                    outcome = factor(x = c(rep(x = c('ace', 'win', 'loss'), times = c(10, 48, 12)),
                                           rep(x = c('win', 'loss'), times = c(24, 6))),
                                     levels = c('ace', 'win', 'loss')))

fognini <- data_frame(serve = factor(x = rep(x = c('first', 'second', 'double fault'), times = c(49, 41, 10)),
                                     levels = c('double fault', 'second', 'first')),
                      outcome = factor(x = c(rep(x = c('ace', 'win', 'loss'), times = c(5, 30, 14)),
                                             rep(x = c('win', 'loss'), times = c(13, 28)),
                                             rep(x = 'loss', times = 10)),
                                       levels = c('ace', 'win', 'loss')))

p1 <- nadal %>%
    ggplot(mapping = aes(x = product(outcome, serve), fill = outcome)) +
    geom_mosaic() +
    coord_flip() +
    scale_fill_manual(breaks = c('ace', 'win', 'loss'), values = c('darkblue', 'blue', 'red')) +
    theme_minimal() +
    theme(legend.position = c(1.2, 1.0),
          legend.title = element_blank(),
          axis.text.x = element_blank(),
          axis.ticks = element_blank(),
          axis.title.y = element_blank(),
          panel.grid = element_blank(),
          plot.title = element_text(hjust = 0.5)) +
    ggtitle('Nadal')


p2 <- fognini %>%
    ggplot(mapping = aes(x = product(outcome, serve), fill = outcome)) +
    geom_mosaic() +
    coord_flip() +
    scale_fill_manual(breaks = c('ace', 'win', 'loss'), values = c('darkblue', 'blue', 'red')) +
    theme_minimal() +
    theme(legend.position = 'none',
          axis.text.x = element_blank(),
          axis.ticks = element_blank(),
          axis.title.y = element_blank(),
          panel.grid = element_blank(),
          plot.title = element_text(hjust = 0.5)) +
    ggtitle('Fognini')

t <- textGrob(label = 'Nadal   6 7       Start time: March 18, 2017 11:45 AM \nFognini 1 5       Duration:   1hr 48min', 
              just = 'left',
              gp = gpar(fontfamily = 'mono'))
grid.arrange(layout_matrix = matrix(data = c(1, NA, 2, 3), nrow = 2, byrow = TRUE),
             grobs = list(t, p1, p2),
             heights = c(1, 5))
```

You can see that I used a fixed-width font for the top bar.
That's not because I think that looks good -- it's because I want things to line up and I can't figure out how to get tabs to render right in a `textGrob`.
I made some assumptions about the number of points each player served to get an equivalence between the aces/doubles and the percents.

I would love to get rid of the little spaces that seem to appear for unobserved combinations (e.g. neither player had a second serve ace, but the second serve row has a thin dark blue line on the right side).

I would love to get rid of the word "double fault" on the left side of Nadal's plot, since he didn't double fault at all, but I don't know how to do that.

I would love to move the players' names a bit closer to their plots, but I don't know how to do that.

I would love to indicate 'ace' with the same blue that I used for 'win', but with some extra layer of hash marks or something to make it clear that aces are conceptually a subset a wins.

I would love to work in information on break points, but I'm not sure how to do it.

It might be nice to scale the size of each player's plot according to the number of points he served, but I'm not sure if that's a good idea.

# Second Idea

Sankey plot.
