Benchmark forwarding by Snabb branch

Simple slice

Compare multiple Snabb versions while keeping all other factors fixed (QEMU version, DPDK version, configuration, etc).

We do this by taking a subset of the full test matrix that only includes rows with a chosen value for each factor that we are not varying. The reason to subset the data this way is to make the analysis simple: every difference in the results should be explained by either a Snabb software difference (or some uncontrolled aspect of the test environment).

sliced <- subset(d, subset=(benchmark == "l2fwd" & kernel == "3.18.29" & qemu == "2.4.1" & dpdk == "16.04"))

Raw data

Simple look at the data for initial eyeballing.

p <- ggplot(sliced, aes(y=score, x=id, color=snabb))
p <- p + geom_point()
p <- p + geom_line()
p <- p + expand_limits(y=0)
p + ggtitle("Results ordered by test ID and colored by Snabb version")

Box plot

p <- ggplot(sliced, aes(x=id, y=score, color=snabb))
p <- p + geom_boxplot(alpha=0.50)
p + ggtitle("Distribution of results across all tests, by Snabb version")

Density plot

Density plot showing the distribution of results. Here we can see how spread out they are, how they cluster together, etc.

p <- ggplot(sliced, aes(score, fill = snabb, color = snabb))
p <- p + geom_density(alpha = 0.1)
p <- p + expand_limits(x=0)
p + ggtitle("Distribution shape of results")