1 Introduction

Heya, Go4ino here again with another SGC data analytics report. The original inspiration for this report was RebelFox on twitter asking if I could investigate if differences in performances existed between different champions with different types of player bases.

For example: Is there a difference between Ornn and Sona? Ornn has been played extensively by practically every top laner in SGC, while Sona is more or less exclusively played by Dean. Does this mean Sona has higher average performance? Or is it overall better to stick primarily to the meta?

Thankfully @pookarGG’s SGC data set exists and has concrete data to analyze. I did heavy data modification / rearranging / tinkering / etc to get the data sets I used in R-Studio.


2 The Data

As mentioned previously, the original source for this data was from Pookar’s SGC stats doc. In specific this report pulled data from the Champion By Player tab. I put all this data together into an Excel spreadsheet to read into R-Studio.

Pookar’s data takes match results data from almost every AM league up to 7/27, with only BIG League missing. Even with some matches being absent, there are 493 games present to analyze.

The sheet is heavily automated, meaning every tab of interest is updated when match histories are entered. I chose the Champion By Player tab as it allowed me to separate champions by position more easily to distinguish flex picks, and I felt like lumping flex champs together would skew results (eg: Top, Jungle, and Support Sett have vastly different statistics).


3 Analyzing the Data

3.1 The Basics; Categorizing and Defining the 3 Types of Playerbase

fig 1: Interactive dot-plot of player champ play rate vs win rate

note: only champions with at least 10 games played, and players with 10 games on a champ are shown.

To interact with the graph hover over the graph to display options, hover over data points to display critical information, and click on the positions in the legend to show/hide data points for a specific role.

The vertical lines at 20% and 60% of total play rate is where I drew lines for separating the 3 factor groups. I decided on these 3 groupings based off what I perceived to be 3 distinct groups of data, and picked 20% and 60% since they would divide the 3 groups fairly well. Of course, these separations are by no means written in stone and just my personal interpretation which I based purely off of Proportion of Picks, and my eyeball’s guessing power. Important to note is that due to the minimum game requirements this graph is naturally biased towards players who have played the most games. For example 100T FallenBandit moved to starting top during UPL playoffs and has only 8 games in the recorded data, meaning it is impossible for him to even be included anywhere on fig 1. Likewise, teams who play more games per set on average are more favored to have their players appear on the graph. Lets say Team X is extremely dominant, and 2-0s every opponent, but Team Y is less consistent and all their series go to game 3. This means that Team Y plays 50% more games than Team X despite playing the same number of Bo3 series, and inherently is biased in favor of Team Y.

Points to the left are more widely played meta champs, whereas points to the right tend to have players who represent a significant portion of a given champ’s playerbase. As such I have categorized the 3 playerbase groups as: Broad, Medium, and Narrow. You can think of Broad being meta champions, Medium being somewhat meta champions that certain players/teams may more heavily favor than others, and lastly Narrow is champions that are almost exclusively played by single players.

As expected, the further along the x-axis you go the fewer data points there are. But are there any differences to be noted?

3.2 Does Playing Narrow Category Champions Have Any Effect On Performance Off of That Champion?

This was a question that popped into my head for the 4 players in the Narrow category. While boasting high performance on those champions they’re known for is great, can they still perform when not on said champions? Because if they struggle on different champions, well then said power picks could be a potential weakness for their teams rather than a strength.

The column of most interest is Ratio which is just \(\frac{Column-2}{Column-3}\), as that can show if there is any statistically significant differences. A score of 1.00 means a player overall performs the same even if they aren’t playing their signature champion. Scores above/below 1 mean they perform worse/better overall when not playing their signature champion1. Ideally we want this ratio to be as close to 1 as possible, values significantly above 1 mean they struggle on different champions, and values below 1 are kinda pepega.

3.2.1 Always Plan Ahea

Always Plan Ahea Averages
Metric A-Sol Non A-Sol Champs Difference Ratio
Games 15 45
Win rate 0.8 0.3855 0.4145 2.0752
KDA 5.77 5.18 0.59 1.1139
KPAR 0.6285 0.7163 -0.0878 0.8775
DPM 656.15 604.29 51.86 1.0858
CS/Minute1 8.8 7.672 1.128 1.147
CS @ 101 80.73 80.226 0.504 1.0063
Vision Score/Minute 1.09 1.025 0.065 1.0634
Enemy Jungle Camp CS2 1.87 2.906 -1.036 0.6435
Lane Diff @ 10 -165 58.894 -223.894 -2.8016

1 Lane minions only, jungle camps are not included.

2 Number of enemy camps taken.

table 2

Always Plan Ahea has ratios mostly close to 1 across the board, of note though is Win rate. Because while the other 3 all have significantly higher win rates on their respective champions, Always Plan Ahea has a ratio over a whole 2.0. Almost all of Always Plan Ahea’s other metrics are fairly close with Lane Diff @ 10 being the other significantly different metric2\(^,\)3, I wonder if that has to do with A-Sol’s tendency to roam a lot? I expected a significant difference in win rate, but not this much. It’s not like there are major changes in the other metrics, so it could be some other factor outside of the data set, but as it stands this insane dip in win rate can not be ignored.

3.2.2 Dean

Dean Averages
Metric Sona Non Sona Champs Difference Ratio
Games 26 33
Win rate 0.7692 0.4322 0.337 1.7795
KDA 5.58 4.2936 1.2864 1.2996
KPAR 0.6723 0.6359 0.0364 1.0572
DPM 205.18 176.9464 28.2336 1.1596
CS/Minute1 1.29 1.1173 0.1727 1.1546
CS @ 101 14.15 11.4582 2.6918 1.2349
Vision Score/Minute 1.85 1.9236 -0.0736 0.9617
Enemy Jungle Camp CS2 0.31 0.0909 0.2191 3.41
Lane Diff @ 10 408.54 106.4127 302.1273 3.8392

1 Lane minions only, jungle camps are not included.

2 Number of enemy camps taken.

table 3

Like Always Plan Ahea, Dean also has a massive differential in Win rate albeit not as large of a gap. Lane Diff @ 10 is the other main massive difference4, with almost a 400% differential which can be interpreted to potential mean several things:

  • Dean is really damn good at Sona
  • Dean really knows his Sona match ups
  • Lobozz knows how to play with Sona support
  • Opponents don’t know how to properly counter Sona in lane

Any of the listed reasons could be true, but I do wish I had been able to interview FrostFire’s bot lane to get their perspectives. I can say for certain though Dean is very comfortable on Sona with almost as many games as Sona as games off Sona.

Likewise, all of Dean’s metrics (save for Vision Score/Minute) are above 1, with many being significantly higher5. I think it’s safe to say that Dean does perform notably better on Sona than when he is off Sona.

3.2.3 Hunter

Hunter Averages
Metric Nunu Non Nunu Champs Difference Ratio
Games 10 60
Win rate 0.7 0.5035 0.1965 1.3903
KDA 6.03 4.3153 1.7147 1.3973
KPAR 0.6886 0.6168 0.0718 1.1164
DPM 336.56 392.7333 -56.1733 0.857
CS/Minute1 5.29 5.242 0.048 1.0092
CS @ 101 3.8 4.4827 -0.6827 0.8477
Vision Score/Minute 1.01 1.2567 -0.2467 0.8037
Enemy Jungle Camp CS2 7 9.4993 -2.4993 0.7369
Lane Diff @ 10 -92.9 -491.558 398.658 0.189

1 Lane minions only, jungle camps are not included.

2 Number of enemy camps taken.

table 4

Hunter is definitely an interesting case, since he has roughly the same amount of ratios over 1 and ratios under 1. DPM being lower isn’t too surprising given that Nunu is a supportive tank jungler when several of the other junglers he has played are fairly damage focused like Kindred/Graves/etc. What did catch my eye though was Enemy Jungle Camp CS being lower on Nunu. I expected it to be somewhat higher given that Nunu’s consume helps him be a fairly good control jungler, but maybe Hunter’s playstyle is more objective / gank focused? I’m not sure, and that’s just a loose guess.

3.2.4 Lobozz

Lobozz Averages
Metric Jinx Non Jinx Champs Difference Ratio
Games 15 46
Win rate 0.7333 0.52 0.2133 1.4103
KDA 3.81 3.9943 -0.1843 0.9539
KPAR 0.6281 0.4735 0.1546 1.3266
DPM 675.62 496.53 179.09 1.3607
CS/Minute1 8.1 6.9971 1.1029 1.1576
CS @ 101 72.2 70.4586 1.7414 1.0247
Vision Score/Minute 0.76 0.6721 0.0879 1.1307
Enemy Jungle Camp CS2 2.67 1.6229 1.0471 1.6452
Lane Diff @ 10 -525.4 -20.7186 -504.6814 25.3589

1 Lane minions only, jungle camps are not included.

2 Number of enemy camps taken.

table 5

Lobozz is fairly similar to his lane buddy Dean, with all of his notable metrics being over 1 (aside from KDA), with many being significantly higher6. There are 2 notable differences in Win rate, and Lane Diff @ 10 however. Like Hunter, Lobozz holds a positive Win rate even off his signature champ, which his lane buddy Dean can’t say.

Once again I want to look at Lane Diff @ 10, as this is the exact opposite of Dean’s own score. Off of Jinx Lobozz has a fairly even laning phase, but on Jinx he has an average deficit of close to 2 kills worth of gold despite his other metrics suggesting he’d be doing better. Of course Jinx has a poor laning phase which may be partially to blame. Combine that weak laning phase with Dean’s preference for squishier enchanter supports, and the abundance of Nautilus + Thresh, maybe they had to cede some control in lane letting their lane opponents garner advantages elsewhere (eg: plates, roams, etc). It’s hard to really say exactly why this is the case with out more data / the player’s perspectives.

3.2.5 Overall Conclusions / Observations

As expected most of the values are above 1 in the Ratio column, there is a reason why they keep playing these champs when they can. But as for the original question of “Are these players still able to perform well off their main champion?”, I would say it depends from case to case. Furthermore, in retrospect I probably should have gotten the over all means for the player’s respective positions in each metric and also compared with them. That way I could say if a score is still “comparatively good”, even off their signature champ. All I can say definitively right now


4 Conclusions

4.1 Shoutouts

RebelFox for bringing this idea to me in the first place on Twitter. Pookar for managing to even get and compile all this data in the first place. Always Plan Ahea for letting me interview him to get some crucial insight.

4.2 Notes

  • Feel free to reference/use this paper wherever, just please credit my twitter @go4ino.
  • I can provide the source code for this document if desired, but I should upload a final version to the GitHub repository after I finish this hopefully.

5 Bibliography

5.1 Sources notes:

  • Pookar’s SGC stats Google sheet:
    • Has records of SGC matches between the dates of 5/12/20-7/27/20, which was patches 10.9-10.15.
    • It has the records for every player and team over 493 games. Which is 4930 raw data entries.
    • Data does not include bans, only picks. This does mean some champs who were near perma ban status at certain points (eg: Varus, Yuumi) may have much lower presence despite being S+ tier champs for large parts of the season.
    • It has data from Upsurge, Risen, LWL, Focus, and CUP.
    • BIG League, and FACEIT playoffs matches were not included due to lack of match histories to enter.
    • Lane Score stat is just XP and Gold earned at 10.
    • Lane10 is the differential vs your lane opponent.
    • Due to missing data from BIG and FACEIT, certain players and teams my be underrepresented. (EG: 100 Next only competed in 4/6 AM leagues, and with BIG gone potentially a quarter of their games are just not included.)
  • My personal datasets:
    • All notes for Pookar’s SGC stats Google sheet apply here too.
    • Are modified versions of Pookar’s Champion By Player tab in the CSV format so I can load them into R-Studio. + They also have new variables added that weren’t in the original dataset. Furthermore, most existing variables were renamed to be “code friendly”.
    • all role champs 7-27.csv is just a fusion of all 5 role’s individual spreadsheets.
    • all role champs 10g min 7-27.csv is a filtered version of all role champs 7-27.csv with only observations that meet the minimum criteria for a minimum of 10 games in total for a champion, and players who have played at least 10 games of said champion.
    • You are free to download and use my datasets for whatever, but be sure to credit both me (@go4ino) and Pookar (@PookarGG)

  1. If Lane Diff @ 10 has 2 negative values, then the interpretation is flipped.↩︎

  2. Enemy Jungle Camp CS, while it is significantly different based off the ratio, I deemed it non-important for everyone except Hunter.↩︎

  3. I’m going off gut feeling when I say “significant” here. A more accurate gauge for this significance would be to look at the Z-scores for both “As champion X”, and “Not as champion X” compared to the sample mean for the player overall. Sadly getting those would be too time consuming for my given time budget, but it is doable though if one tinkered with the Data tab rows from Pookar’s sheet.↩︎

  4. Enemy Jungle Camp CS, while it is significantly different based off the ratio, I deemed it non-important for everyone except Hunter.↩︎

  5. I’m going off gut feeling when I say “significant” here. A more accurate gauge for this significance would be to look at the Z-scores for both “As champion X”, and “Not as champion X” compared to the sample mean for the player overall. Sadly getting those would be too time consuming for my given time budget, but it is doable though if one tinkered with the Data tab rows from Pookar’s sheet.↩︎

  6. I’m going off gut feeling when I say “significant” here. A more accurate gauge for this significance would be to look at the Z-scores for both “As champion X”, and “Not as champion X” compared to the sample mean for the player overall. Sadly getting those would be too time consuming for my given time budget, but it is doable though if one tinkered with the Data tab rows from Pookar’s sheet.↩︎