Machine LoLing

Unsupervised fun with LoL data


The Definitive Season 16 Champion Class Encyclopedia

It’s been a long time coming, but we are revisiting our ecosystem posts from a year+ back, but this time we are incorporating more data, using better techniques, doing all roles, and I think generally being more respectful of the nuance of this challenge.

What are classes? At a fundamental level, we want to know what champions are similar to what other champions. There are a variety of ways to attack this problem, from evaluating kits to observing performance. Here, we are using champion’s match-ups (how they perform against other opposing champions) and synergies (how they perform with allied champions) to evaluate similarity. Our assumption is that two champions with similar kits will share similar match-ups and synergies with other champions and share a class.

The Mission: To show what champions are in the same class using using their match-ups and synergies from season 16.

The Outcome: Heatmaps establishing classes for each role. A champion similarity explorer to see which champions are most similar and why.

The Cool Stuff

Heatmaps showing classes:

Champion similarity app:

The Classes

Support

Dendrogram chart titled 'SUP Dendrogram' displaying a hierarchical structure with four categories: Warden, Playmaker, Mage, Cat, and Enchanter, each containing several subcategories.
Bar chart displaying the strongest matchup interactions for SUP_Warden against various opponents, with median matchup delta in percentage points.

ADC

Dendrogram illustrating the classification of ADC champions in a game, organized into four categories: Poke, LR_Immobile, SR_Mobile, and Anti-Melee.

Mid

A dendrogram titled 'MID Dendrogram' categorizing various characters into groups such as Melee Assassins, True Mages, Ranged Assassins, Battle Mages, and Anti-Melee, with connections and branches illustrated.

Jungle

A dendrogram illustrating different jungle champions categorized by their playstyle in gaming, including types such as Farming, Assassin, Heavy-Fighter, and more.

[This dendrogram uses SNN clustering instead of Ward. Junglers have less-clear classes than support, ADC, and mid, but these approximate classes feel reasonable.]

Top

A dendrogram titled 'TOP Dendrogram' displaying a hierarchical classification of characters categorized into four groups: Ranged (red), Duelist (blue), Melee (green), and Anti-Ranged Tank (purple).

[I know, melee is too big of a class. If you want to know why top classes were punted on, go read the nitty gritty.]

The Nitty Gritty

The Data: 8.6 million diamond+ games across regions BR, EUN, EUW, KR, NA, TW, and VN from patch 16.1-16.4 in ranked solo queue.

The Method:  For each role (Top, Mid, etc…), I determined champion similarity using match-ups and synergies against all roles. I eliminated overall champion win rate by employing WR-corrected deltas. I corrected for different sample sizes using Empirical Bayes shrinkage (explained in blindablility post). I removed damage type (magic vs physical) from synergies via principal component analysis. I measured champion similarity via Pearson correlation. I determined champion classes using Ward’s hierarchical clustering.

What is a class? Like really?

When determining class is easy: Rell and Leona’s match-ups have a Pearson Correlation of 0.6, and synergies 0.45. [Pearson Correlation measures the strength of the correlation between two variables ranging from 1 (perfect negative) to +1 (perfect positive), with 0 indicating no linear correlation] Therefor, they are extremely similar and are probably the same class.

Table showing champion Leona's stats, including total score, matchup, and synergy ratings, with icons representing strong allies and opponents.

Rell is Leona’s most similar peer. Leona is Rell’s most similar peer. When this is the case, any system will put them in the same class. In fact, the support class in general has very strong relationships like this as shown below.

A correlation heatmap showing relationships among different character types including Warden, Playmaker, Mage, Cat, and Enchanter, with a color gradient indicating correlation strength from -1 to 1.

Wtf am I looking at? This is a correlation matrix. The colors show Pearson Correlation. 1 means perfectly correlated (all match-ups and synergies match). You can see champions match themselves with a value of 1, so the diagonal is all green. Champions that aren’t correlated will be negative. For example, Nami and Braum have a very negative correlation. That means in match-ups and synergies where Nami is strong, Braum is weak, and vice-versa.

Now that you know how to read it, it’s easy to see why I say support has strong classes. Wardens, Mages, and Enchanter all are very similar to their own class and dissimilar to others. Mages and Enchanters have some similarity overlap. Playmakers are more diverse, you can tell because their block is less green.

Now, I had to choose a method to separate supports into these classes. I chose a Ward clustering method using using Manhattan distance on each champion’s full profile of matchup and synergy deltas. Ward’s method minimizes within-cluster variance — it merges the two groups that increase total variance the least at each step, which tends to produce compact, evenly-sized clusters. The dendrogram below uses Ward to assign groups.

Dendrogram illustrating the categorization of characters into four classes: Warden, Playmaker, Mage, Cat, and Enchanter, with branches and groups clearly labeled.

But minimizing within-cluster variance is just one way to group. Alternatively, we could use Shared Nearest Neighbor (SNN) clustering, which groups champions based on how many neighbors they share — if Leona and Nautilus both have Braum, Rell, and Taric among their closest neighbors, SNN considers them similar even if their raw profiles differ in magnitude. This makes it more robust to outliers but can struggle with ties in sparse neighborhoods. When we use this method, we get slightly different groups than before, as shown below.

A dendrogram representing an SNN clustering analysis with k=5, illustrating the hierarchical relationships between different clusters labeled as SNN_G1 to SNN_G5.

While Bard has migrated to the mage side, mages have bifurcated, and Yuumi has joined the enchanters, most of the other groups remain stable. The support role has very strong classes.

TL;DR: Assigning classes is easy when champions have strong similarities to a single set of peers.

But it isn’t always like that. Let’s talk about top lane.

Uh… Why did you punt on top classes?

When determining class it hard: Top lane is a different beast. Why? Take a look at the correlation matrix.

A heatmap displaying correlation values between different character categories in a game, with color coding indicating strength and direction of correlations.

Ranged? Strong. Duelist? Strongish. But the rest? Look how middling the Unique champion correlations are. Do Chogath and Ksante and Shen and Tahm share similarity with the true tanks? They do, but they share stronger similarities with Illaoi and Urgot and nasus and other fighters and tanks with mixed features. All classifiers, Ward or SNN or other will struggle to put that mess into groups.

Dendrogram diagram categorizing characters into four groups: Ranged (red), Duelist (blue), Unique (green), and True Tank (purple), with various names listed under each category.
Dendrogram illustrating hierarchical clustering of data points with k=10, featuring various groups represented in different colors.

So while I could subdivide into 10 groups and end up with some more groupings, the groupings aren’t really satisfying. Some make sense, like Jax-Volibear-Warwick. If you look at the correlation matrix you can see these 3 are STRONGLY associated. Others give you food for thought like Gangplank-Olaf. But Olaf is only really there because he has no home. Their common thread is probably being CC immune. And while I think these groupings are interesting, I don’t feel comfortable arbitrarily calling them classes.

This is why top has a massive grouping called “Unique”. There are some strong groupings in there, but we simply can’t arrange these into satisfying classes with any of the UMAP, Complete Linkage, Single Linkage, Ward, SNN, or other clustering methods I have tried, and I think it’s incorrect to do so. I could just manually group things, and I may have a crack in a later post, but for now let’s appreciate how diverse the Top role roster really is.

If you want some food for thought, here is another equally valid way of clustering top laners.

Heatmap displaying the Z-scored correlation matrix of various entities, showing positive and negative correlations represented by a color gradient.

Conclusion and Road Forward

I think we have really satisfying classes for the support role, reasonably satisfying classes for ADC, Mid, and Jungle, and pretty unsatisfying classes for Top. I’m happy with that for now. Clearly top will require its own post. You can use the champion similarity explorer to find champions opposite your current champs or similar to them. The match-up and synergy heatmaps can be a quick season 16 reference for what beats what. I hope you enjoy mucking around in the data and please leave suggestions for what to do next!

Discover more from Machine LoLing

Subscribe now to keep reading and get access to the full archive.

Continue reading