Gen 2 GSC Ubers Viability Rankings

Earthworm

is a Site Content Manageris a Forum Moderatoris a Community Contributoris a Top Contributoris a Tournament Director Alumnusis a Super Moderator Alumnusis a Tiering Contributor Alumnusis a Smogon Media Contributor Alumnusis the Smogon Tour Season 6 Championis a Past SPL and WCoP Championis the defending GSC Circuit Champion
Moderator
GSC Ubers Viability Rankings

S Rank:

S1 Rank
01
Snorlax


S2 Rank
02
Mew
03
Mewtwo


A Rank:

A1 Rank
04
Lugia
05
Zapdos


A2 Rank
06
Steelix
07
Forretress


A3 Rank
08
Celebi
09
Raikou
10
Tyranitar
11
Ho-Oh
12
Cloyster


B Rank:

B1 Rank
13
Skarmory
14
Umbreon
15
Jolteon
16
Golem
17
Rhydon
18
Blissey


B2 Rank
19
Misdreavus
20
Gengar
21
Heracross
22
Marowak


C Rank:

C1 Rank
23
Vaporeon
24
Miltank
25
Starmie
26
Smeargle
27
Muk
28
Exeggutor


30 September 2019: VR first posted
 

Earthworm

is a Site Content Manageris a Forum Moderatoris a Community Contributoris a Top Contributoris a Tournament Director Alumnusis a Super Moderator Alumnusis a Tiering Contributor Alumnusis a Smogon Media Contributor Alumnusis the Smogon Tour Season 6 Championis a Past SPL and WCoP Championis the defending GSC Circuit Champion
Moderator
To explain how I came to the above results, I first publicly linked a survey form in this post. There were a total of 14 respondents to the survey form: Mr.378, CKW, me, FriendOfMrGolem120, Eeveeto, Beelzemon 2003, Eo Ut Mortus, Waveshaper, Isa, zf, M Dragon, Watchog, Jorgen, and Blightbringer. The form contained a question asking for the number of GSC Ubers games played as a means of determining experience in the tier and deciding how to weight the responses when determining the final results. Initially I used the upper end of # of games played as a raw weighting, i.e. my rankings counted as though 200 people had submitted them, while Isa's counted as 100, etc. I then received feedback saying that it would be better to use the square root of this number instead to reduce the degree of bias towards people that had played a huge number of games in the tier. I was also originally adding a flat 1 extra weighting to all submissions, meaning even though Eo Ut Mortus had never played GSC Ubers before, his ratings would influence the outcome. Upon receiving further feedback, this was removed. I then compared the rankings with and without outliers with vapicuno. He showed me how outliers were affecting the outcome due to the relatively small number of respondents and wildly different ratings from Beelzemon 2003. With outliers removed, the results look a lot cleaner and I am confident we have come to the optimal result possible from this data set.

This time, there were material differences between the rankings with and without outliers. You can see what changed by comparing the Weighted_Result with the Weighted_OR_Result sheet in the spreadsheet.

This form was a big improvement on the previous google form used in the OU VR, but one thing I will clarify next time is when to use the "N/A" option--some people used it for Pokemon they thought didn't deserve to be ranked, while others used it when they had no experience using or facing the Pokemon in question.

With vapicuno's help yet again, here are some nice pictures to show everyone's rankings and how I split up the tiers above. Everything below is using the weighted results, with outliers removed. The charts have error bars based on weighted standard deviation.

Everyone's rankings compared with the result and the Old VR (which didn't rank Pokemon within sub-ranks so is a bit misleading)


Chart of each Pokemon's ranking against their weighted average ranks (i.e. if they were very close in ranking, they will be close together on the y axis)


Same chart as above but with coloured boxes showing where I drew the divisions between ranks in the OP


Please let me know if you have any questions and I will do my best to answer.
 

Users Who Are Viewing This Thread (Users: 1, Guests: 0)

Top