Data Official Smogon University Usage Statistics Discussion Thread

Status
Not open for further replies.

Antar

is a Battle Server Administratoris a Programmeris a Super Moderatoris a Community Contributor
Official Data Miner
#1

In recent months, there has been a massive explosion of activity on Pokemon Showdown. While that's great and all, it has led to some problems. First, it takes a lot longer to process the stats than it did when we were dealing with a Pokemon Online server running off my desktop computer (Comcast did not care for that at all, lol). Secondly, there are now waay more active metagames than ever before. This is a good thing, by and large, but it means that my statistics threads have grown so large that it takes my browser a full minute to load the thread sometimes.

Finally, we're attracting a bunch more players who are new to competitive Pokemon. Considering we use usage stats for tiering, this is kind of a big problem, and what we've decided to do about that is to use weighted stats for tiering, with baselines significantly above that of a "new" or "average" player. Read more about that here. But what that means is that now I need to not only prepare "regular" stats for each metagame, I need to prepare stats at a variety of baseline levels, and putting that all into a single thread would be a *nightmare.*

So instead, i'm doing away with the monthly stats threads, and we're going to have a single continuous thread for statistics discussion. The previous rules apply, namely:

NOTE: DISCUSSION IN THIS THREAD WILL BE LIMITED TO STATISTICS CALCULATIONS, CLARIFICATIONS AND OVERALL TRENDS. DISCUSSIONS OF INDIVIDUAL POKEMON WILL BE DELETED (each Pokemon has its own thread--discuss there). POSTS THAT SIMPLY QUOTE OR REFERENCE STATISTICS WITH PERSONAL COMMENTARY WILL BE DELETED. POSTS DISCUSSING HYPOTHETICAL LOWER TIERS WILL BE DELETED.
and I'll announce each month when the stats are "up."

Feel free to ask any questions you have about how things are calculated, but be sure to first check the FAQ directly below this post.

Enjoy, data junkies!
 

Antar

is a Battle Server Administratoris a Programmeris a Super Moderatoris a Community Contributor
Official Data Miner
#2
Frequently Asked Questions
  1. I can't load that link!
    That stinks! You must be on a network that blocks port 8080. Unfortunately, there's nothing I can do about that right now. Eventually, we might move the stats to their own dedicated web server, and that will probably fix your problem.
  2. What's this business with "Raw" and "Real?"
    Jimera0, yeah--I should really include just a bit of text at the top of each Standard Stats post...
    • Usage % : Weighted
    • Raw: Unweighted
    • "Real": Only counts the Pokemon which actually appear in battle (Doubles not supported)
    The reason for the name "real" is historic--back when I first took over the stats and then the running of PO, only the Pokemon that appeared in battle were recorded in the logs, so there was no way to actually *get* the full team stats. When I modified PO to generate logs with full team info in them, we were left with a decision regarding which stats to use, and the argument was that counting only Pokemon appearing in battle was somewhat more legit, because that corresponded to actual, or "real" usage (that argument lost out in the end).
  3. How are usage stats weighted?
    Every player on Pokemon Showdown has a skill rating for each metagame they participate in. This rating--which is different from your ladder score--is calculated using an algorithm called Glicko and consists of an estimated skill value R and an uncertainty in that estimate RD. Based on these two values, we calculate the likelihood that a given player has a "true" skill value above a certain baseline (the conventional baseline was 1500, corresponding to the "average" player). For more about ratings, read here. For more about weightings, read here.

  4. How are tiers determined from usage?
    Tiers are based off a predictive algorithm designed to estimate how often a Pokemon will appear in the next month's usage statistics, based on the usage stats for the past three months (we update our standard tiers every three months). So we start by weighting the last three months' stats like this:
    Code:
    Three month usage= (20x last month + 3x month before that + 1x month before that)/24
    then the "OU" list for that metagame consists of all the Pokemon who appear on at least ~3.41% of teams, which is not as random a number as it might seem. Note that suspect tests are designed to move Pokemon into the Borderline ("BL") teams, which, like Ubers, are not based on usage statistics.

  5. Why does "Illuminate" sometimes show up in the abilities section of the moveset stats for Pokemon that can't have Illuminate as an ability?
    "Illuminate" is my placeholder for "no ability," or an ability that simply isn't recognized. This kind of situation happens when Showdown glitches out and (should be) exceedingly rare. Note that the nature equivalent is Hardy (though all five neutral natures are also aliased to Hardy) and the item equivalent is "nothing" (though that could also correspond to no item).

  6. What's the deal with the file names?
    You'll notice that for each tier and type of analysis, there are a bunch of of different files, most with names like uu-1695.0.txt. The first part of the filename is the tier, the second part is the weighting baseline (see 3). If there's no number following the tier name, then the baseline is 1500. Also note that a baseline of 0.0 means that the stats are basically unweighted.

  7. Can I perform my own analyses?
    Due to privacy concerns, I can't give you access to the raw logs, but if you have background with a programming language that can parse json, take a look in the "chaos" folder of each month's stats. Those files contain all the information used to generate the moveset statistics and include a lot more data than I could feasibly put into a file.
More to come!
 
Last edited:
#4
What's the difference between files like "ou.txt" and "ou-0.0.txt"? I get that other numbers like "ou-1760.0.txt" is usage statistics high up the ladder, but the first two confuse me
 
#6
Thanks for the stats as always!

RU is set to start this month apparently. What stats will be used for making RU? There's UU stats and UU beta stats for this month, will just the UU stats be used, or both of them? Also, there isn't 3 months worth of 1760 stats, so will just this month's be used this time?
 
#8
When the time comes for 3 month cumulative stats to be posted, not just for UU but for all tiers, is there going to be a place where we could find them?
 
#10
I don't know if this will be considered ok to ask here, but I'm not sure where else to ask: given that UU went official and out of beta, does that mean the tiers won't update for another three months now? I just noticed that a few things moved up and down (if using OU 1760 like last month). If this is the wrong place to ask this then feel free to delete this or whatever action is necessary.
 
#14
I'm sorry if I didn't search hard enough but where can I see the mega usage stats ?
And what the hell is seasonalfabulousfebruary ?
 
#17
I apologize if I'm just missing something, but will the changes since last month still be viewable? Doing them for every cutoff level for each metagame would be pretty unnecessary, but it would be nice to see them for the only stats used for tiering at the very least.
 
#21
Since UU is now official, changes between OU and UU will now happen every three months. The changes you listed (along with Quagsire possibly moving to OU) will likely happen in June.
This may be a stupid question, or it may even be in the wrong place (or both), but why do we do it this way? Why not have them update every month?
 
#22
This may be a stupid question, or it may even be in the wrong place (or both), but why do we do it this way? Why not have them update every month?
Trends happen, sometimes someone gets popular for 40 days and goes back down. It would be dumb to move something manageable like Manectrike to OU when it was only very used for a while so theres a 3 month period before it's considered OverUsed.
 
#25
Because there are twice as many leads, and my stats currently can't handle that. It's also why there's no Checks & Counters data, because figuring out who killed what in a 2v2 situation is a hot mess.
I understand the Checks & Counters situation. Not to jinx it and see them gone (because they're insightful), but then why VGC, XYDoubles and DoublesSuspect have them?
 
Status
Not open for further replies.