This is my first PR proposal, sorry if there's an issue! just let me know.
The Problem
I have been calculating the rises and drops between Ubers and Ubers UU myself, as well as the usage stats for Ubers UU, since the switch from 1 month to 3 month tiering in generation 9 (starting in q2 of 2024). However, some people have noted that my numbers don't always line up with what they'd assume. They took the average of the pokemon across three months and called that the statistic. For example, let's look at Kyurem-White in q3 of 2024:
(2.16% in July + 4.73% in August + 11.72% in September)/3=6.20%
So if there was ever a usage based tier below us (if "Ubers RU" became official for example) Kyruem-White is at no risk of dropping down, it's a solid Ubers UU 'mon. But is that a truly objective description of how much Kyruem-White was used?
In July, we had 3186 games played. In August, we had 4449 games played, a 40% increase! In September, the numbers plumeted down to 2861! Thats also when it had its highest usage in terms of percent. One way to handle this is to weigh each month by number of games played.
If we re-do the kyurem-white numbers they get to 5.80%. A 0.4 drop is pretty significant when you're only 1 or two percentage points away from the cutoff. In reality, people used kyurem-white less than 6% of the time, but if you lump all 2861 games into "one block" and give the same weight to that block as one made up of almost twice as many games, there are bound to be some discrepancies.
This conversation began here between me and Marty, and I think it would be an easy to implement measure for all tiering that would be preventative of future problems, with little to no detriment for the future, especially if each generation we continue switching between 1 month and 3 month tiering (which I fully support and don't want to discuss in this thread). I would just like to re-open the conversation and ask if weighing each months usage by games played is as simple, obvious, and beneficial as I see it.
The Benefits
Preventing Disaster
Put simply, what if there's some big problem on pokemon showdown, and one or more metagames are unplayable for a significant amount of one month? In the extreme case, should one game in month one be treated on equal level to a hundred thousand the next? That feels ridiculous, but preparation against this sort of fiasco would be excellent.
Makes 3 month tiering more similar to 1 month
In the 1-month system, it does not matter whether you use your team at the beginning or end of the month. the 1st and the 31st are weighted equally. Imagine if instead, the people who played at the end of the month got their usage counted more than the people at the beginning? That seems wacky, but its the system we currently have in place. We've accidentally decided that 1 month is the unit to chunk these stats just because we wanted to extend the total stat collection time. This fixes the problem.
Helps represent reality
Let's say some big event, like a suspect test, viral video, or big competition begins and one month sees a ton of usage compared to the other two. Shouldn't that be represented? We almost have an "electoral college"-like system (it's a US politics reference) going on where the highest month (most populated state) has each individual team (voter) count the least. Every team should have an equal representation. This is especially important for non-OU/randbats tiers where usage and total games played are much more variable
It's simple
This would be a tiny backend change that almost no one would notice. I'd be truly shocked if implementation was a limiting factor. It also won't matter for 99% of cases, and very consistent metagames like current gen OU probably would tier exactly the same.
Anticipating Counter-Arguments
As Marty first suggested,
The Problem
I have been calculating the rises and drops between Ubers and Ubers UU myself, as well as the usage stats for Ubers UU, since the switch from 1 month to 3 month tiering in generation 9 (starting in q2 of 2024). However, some people have noted that my numbers don't always line up with what they'd assume. They took the average of the pokemon across three months and called that the statistic. For example, let's look at Kyurem-White in q3 of 2024:
(2.16% in July + 4.73% in August + 11.72% in September)/3=6.20%
So if there was ever a usage based tier below us (if "Ubers RU" became official for example) Kyruem-White is at no risk of dropping down, it's a solid Ubers UU 'mon. But is that a truly objective description of how much Kyruem-White was used?
In July, we had 3186 games played. In August, we had 4449 games played, a 40% increase! In September, the numbers plumeted down to 2861! Thats also when it had its highest usage in terms of percent. One way to handle this is to weigh each month by number of games played.
If we re-do the kyurem-white numbers they get to 5.80%. A 0.4 drop is pretty significant when you're only 1 or two percentage points away from the cutoff. In reality, people used kyurem-white less than 6% of the time, but if you lump all 2861 games into "one block" and give the same weight to that block as one made up of almost twice as many games, there are bound to be some discrepancies.
This conversation began here between me and Marty, and I think it would be an easy to implement measure for all tiering that would be preventative of future problems, with little to no detriment for the future, especially if each generation we continue switching between 1 month and 3 month tiering (which I fully support and don't want to discuss in this thread). I would just like to re-open the conversation and ask if weighing each months usage by games played is as simple, obvious, and beneficial as I see it.
The Benefits
Preventing Disaster
Put simply, what if there's some big problem on pokemon showdown, and one or more metagames are unplayable for a significant amount of one month? In the extreme case, should one game in month one be treated on equal level to a hundred thousand the next? That feels ridiculous, but preparation against this sort of fiasco would be excellent.
Makes 3 month tiering more similar to 1 month
In the 1-month system, it does not matter whether you use your team at the beginning or end of the month. the 1st and the 31st are weighted equally. Imagine if instead, the people who played at the end of the month got their usage counted more than the people at the beginning? That seems wacky, but its the system we currently have in place. We've accidentally decided that 1 month is the unit to chunk these stats just because we wanted to extend the total stat collection time. This fixes the problem.
Helps represent reality
Let's say some big event, like a suspect test, viral video, or big competition begins and one month sees a ton of usage compared to the other two. Shouldn't that be represented? We almost have an "electoral college"-like system (it's a US politics reference) going on where the highest month (most populated state) has each individual team (voter) count the least. Every team should have an equal representation. This is especially important for non-OU/randbats tiers where usage and total games played are much more variable
It's simple
This would be a tiny backend change that almost no one would notice. I'd be truly shocked if implementation was a limiting factor. It also won't matter for 99% of cases, and very consistent metagames like current gen OU probably would tier exactly the same.
Anticipating Counter-Arguments
As Marty first suggested,
That will not always be the case, especially in lower tiers. Individual tiers may have events such as suspects, tournaments, or memes at any point in the timeline that can inflate one month over another. I'm also not thinking of it as "the first month of the three be worth the most" but rather "why should one game in a low-gameplay month be worth more to the overall usage stats than one game in a high-gameplay month?" Inherently, if you don't factor in the amount of games, the impact of each game is greater in the lower months. That doesn't feel to me like "usage."Each month has fewer games than the one prior, that's just how every format pans out as interest goes down throughout the generation. Having the first month of the three be worth the most when the meta has had two more months to develop since then just seems bad on paper.
Believe me I am trying that as well for Ubers UU, but the fact is that ladders with less games are more susceptible to these trends, and it would help people like us the most. I also believe it wouldn't really affect higher tiers, so why not?This is only a problem in lower tiers like yours that have low gameplay. Just get more games and you'll fix this problem.