As the title says, someone simulated 87,000 Pokemon Red NPC trainer vs. NPC trainer battles, and then created an ELO ranking and tier lists for how well they did against other NPC trainers.
The youtuber's process was roughly:
- Select two trainers, Trainers 1 & 2, to perform a battle (initially randomly, later sequentially).
- Load a save state in an emulator against Trainer 2, with Trainer 1's Pokemon in the position of the player. Load another save state in a second emulator, but with the positions of the two trainers reversed.
- Whenever the player needs to make another decision, load up the second emulator and check to see what option it would choose. Input that decision in the first emulator.
- Make some adjustments (e.g. disable exp gain) to make the trainer in the player position more like an NPC trainer.
- Repeat until someone wins, no damage is done over 75 turns, or the battle hasn't ended after 1,000 turns.
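The loop above can be sketched in a few lines of Python. This is only my guess at the shape of it, not the youtuber's actual code: `mirror_ai_move` and `apply_turn` are hypothetical callbacks standing in for the two emulators, and I'm assuming "no damage is done over 75 turns" means 75 turns in a row without damage.

```python
def run_battle(mirror_ai_move, apply_turn, max_turns=1000, stall_limit=75):
    """Run one simulated battle and return (outcome, turns_played).

    mirror_ai_move(): hypothetical callback that asks the second (mirrored)
        emulator which move the game's AI would pick for Trainer 1.
    apply_turn(move): hypothetical callback that inputs that move in the
        first emulator and returns (damage_dealt, winner_or_None).
    """
    stalled = 0  # consecutive turns with no damage (assumed interpretation)
    for turn in range(1, max_turns + 1):
        move = mirror_ai_move()
        damage, winner = apply_turn(move)
        if winner is not None:
            return winner, turn
        stalled = 0 if damage > 0 else stalled + 1
        if stalled >= stall_limit:
            return "draw_no_damage", turn
    return "draw_turn_limit", max_turns


# Example with stub callbacks: nobody ever deals damage, so the battle stalls out.
outcome, turns = run_battle(lambda: "SPLASH", lambda move: (0, None))
print(outcome, turns)
```

With stubs like these the battle ends on the stall rule long before the 1,000-turn cap is reached.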
Summary + some of my impressions from watching the video:
- RBY AI is terrible as expected, but surprisingly some Cool Trainers had a decision tree for switching out Pokemon.
- 4:24 basically confirms the AI cheats: if the player moves first, the AI doesn't determine its move until after the player has chosen theirs.
- Based on the ELO rankings of the different iterations of Gary with different starters, Bulbasaur is the best starter until very late in the game.
- Lorelei (pokemon levels 53-56) is only 2 ELO points above Gary during the rival battle in the Pokemon Tower (pokemon levels 20-25).
- Sabrina is OP as a 5th gym (13th-highest ELO overall). She is the highest of all gyms and is also above Bruno.
- In general the Gym Leaders and rival fights are stronger than their surrounding trainers.
- There is a Victory Road trainer (Juggler, lvl 48 Mr. Mime) that has a lower ranking than SS Anne Gary (lvl 16-20). There is a Scientist with a level 33 Electrode with a lower ELO than a Bug Catcher with a level 14 Caterpie and Weedle. There is a Hiker with a level 25 Geodude with a lower ELO than a Lass with two level 10 basic Pokemon.
- The D-tier Lass with a level 11 Oddish was robbed. She could have beaten B- tier Beauty's Level 29 Ivysaur if the youtuber hadn't implemented a 1,000-turn limit before calling a match a draw.
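For anyone wondering what those ELO gaps actually mean, here is a minimal sketch of the standard rating formula. The 400-point scale and K = 32 are the usual textbook defaults and are my assumption; I don't know which constants the video used, and the 1500/1502 ratings below are made up for illustration.

```python
def expected_score(rating_a, rating_b):
    """Expected score (roughly, win probability) of A against B."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400))


def update_ratings(rating_a, rating_b, score_a, k=32):
    """Return new (rating_a, rating_b) after one battle.

    score_a is 1.0 if A won, 0.0 if A lost, 0.5 for a draw.
    k=32 is an assumed K-factor, not taken from the video.
    """
    exp_a = expected_score(rating_a, rating_b)
    delta = k * (score_a - exp_a)
    return rating_a + delta, rating_b - delta


# A 2-point gap (like Lorelei vs. Pokemon Tower Gary) is nearly a coin flip.
print(round(expected_score(1502, 1500), 3))
```

Under this formula a 2-point lead works out to an expected score of about 50.3%, so those two are effectively tied.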