• The moderator of this forum is jetou.
  • Welcome to Smogon! Take a moment to read the Introduction to Smogon for a run-down on everything Smogon, and make sure you take some time to read the global rules.

Programming Reinforcement Learning Battle Data

As the title states, I am currently trying to train a pokemon singles battle bot with RL (PPO + MCTS). But I have realized recently that training from scratch through selfplay is very time consuming and pretty impractical. I am achieving steady rise in reward meaning that the model is learning, but it is learning too slowly. Based off of prior reading and my own estimations, it would take at least 100 million steps to reach a relatively good bot, but right now it takes about 8 hours to run 1 million steps. This is unsustainable.

I believe the limitation is because pokemon showdown runs on cpu and thus there isn't really a way to speed it up(without using more resources). Thus my idea was to train a simple base model first with existing battle data. But I was unable to find any data on this.

Do you guys know where I could find datasets that contain entire battle histories (all the battle states, team info, etc.)?

Or if you have any other ideas I would really appreciate the help.

Thanks!
 
Unfortunately, PS data grants for battle logs are not being offered currently.

If PS is too slow you could instead try using @pkmn/engine and see if it meets your performance needs.
 
  • Wow
Reactions: tko
As the title states, I am currently trying to train a pokemon singles battle bot with RL (PPO + MCTS). But I have realized recently that training from scratch through selfplay is very time consuming and pretty impractical. I am achieving steady rise in reward meaning that the model is learning, but it is learning too slowly. Based off of prior reading and my own estimations, it would take at least 100 million steps to reach a relatively good bot, but right now it takes about 8 hours to run 1 million steps. This is unsustainable.

I believe the limitation is because pokemon showdown runs on cpu and thus there isn't really a way to speed it up(without using more resources). Thus my idea was to train a simple base model first with existing battle data. But I was unable to find any data on this.

Do you guys know where I could find datasets that contain entire battle histories (all the battle states, team info, etc.)?

Or if you have any other ideas I would really appreciate the help.

Thanks!


Reinforcement learning applied to Pokémon battles is a brilliant way to merge gaming with AI theory. For anyone researching or writing on such topics, truowl.com offers valuable writing tools and support to articulate complex ideas clearly. Whether you're into machine learning or competitive battling, strong explanations and solid structure make your work stand out.
How is everything now?

Edit: Seems good.
 
Last edited:
Back
Top