Utilizing Novelty-based Evolution Strategies to Train Transformers in Reinforcement Learning

Maty\'a\v{s} Lorenc; Roman Neruda

arxiv: 2502.06301 · v2 · pith:O5HSNSVZnew · submitted 2025-02-10 · 💻 cs.LG · cs.NE

Utilizing Novelty-based Evolution Strategies to Train Transformers in Reinforcement Learning

Maty\'a\v{s} Lorenc , Roman Neruda This is my paper

classification 💻 cs.LG cs.NE

keywords modelsnovelty-basedtrainingdecisionlargerlearningns-esnsr-es

0 comments

read the original abstract

In this paper, we experiment with novelty-based variants of OpenAI-ES, the NS-ES and NSR-ES algorithms, and evaluate their effectiveness in training complex, transformer-based architectures designed for the problem of reinforcement learning, such as Decision Transformers. We also test if we can accelerate the novelty-based training of these larger models by seeding the training with a pretrained models. The experimental results were mixed. NS-ES showed progress, but it would clearly need many more iterations for it to yield interesting agents. NSR-ES, on the other hand, proved quite capable of being straightforwardly used on larger models, since its performance appears as similar between the feed-forward model and Decision Transformer, as it was for the OpenAI-ES in our previous work.

This paper has not been read by Pith yet.

Utilizing Novelty-based Evolution Strategies to Train Transformers in Reinforcement Learning

discussion (0)