Reinforcement Learning

Training agents to make decisions? Benchmark your policies on game playing, continuous control, and offline learning tasks.

3 tasks, 2 datasets, 9 results

Atari Games

Playing Atari video games (Atari 2600 benchmark).

1 dataset, 9 results
Atari 2600: Arcade Learning Environment (Atari 2600), 2013
SOTA: 40000 (human-normalized score), Go-Explore

Suite of 57 Atari 2600 games. Standard benchmark for deep reinforcement learning agents.
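
The SOTA figure for this benchmark is given as a human-normalized score. A common definition rescales each game's raw score so that random play maps to 0 and a human reference maps to 100, then aggregates across games. The sketch below assumes that percent convention; this page does not state whether its figure is a percent or a fraction, or whether it is a mean or a median. The game names and raw scores are made up for illustration.

```python
from statistics import mean, median


def human_normalized_score(agent: float, random: float, human: float) -> float:
    """Return the score in percent: 0 = random play, 100 = human reference."""
    if human == random:
        raise ValueError("human and random reference scores must differ")
    return 100.0 * (agent - random) / (human - random)


# Hypothetical per-game raw scores: (agent, random baseline, human reference).
# These values are illustrative only and not taken from any published result.
example_scores = {
    "game_a": (400.0, 0.0, 200.0),
    "game_b": (30.0, 10.0, 50.0),
    "game_c": (1500.0, 500.0, 1000.0),
}

per_game = {name: human_normalized_score(*s) for name, s in example_scores.items()}
values = list(per_game.values())
summary = {"mean": mean(values), "median": median(values)}

print(per_game)
print(summary)
```

The mean is sensitive to a few games with very large normalized scores, which is one reason the median is often reported alongside it.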

Continuous Control

Control tasks with continuous action spaces (MuJoCo).

1 dataset, 0 results
MuJoCo: Multi-Joint dynamics with Contact, 2012

Physics engine for continuous control tasks like walking, running, and manipulation.
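
As a rough illustration of what a continuous action space means for a policy, the sketch below samples a real-valued action vector from a diagonal Gaussian and squashes it into per-dimension bounds with tanh. This is one common approach, not something this page prescribes. The observation, weights, bounds, and function names are all hypothetical, and no MuJoCo API is used.

```python
import math
import random


def gaussian_policy(obs, weights, log_std, rng):
    """Sample an unbounded action from a diagonal Gaussian with a linear mean."""
    means = [sum(w * o for w, o in zip(row, obs)) for row in weights]
    std = math.exp(log_std)
    return [rng.gauss(m, std) for m in means]


def squash_to_bounds(raw, low, high):
    """Map unbounded values into [low, high] per dimension via tanh."""
    return [
        lo + (math.tanh(x) + 1.0) * 0.5 * (hi - lo)
        for x, lo, hi in zip(raw, low, high)
    ]


rng = random.Random(0)                          # fixed seed for repeatability
obs = [0.1, -0.3, 0.5]                          # hypothetical 3-d observation
weights = [[0.5, 0.0, -0.2], [0.1, 0.4, 0.3]]   # hypothetical 2x3 policy weights
low, high = [-1.0, -1.0], [1.0, 1.0]            # assumed action bounds

raw_action = gaussian_policy(obs, weights, log_std=-0.5, rng=rng)
action = squash_to_bounds(raw_action, low, high)
print(action)
```

Unlike the discrete button presses in Atari, each action here is a vector of real numbers, so the policy outputs a distribution over a continuous range rather than a choice among a fixed set.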

Offline RL

Learning from fixed datasets without environment interaction.

0 datasets, 0 results
No datasets indexed yet. Contribute on GitHub
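
To make "learning from fixed datasets without environment interaction" concrete, here is a minimal sketch of tabular Q-iteration over a logged set of transitions. The three-state chain and its transitions are hypothetical. This is plain batch Q-learning: it bootstraps over all actions, including ones that may be absent from the data, a weakness that practical offline RL methods add safeguards against.

```python
from collections import defaultdict

# Hypothetical logged transitions: (state, action, reward, next_state, done).
# Chain of states 0 -> 1 -> 2; action 1 moves right, action 0 stays in place.
dataset = [
    (0, 1, 0.0, 1, False),
    (0, 0, 0.0, 0, False),
    (1, 1, 1.0, 2, True),
    (1, 0, 0.0, 1, False),
]
ACTIONS = (0, 1)


def fit_q(dataset, gamma=0.9, sweeps=50):
    """Repeat Bellman backups over the fixed dataset; no environment is queried."""
    q = defaultdict(float)
    for _ in range(sweeps):
        for s, a, r, s_next, done in dataset:
            # Terminal transitions do not bootstrap from the next state.
            bootstrap = 0.0 if done else max(q[(s_next, b)] for b in ACTIONS)
            # Direct assignment is valid here because the toy data is deterministic.
            q[(s, a)] = r + gamma * bootstrap
    return q


q = fit_q(dataset)
# Greedy policy for the states that appear in the dataset.
policy = {s: max(ACTIONS, key=lambda a: q[(s, a)]) for s in (0, 1)}
print(policy)
```

The learned greedy policy moves right in both states, which is the only way to collect the reward in this toy dataset.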