Reinforcement Learning
Training agents to make decisions? Benchmark your policies on game playing, continuous control, and offline learning tasks.
3 tasks
2 datasets
9 results
Atari Games
Playing Atari video games (Atari 2600 benchmark).
1 datasets
9 results
SOTA: 40000 (human-normalized-score)
go-explore Suite of 57 Atari 2600 games. Standard benchmark for deep reinforcement learning agents.
Continuous Control
Control tasks with continuous action spaces (MuJoCo).
1 datasets
0 results
Physics engine for continuous control tasks like walking, running, and manipulation.
Offline RL
Learning from fixed datasets without environment interaction.
0 datasets
0 results
No datasets indexed yet. Contribute on GitHub