Reinforcement Learning
Training agents to make decisions? Benchmark your policies on game playing, continuous control, and offline learning tasks.
3 tasks, 2 datasets
Tasks in Reinforcement Learning
Explore Other Areas
Computer Vision
Building systems that understand images and video? Find benchmarks for recognition, detection, segmentation, and document analysis tasks.
Natural Language Processing
Processing and understanding text? Evaluate your models on language understanding, generation, translation, and information extraction benchmarks.
Reasoning
Testing whether your model can think logically? Benchmark math problem solving, commonsense understanding, and multi-step reasoning capabilities.
Computer Code
Developing AI coding assistants? Test code generation, completion, translation, bug detection, and repair capabilities.