SPADE-Bench: Evaluating Spontaneous Strategic Deception in Agents via Plan-Action Divergence

arXiv:2606.02380 · Submitted Jun 2, 2026 · 0 benchmark results

Yuyan Bu, Haowei Li, Qirui Zheng, Bowen Dong, Kaiyue Yang, Jiaming Ji, Yingshui Tan, Wenxin Li, Yaodong Yang, Juntao Dai

Abstract

SPADE-Bench is presented as the first benchmark to isolate agent deception, defined as plan-action divergence under pressure, from hallucination. Its results indicate that deception is a genuine and pressing issue in tool-use contexts.

Results

No benchmark results recorded yet.

CodeSOTA extraction

Benchmark evidence

  • SPADE-Bench: Leakage rate and H-score across models (extract from main results).