Not All Errors Are Equal: Consequence-Aware Reasoning Compute Allocation

arXiv:2606.04402 · Submitted Jun 4, 2026 · 0 benchmark results

Authors pending

Abstract

Proposes consequence-aware test-time compute allocation; on SWE-bench Lite, reduces cost-weighted loss by 22–33% vs. difficulty-only routing.
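
The abstract does not define its cost-weighted loss or its routing rule, so the following is only a minimal sketch of one possible reading, not the paper's method. It assumes each task has a failure probability under low and high compute and a cost incurred on failure, and that a fixed budget lets exactly k tasks receive the high-compute setting. The `Task` fields, the two routing functions, and the toy task list are all hypothetical and are not drawn from SWE-bench Lite.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Task:
    name: str
    p_fail_low: float   # assumed failure probability with low compute
    p_fail_high: float  # assumed failure probability with high compute
    cost: float         # assumed cost incurred if the task fails


def expected_loss(tasks, boosted):
    """Expected cost-weighted loss: sum of cost * failure probability."""
    total = 0.0
    for t in tasks:
        p = t.p_fail_high if t.name in boosted else t.p_fail_low
        total += t.cost * p
    return total


def route_by_difficulty(tasks, k):
    """Difficulty-only baseline: boost the k tasks most likely to fail."""
    ranked = sorted(tasks, key=lambda t: t.p_fail_low, reverse=True)
    return {t.name for t in ranked[:k]}


def route_by_consequence(tasks, k):
    """Consequence-aware variant: boost the k tasks whose extra compute
    removes the most expected cost."""
    ranked = sorted(
        tasks,
        key=lambda t: t.cost * (t.p_fail_low - t.p_fail_high),
        reverse=True,
    )
    return {t.name for t in ranked[:k]}


# Hypothetical toy tasks, invented purely for illustration.
TASKS = [
    Task("typo-fix", 0.50, 0.30, 1.0),
    Task("auth-patch", 0.30, 0.10, 20.0),
    Task("log-format", 0.60, 0.40, 1.0),
    Task("billing-bug", 0.25, 0.10, 15.0),
]

if __name__ == "__main__":
    for router in (route_by_difficulty, route_by_consequence):
        boosted = route_by_consequence(TASKS, 2) if router is route_by_consequence else route_by_difficulty(TASKS, 2)
        print(router.__name__, sorted(boosted), expected_loss(TASKS, boosted))
```

In this toy setup the difficulty-only router spends its budget on the tasks that fail most often even though they are cheap to get wrong, while the consequence-aware router spends it where a failure is expensive. The size of the gap here is an artifact of the invented numbers and says nothing about the 22–33% figure reported in the abstract.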

Tasks

Results

No benchmark results recorded yet.

