Beyond Goodhart's Law: A Dynamic Benchmark for Evaluating Compliance in Multi-Agent Systems


This is a companion discussion topic for the original entry at https://arxiv.org/abs/2606.07805