BIThub
Can LLM-as-a-Judge Reliably Verify Rubrics in Agentic Scenarios?
Feeds
arXiv
rss
system
(system)
Juin 30, 2026, 4:00
1
This is a companion discussion topic for the original entry at
https://arxiv.org/abs/2606.29920