C2-Faith: Benchmarking LLM Judges for Causal and Coverage Faithfulness in Chain-of-Thought Reasoning


This is a companion discussion topic for the original entry at https://arxiv.org/abs/2603.05167