% Bibliography for hypothesis h-13dc63ff74 % Title: whether debate-structured causal reasoning improves calibration over direct LLM baselines requires proximal validation % Generated: 2026-06-15T13:38:23Z