tencent/VCB-Bench
Preview
•
Updated
•
1.45k
•
8
None defined yet.
Inference-Time Scaling of Verification: Self-Evolving Deep Research Agents via Test-Time Rubric-Guided Verification
Render-of-Thought: Rendering Textual Chain-of-Thought as Images for Visual Latent Reasoning