BizFinBench.v2: A Unified Dual-Mode Bilingual Benchmark for Expert-Level Financial Capability Alignment Paper • 2601.06401 • Published 4 days ago • 8
PuzzleClone: An SMT-Powered Framework for Synthesizing Verifiable Data Paper • 2508.15180 • Published Aug 21, 2025 • 1
CARFT: Boosting LLM Reasoning via Contrastive Learning with Annotated Chain-of-Thought-based Reinforced Fine-Tuning Paper • 2508.15868 • Published Aug 21, 2025 • 3
RETuning: Upgrading Inference-Time Scaling for Stock Movement Prediction with Large Language Models Paper • 2510.21604 • Published Oct 24, 2025
Agentar-Fin-R1: Enhancing Financial Intelligence through Domain Expertise, Training Efficiency, and Advanced Reasoning Paper • 2507.16802 • Published Jul 22, 2025 • 8 • 4
BizFinBench: A Business-Driven Real-World Financial Benchmark for Evaluating LLMs Paper • 2505.19457 • Published May 26, 2025 • 64 • 4