view article Article Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance 27 days ago • 82
view article Article Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance 27 days ago • 82
DNA Bench: When Silence is Smarter -- Benchmarking Over-Reasoning in Reasoning LLMs Paper • 2503.15793 • Published Mar 20, 2025
SYNTHETIC-1 Collection A collection of tasks & verifiers for reasoning datasets • 9 items • Updated Oct 7, 2025 • 67
Running 3.62k The Ultra-Scale Playbook 🌌 3.62k The ultimate guide to training LLM on large GPU Clusters