view article Article BioClinical ModernBERT: an example of continued pre-training of ModernBERT Sep 10, 2025 • 6
Running 3.62k The Ultra-Scale Playbook 🌌 3.62k The ultimate guide to training LLM on large GPU Clusters
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22, 2025 • 433
Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation Paper • 2412.03304 • Published Dec 4, 2024 • 19
aisingapore/Llama-SEA-LION-v2-8B-IT Text Generation • 8B • Updated Apr 15, 2025 • 578 • • 17