Article Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval +1 Mar 22, 2024 • 110
Article Why We Built VIBE Bench: Rethinking Evaluation for Real Workloads about 8 hours ago • 4
Article Diversity Vs Density: A strategy comparison for fine-tuning VLMs about 16 hours ago • 3
CHURRO: Making History Readable with an Open-Weight Large Vision-Language Model for High-Accuracy, Low-Cost Historical Text Recognition Paper • 2509.19768 • Published Sep 24, 2025 • 5
FiNERweb: Datasets and Artifacts for Scalable Multilingual Named Entity Recognition Paper • 2512.13884 • Published 22 days ago • 14
fiNERweb Collection A multilingual dataset for NER covering 91 languages and 25 scripts • 3 items • Updated 21 days ago • 1
Datasets Wrapped 2025: Reasoning Collection The reasoning datasets that defined 2025. Part 1 of Datasets Wrapped 2025. #DatasetsWrapped2025 • 20 items • Updated 21 days ago • 1
NeMo Gym Collection Collection of RL verifiable data for NeMo Gym • 13 items • Updated 14 days ago • 32
Nemotron-Post-Training-v3 Collection Collection of datasets used in the post-training phase of Nemotron Nano v3. • 7 items • Updated 14 days ago • 55
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 6 items • Updated 6 days ago • 113
Article Nemotron 3 Nano - A new Standard for Efficient, Open, and Intelligent Agentic Models 22 days ago • 104
Olmo 3.1 Collection The latest members of the Olmo 3 family: another 3 weeks of RL for 32B Think, the 32B Instruct model, large post-training research datasets... • 9 items • Updated 14 days ago • 42
Ministral 3 Collection Mistral Ministral 3: new multimodal models in Base, Instruct, and Reasoning variants, available in 3B, 8B, and 14B sizes. • 36 items • Updated 13 days ago • 26