jacobmorrison/dpo-yolo1-200k-gpt4.1-judge-2weak2strong-maxdelta_rejected-DECON-remove-gemma3 Viewer • Updated Oct 14, 2025 • 182k • 8
jacobmorrison/Nemotron-Post-Training-Dataset-v2-reasoning-chat Viewer • Updated Aug 27, 2025 • 546k • 36
jacobmorrison/olmo-2-1124-7b-preference-mix-filtered-overlapping Viewer • Updated Aug 12, 2025 • 258k • 9
jacobmorrison/qwen3-30b-3a-coder-no-reasoning-combined-outputs Viewer • Updated Aug 12, 2025 • 2M • 67