Yale-ROSE/Qwen3-4B-dimacs_cube-sft_gpt-oss-120b-dpo_gpt-oss-120b_reasoning_grpo-v2 Text Generation • 4B • Updated Sep 20, 2025
Yale-ROSE/Qwen3-4B-dpo_gpt-oss-120b_8k_reasoning_ablation Text Generation • 4B • Updated Sep 19, 2025 • 1
Yale-ROSE/Qwen3-4B-dimacs_cube-sft_gpt-oss-120b-dpo_gpt-oss-120b_reasoning_grpo-v1 Text Generation • 4B • Updated Sep 18, 2025
Yale-ROSE/Qwen3-4B-dimacs_cube-sft_gpt-oss-120b-dpo_gpt-oss-120b_reasoning-v1 Text Generation • 4B • Updated Sep 15, 2025 • 1