shengjia-toronto's Collections
llama
tag-exploded
qwen3-4b
GFPO
Ablation L
SSFT

SSFT

updated 13 days ago

  • Training Large Language Models To Reason In Parallel With Global Forking Tokens

    Paper • 2510.05132 • Published Oct 1, 2025 • 1

  • shengjia-toronto/ssft32b_grpo_bs256_step10

    Updated 28 days ago • 468

  • shengjia-toronto/ssft-32B-N6

    Text Generation • 4B • Updated 19 days ago • 44