shengjia-toronto's Collections

SSFT

updated 15 days ago

  • Training Large Language Models To Reason In Parallel With Global Forking Tokens

    Paper • 2510.05132 • Published Oct 1, 2025 • 1

  • shengjia-toronto/ssft32b_grpo_bs256_step10

    Updated 30 days ago • 327

  • shengjia-toronto/ssft-32B-N6

    Text Generation • 4B • Updated 21 days ago • 35