Xiaoyu Tan
WIlliam1900
2 followers · 5 following
https://scholar.google.com/citations?user=ftq5rBYAAAAJ&hl=en
AI & ML interests
None yet
Recent Activity
Authored a paper 1 day ago: Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning
Authored a paper 1 day ago: The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward
Authored a paper 1 day ago: AURORA:Automated Training Framework of Universal Process Reward Models via Ensemble Prompting and Reverse Verification
Organizations
Papers (11)
arxiv:2512.24618
arxiv:2512.24615
arxiv:2512.22322
arxiv:2510.08191
Models (0): none public yet
Datasets (0): none public yet