Xiaoyu Tan's picture

9 13

Xiaoyu Tan

WIlliam1900

·

https://scholar.google.com/citations?user=ftq5rBYAAAAJ&hl=en

AI & ML interests

None yet

Recent Activity

authored a paper 1 day ago

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning

authored a paper 1 day ago

The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward

authored a paper 1 day ago

AURORA:Automated Training Framework of Universal Process Reward Models via Ensemble Prompting and Reverse Verification

View all activity

Organizations

Papers 11

arxiv:2512.24618

arxiv:2512.24615

arxiv:2512.22322

arxiv:2510.08191

models 0

None public yet

datasets 0

None public yet