sian cao
sonald
AI & ML interests
AI, big data, OS
Recent Activity
upvoted an article 10 days ago
Deriving the DPO Loss from First Principles
upvoted an article 12 days ago
Deriving the PPO Loss from First Principles
upvoted an article 15 days ago
From GRPO to DAPO and GSPO: What, Why, and How