sian cao
sonald
AI & ML interests
AI, big data, OS
Recent Activity
upvoted
an
article
8 days ago
Deriving the DPO Loss from First Principles
upvoted
an
article
10 days ago
Deriving the PPO Loss from First Principles
upvoted
an
article
13 days ago
From GRPO to DAPO and GSPO: What, Why, and How