In a Training Loop
96.8 TFLOPS
aayush garg
PRO
garg-aayush
16 followers · 16 following
https://aayushgarg.dev/
Aayush_ander
garg-aayush
aayush-garg-8b26a734
AI & ML interests
None yet
Recent Activity
Published an article 8 days ago: Understanding GRPO: PPO without the critic
Upvoted an article 9 days ago: DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge
Published an article 10 days ago: Deriving the DPO Loss from First Principles
Organizations
garg-aayush's datasets (4)
garg-aayush/sft-cs336-assign5-datasets · Preview · Updated Dec 2, 2025 · 88
garg-aayush/GPT4-LLM-Cleaned-10K · Viewer · Updated May 24, 2024 · 10k · 19
garg-aayush/ultrachat-refined-100K-2048 · Viewer · Updated Apr 23, 2024 · 110k · 14
garg-aayush/mini-platypus-1K · Viewer · Updated Apr 18, 2024 · 1k · 15 · 1