Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
aayush garg's picture
In a Training Loop ๐Ÿ”„
10 20

aayush garg PRO

garg-aayush
somusan's profile picture ljupco's profile picture victor's profile picture
ยท
https://aayushgarg.dev/
  • Aayush_ander
  • garg-aayush
  • aayush-garg-8b26a734

AI & ML interests

None yet

Recent Activity

published an article 8 days ago
Understanding GRPO: PPO without the critic
upvoted an article 9 days ago
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge
published an article 10 days ago
Deriving the DPO Loss from First Principles
View all activity

Organizations

Jiffy.com's profile picture Hugging Face MCP Course's profile picture

garg-aayush 's datasets 4

garg-aayush/sft-cs336-assign5-datasets

Preview โ€ข Updated Dec 2, 2025 โ€ข 88

garg-aayush/GPT4-LLM-Cleaned-10K

Viewer โ€ข Updated May 24, 2024 โ€ข 10k โ€ข 19

garg-aayush/ultrachat-refined-100K-2048

Viewer โ€ข Updated Apr 23, 2024 โ€ข 110k โ€ข 14

garg-aayush/mini-platypus-1K

Viewer โ€ข Updated Apr 18, 2024 โ€ข 1k โ€ข 15 โ€ข 1
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs