SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4, 2025 • 253
Moonlight-A3B Collection Moonshot's compute-efficient MoE LLM, the first scaling-up of the Muon optimizer • 3 items • Updated Nov 2, 2025 • 9
Kimi k1.5: Scaling Reinforcement Learning with LLMs Paper • 2501.12599 • Published Jan 22, 2025 • 126
Article The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator • 24 days ago • 44
Nemotron v3 Pre-Training Collection Large-scale pre-training datasets used in the Nemotron family of models. • 11 items • Updated 18 days ago • 7
Common Pile v0.1 Collection All resources related to Common Pile v0.1, an 8TB dataset of public domain and openly licensed text • 4 items • Updated Jun 6, 2025 • 39
Article Nemotron 3 Nano - A New Standard for Efficient, Open, and Intelligent Agentic Models • 26 days ago • 104
ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding Paper • 2512.13586 • Published 26 days ago • 88
Nemotron-Post-Training-v3 Collection Datasets used in the post-training phase of Nemotron Nano v3. • 7 items • Updated 18 days ago • 56
NVIDIA Nemotron v3 Collection Open, production-ready enterprise models • 6 items • Updated 11 days ago • 117
Nemotron-Pre-Training-Datasets Collection Large-scale pre-training datasets used in the Nemotron family of models. • 11 items • Updated 18 days ago • 91
NeMo Gym Collection Verifiable RL data for NeMo Gym • 13 items • Updated 18 days ago • 34
Devstral 2 Collection Agentic LLMs for software engineering tasks, excelling at using tools to explore codebases, edit multiple files, and power SWE agents. • 3 items • Updated Dec 9, 2025 • 38
Article Transformers v5: Simple model definitions powering the AI ecosystem • Dec 1, 2025 • 270
Mistral Large 3 Collection A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated Dec 2, 2025 • 82