Jarrod Barnes
PRO
Jarrodbarnes
5 followers · 50 following
https://arc.computer
jarrodbarnes
jbarnes850
jarrodbarnes
AI & ML interests
Continual Learning, Reinforcement Learning
Recent Activity
Updated a model 2 days ago: Jarrodbarnes/opensec-gdpo-4b
Upvoted a collection 2 days ago: Nemotron-Post-Training-v3
Liked a dataset 3 days ago: ScalingIntelligence/KernelBench
Organizations
Articles (1)
Training LLM Agents to Act Under Adversarial Evidence with Multi-Reward Dual-Control RL • Article • 2
Papers (1)
arxiv:2511.01093
Spaces (2)
Sleeping • RL • OpenSec-Env 🚀
Sleeping • Trackio 🚀 • Display tracking information
Models (4)
Jarrodbarnes/opensec-gdpo-4b • Text Generation • 4B • Updated 2 days ago • 52
Jarrodbarnes/Qwen3-4B-tau2-grpo-v1 • Text Generation • 4B • Updated 14 days ago • 59
Jarrodbarnes/Qwen3-4B-tau2-sft1 • 4B • Updated 14 days ago • 22
Jarrodbarnes/Cortex-1-mini • Text Generation • Updated Mar 13, 2025 • 4 • 2
Datasets (6)
Jarrodbarnes/osworld-reasoning-sft-v1 • Preview • Updated 14 days ago • 30
Jarrodbarnes/osworld-train-v1 • Viewer • Updated 16 days ago • 66 • 17
Jarrodbarnes/tau2-sft-seed-v3 • Updated Dec 19, 2025 • 13
Jarrodbarnes/tau2-sft-final • Updated Dec 15, 2025 • 50
Jarrodbarnes/tau2-sft-v4-dataset • Viewer • Updated Nov 29, 2025 • 219 • 82
Jarrodbarnes/cortex-1-market-analysis • Viewer • Updated Mar 9, 2025 • 521 • 65 • 2