Safe Reinforcement Learning in a Simulated Robotic Arm • Paper 2312.09468 • Published Nov 28, 2023
RLDG: Robotic Generalist Policy Distillation via Reinforcement Learning • Paper 2412.09858 • Published Dec 13, 2024
Offline Reinforcement Learning as One Big Sequence Modeling Problem • Paper 2106.02039 • Published Jun 3, 2021
A Survey of Reinforcement Learning from Human Feedback • Paper 2312.14925 • Published Dec 22, 2023