Longze Chen
lzchen2001
AI & ML interests
NLP & LLM
Recent Activity
upvoted a paper about 1 month ago
Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies
authored a paper 5 months ago
Implicit Actor Critic Coupling via a Supervised Learning Framework for RLVR
upvoted a paper 5 months ago
Implicit Actor Critic Coupling via a Supervised Learning Framework for RLVR