-
Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers
Paper • 2409.04109 • Published • 48 -
Training Language Models to Self-Correct via Reinforcement Learning
Paper • 2409.12917 • Published • 140 -
Reward-Robust RLHF in LLMs
Paper • 2409.15360 • Published • 6 -
EuroLLM: Multilingual Language Models for Europe
Paper • 2409.16235 • Published • 29
Haote Yang
Hoter
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
2 days ago
Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits
upvoted
a
paper
2 days ago
K-EXAONE Technical Report
liked
a dataset
about 2 months ago
walktaster/LTD_Bench
Organizations
None yet