Dr. Zero: Self-Evolving Search Agents without Training Data Paper • 2601.07055 • Published 10 days ago • 17
huihui-ai/Huihui-Qwen3-VL-30B-A3B-Instruct-abliterated Image-Text-to-Text • 31B • Updated Dec 15, 2025 • 1.61k • 80
view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) +2 Dec 9, 2022 • 392
Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models Paper • 2512.24618 • Published 21 days ago • 138
Is There a Better Source Distribution than Gaussian? Exploring Source Distributions for Image Flow Matching Paper • 2512.18184 • Published Dec 20, 2025 • 20