view article Article BigCodeArena: Judging code generations end to end with code executions Oct 7, 2025 • 19
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution Paper • 2510.08697 • Published Oct 9, 2025 • 36
Privacy-Preserving Tabular Synthetic Data Generation Using TabularARGN Paper • 2508.06647 • Published Aug 8, 2025 • 16
TabularARGN: A Flexible and Efficient Auto-Regressive Framework for Generating High-Fidelity Synthetic Data Paper • 2501.12012 • Published Jan 21, 2025 • 9
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22, 2025 • 434
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM +2 Mar 12, 2025 • 480
The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism Paper • 2407.10457 • Published Jul 15, 2024 • 24
Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer Paper • 2403.13570 • Published Mar 20, 2024 • 3
Instruction Pre-Training: Language Models are Supervised Multitask Learners Paper • 2406.14491 • Published Jun 20, 2024 • 95