FlowBlending: Stage-Aware Multi-Model Sampling for Fast and High-Fidelity Video Generation Paper β’ 2512.24724 β’ Published 3 days ago β’ 2
Dream2Flow: Bridging Video Generation and Open-World Manipulation with 3D Object Flow Paper β’ 2512.24766 β’ Published 3 days ago β’ 2
Self-Evaluation Unlocks Any-Step Text-to-Image Generation Paper β’ 2512.22374 β’ Published 7 days ago β’ 14
What matters for Representation Alignment: Global Information or Spatial Structure? Paper β’ 2512.10794 β’ Published 23 days ago β’ 8
MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos Paper β’ 2512.10881 β’ Published 23 days ago β’ 29
ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models Paper β’ 2512.07843 β’ Published Nov 24, 2025 β’ 21
SyncMV4D: Synchronized Multi-view Joint Diffusion of Appearance and Motion for Hand-Object Interaction Synthesis Paper β’ 2511.19319 β’ Published Nov 24, 2025 β’ 1
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution Paper β’ 2510.08697 β’ Published Oct 9, 2025 β’ 36
TTT3R: 3D Reconstruction as Test-Time Training Paper β’ 2509.26645 β’ Published Sep 30, 2025 β’ 14
UP2You: Fast Reconstruction of Yourself from Unconstrained Photo Collections Paper β’ 2509.24817 β’ Published Sep 29, 2025 β’ 8
See, Point, Fly: A Learning-Free VLM Framework for Universal Unmanned Aerial Navigation Paper β’ 2509.22653 β’ Published Sep 26, 2025 β’ 24
view post Post 585 Qwen 3 Coder is a personal attack to k2, and I love it.It achieves near SOTA on LCB while not having reasoning.Finally people are understanding that reasoning isnt necessary for high benches...Qwen ftw!DECENTRALIZE DECENTRALIZE DECENTRALIZE See translation π 6 6 π₯ 4 4 + Reply
Towards Video Thinking Test: A Holistic Benchmark for Advanced Video Reasoning and Understanding Paper β’ 2507.15028 β’ Published Jul 20, 2025 β’ 21
SplArt: Articulation Estimation and Part-Level Reconstruction with 3D Gaussian Splatting Paper β’ 2506.03594 β’ Published Jun 4, 2025
view post Post 3092 deepseek-ai/DeepSeek-R1-0528This is the end See translation 1 reply Β· π€ 7 7 β€οΈ 1 1 + Reply
Feat2GS: Probing Visual Foundation Models with Gaussian Splatting Paper β’ 2412.09606 β’ Published Dec 12, 2024 β’ 2