marcelweiss's Collections
Robotics
Cosmos World Foundation Model Platform for Physical AI
Paper • 2501.03575 • Published • 81
VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM
Paper • 2501.00599 • Published • 46
OmniManip: Towards General Robotic Manipulation via Object-Centric Interaction Primitives as Spatial Constraints
Paper • 2501.03841 • Published • 56
Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives
Paper • 2501.04003 • Published • 27
Real2Render2Real: Scaling Robot Data Without Dynamics Simulation or Robot Hardware
Paper • 2505.09601 • Published • 6
villa-X: Enhancing Latent Action Modeling in Vision-Language-Action Models
Paper • 2507.23682 • Published • 23
MolmoAct: Action Reasoning Models that can Reason in Space
Paper • 2508.07917 • Published • 44
Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation
Paper • 2508.05635 • Published • 73
PANORAMA: The Rise of Omnidirectional Vision in the Embodied AI Era
Paper • 2509.12989 • Published • 28
FLOWER: Democratizing Generalist Robot Policies with Efficient Vision-Language-Action Flow Policies
Paper • 2509.04996 • Published • 13
InternScenes: A Large-scale Simulatable Indoor Scene Dataset with Realistic Layouts
Paper • 2509.10813 • Published • 30