arxiv:2511.09611
Xiangtai Li
LXT
AI & ML interests
Computer Vision, Multi-Modal Understanding, Generative AI
Recent Activity
upvoted
a
paper
about 10 hours ago
BabyVision: Visual Reasoning Beyond Language
upvoted
a
paper
about 10 hours ago
Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning
upvoted
a
paper
26 days ago
Vision-Language-Action Models for Autonomous Driving: Past, Present, and Future