Submitted by
yinanhe
AI & ML interests
Computer Vision
Recent Activity
View all activity
Papers
InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision
VKnowU: Evaluating Visual Knowledge Understanding in Multimodal LLMs