OpenGVLab/ExpVid
Computer Vision
InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision
VKnowU: Evaluating Visual Knowledge Understanding in Multimodal LLMs