OpenGVLab

community

https://github.com/opengvlab

AI & ML interests

Computer Vision

Recent Activity

vansin submitted a paper 1 day ago

End-to-End Video Character Replacement without Structural Guidance

heroding77 authored a paper 2 days ago

OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agent

heroding77 submitted a paper 2 days ago

OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agent

OpenGVLab's Papers (7)

Submitted by yinanhe
InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision

VKnowU: Evaluating Visual Knowledge Understanding in Multimodal LLMs

Submitted by Long Cui
ViCO: A Training Strategy towards Semantic Aware Dynamic High-Resolution

Submitted by Yicheng Xu
ExpVid: A Benchmark for Experiment Video Understanding & Reasoning

Submitted by Changyao Tian
NaViL: Rethinking Scaling Properties of Native Multimodal Large Language Models under Data Constraints

Submitted by Songze Li
Learning Goal-Oriented Language-Guided Navigation with Self-Improving Demonstrations at Scale

Submitted by Cao Yue
Sequential Diffusion Language Models