cagatay odabasi's picture

20 96

cagatay odabasi

cagatayodabasi

·

cagbal

AI & ML interests

None yet

Recent Activity

liked a dataset about 2 months ago

builddotai/Egocentric-10K

upvoted a collection about 2 months ago

liked a model 4 months ago

meituan-longcat/LongCat-Flash-Thinking

View all activity

Organizations

upvoted a collection about 2 months ago

VST

A comprehensive framework designed to cultivate VLMs with human-like visuospatial abilities. • 5 items • Updated Nov 12, 2025 • 6

upvoted a collection 5 months ago

Cosmos-Predict2

⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos-predict25 • 13 items • Updated 2 days ago • 33

upvoted a paper 10 months ago

Cosmos World Foundation Model Platform for Physical AI

Paper • 2501.03575 • Published Jan 7, 2025 • 81

upvoted a collection 10 months ago

Physical AI

Collection of open, commercial-grade datasets for physical AI developers • 23 items • Updated 15 days ago • 103

upvoted a paper 11 months ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28, 2025 • 123

upvoted a collection about 1 year ago

PixMo

A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 10 items • Updated 15 days ago • 85

upvoted a paper about 1 year ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 129

upvoted a collection over 1 year ago

Theia

Distilling Diverse Vision Foundation Models for Robot Learning • 6 items • Updated Sep 30, 2024 • 9

upvoted an article over 1 year ago

Article

Metric and Relative Monocular Depth Estimation: An Overview. Fine-Tuning Depth Anything V2 👐 📚

Jul 10, 2024

•

91

upvoted 2 papers over 1 year ago

Sapiens: Foundation for Human Vision Models

Paper • 2408.12569 • Published Aug 22, 2024 • 94

3D-VLA: A 3D Vision-Language-Action Generative World Model

Paper • 2403.09631 • Published Mar 14, 2024 • 11

upvoted a collection over 1 year ago

Minitron

A family of compressed models obtained via pruning and knowledge distillation • 12 items • Updated 15 days ago • 62

upvoted 6 papers over 1 year ago

OpenResearcher: Unleashing AI for Accelerated Scientific Research

Paper • 2408.06941 • Published Aug 13, 2024 • 32

DC3DO: Diffusion Classifier for 3D Objects

Paper • 2408.06693 • Published Aug 13, 2024 • 11

Imagen 3

Paper • 2408.07009 • Published Aug 13, 2024 • 62

Task-oriented Sequential Grounding in 3D Scenes

Paper • 2408.04034 • Published Aug 7, 2024 • 8

Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks

Paper • 2408.03615 • Published Aug 7, 2024 • 31

Achieving Human Level Competitive Robot Table Tennis

Paper • 2408.03906 • Published Aug 7, 2024 • 28

upvoted an article over 1 year ago

Article

Vision Language Models Explained

Apr 11, 2024

•

505

upvoted a paper about 2 years ago

Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model

Paper • 2312.13252 • Published Dec 20, 2023 • 27