seruva19

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

nvidia/NitroGen

upvoted a paper 6 days ago

mHC: Manifold-Constrained Hyper-Connections

upvoted a paper 6 days ago

Pretraining Frame Preservation in Autoregressive Video Memory Compression

Organizations

upvoted 2 papers 6 days ago

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published 7 days ago • 210

Pretraining Frame Preservation in Autoregressive Video Memory Compression

Paper • 2512.23851 • Published 9 days ago • 21

upvoted 2 papers 20 days ago

IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning

Paper • 2512.15635 • Published 21 days ago • 19

LongVie 2: Multimodal Controllable Ultra-Long Video World Model

Paper • 2512.13604 • Published 23 days ago • 72

upvoted an article 22 days ago

Qwen-Image-i2L: Training Strategies for Image-to-LoRA Generation

Article • 22 days ago

upvoted 2 papers about 1 month ago

Adversarial Flow Models

Paper • 2511.22475 • Published Nov 27, 2025 • 22

TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

Paper • 2512.02014 • Published Dec 1, 2025 • 71

upvoted a paper about 2 months ago

Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation

Paper • 2511.14993 • Published Nov 19, 2025 • 227

upvoted a collection 2 months ago

VisionLM

Collection

1867 items • Updated 15 days ago • 139

upvoted 2 papers 2 months ago

LongCat-Video Technical Report

Paper • 2510.22200 • Published Oct 25, 2025 • 29

HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives

Paper • 2510.20822 • Published Oct 23, 2025 • 40

upvoted a paper 3 months ago

Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset

Paper • 2510.15742 • Published Oct 17, 2025 • 50

upvoted a paper 4 months ago

Mixture of Contexts for Long Video Generation

Paper • 2508.21058 • Published Aug 28, 2025 • 35

upvoted a paper 5 months ago

∇NABLA: Neighborhood Adaptive Block-Level Attention

Paper • 2507.13546 • Published Jul 17, 2025 • 124

upvoted a collection 7 months ago

Alchemist

Collection

📊 Dataset and 🏆 checkpoints for paper 📝 "Alchemist: Turning Public Text-to-Image Data into Generative Gold" • 8 items • Updated Oct 16, 2025 • 17

upvoted a paper 8 months ago

Wan: Open and Advanced Large-Scale Video Generative Models

Paper • 2503.20314 • Published Mar 26, 2025 • 56

upvoted 2 papers about 1 year ago

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6, 2024 • 63

Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis

Paper • 2412.01819 • Published Dec 2, 2024 • 34

upvoted 2 papers over 1 year ago

CogVLM2: Visual Language Models for Image and Video Understanding

Paper • 2408.16500 • Published Aug 29, 2024 • 57

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

Paper • 2408.16532 • Published Aug 29, 2024 • 50
