Collections
Discover the best community collections!
Collections trending this week
-
Stable Diffusion Webui
💻16Generate images from text prompts
-
Stable Diffusion 3 Medium Superpompt
📷35Stable Diffusion 3 Medium with SuperPrompt-v1 Enhancement!
-
IllusionDiffusion
👁5.34kGenerate stunning high quality illusion artwork
-
Multi View Diffusion
🧊64Generate multi-view images from text or images
-
GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities
Paper • 2406.11768 • Published • 24 -
Investigating Decoder-only Large Language Models for Speech-to-text Translation
Paper • 2407.03169 • Published • 11 -
PicoAudio: Enabling Precise Timestamp and Frequency Controllability of Audio Events in Text-to-audio Generation
Paper • 2407.02869 • Published • 21 -
FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs
Paper • 2407.04051 • Published • 40
-
GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities
Paper • 2406.11768 • Published • 24 -
Investigating Decoder-only Large Language Models for Speech-to-text Translation
Paper • 2407.03169 • Published • 11 -
PicoAudio: Enabling Precise Timestamp and Frequency Controllability of Audio Events in Text-to-audio Generation
Paper • 2407.02869 • Published • 21 -
FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs
Paper • 2407.04051 • Published • 40
-
Stable Diffusion Webui
💻16Generate images from text prompts
-
Stable Diffusion 3 Medium Superpompt
📷35Stable Diffusion 3 Medium with SuperPrompt-v1 Enhancement!
-
IllusionDiffusion
👁5.34kGenerate stunning high quality illusion artwork
-
Multi View Diffusion
🧊64Generate multi-view images from text or images