text-to-speech
updated
FlashSpeech: Efficient Zero-Shot Speech Synthesis
Paper
•
2404.14700
•
Published
•
32
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
Paper
•
2306.15687
•
Published
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and
Diffusion Models
Paper
•
2403.03100
•
Published
•
38
Tango 2: Aligning Diffusion-based Text-to-Audio Generations through
Direct Preference Optimization
Paper
•
2404.09956
•
Published
•
12
Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech
Prompts
Paper
•
2307.07218
•
Published
•
27
Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive
Bias
Paper
•
2306.03509
•
Published
•
5
parler-tts/dac_44khZ_8kbps
76.7M
•
Updated
•
792
•
19
parler-tts/parler_tts_mini_v0.1
Text-to-Speech
•
0.6B
•
Updated
•
3.21k
•
358
Wenetspeech4TTS/WenetSpeech4TTS
Updated
•
719
•
83
Text-to-Audio
•
Updated
•
4
•
9
Feature Extraction
•
96.2M
•
Updated
•
1.19M
•
•
277
Text-to-Speech
•
Updated
•
1.74M
•
•
5.53k
Text-to-Speech
•
4B
•
Updated
•
854
•
523
Text-to-Speech
•
Updated
•
10.6k
•
1.1k
stepfun-ai/Step-Audio-TTS-3B
Text-to-Speech
•
4B
•
Updated
•
179
•
193
Text-to-Speech
•
Updated
•
177
•
414
Text-to-Speech
•
Updated
•
76.1k
•
•
2.82k