Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
paoloski97
's Collections
Image-Generation
Embedding
Med
Mesh
Point tracking
Materials
Multimodal+ImageGen
Dataset
Traduttori
Detection
Depth
Video
Stable Cascade Models
Wuerstchen
Image Captioning
Stable DIffusion 3
HunyuanDiT
Audio
LLM
AuraFlow
Flux
Text_to_Audio
Repository
Multimodal
UltraPixel
Forecasting
OCR
llamafile
Audio
updated
12 days ago
Upvote
-
CAMB-AI/MARS5-TTS
Text-to-Speech
•
Updated
Jul 5, 2024
•
209
•
448
suno/bark
Text-to-Speech
•
Updated
Oct 4, 2023
•
48.9k
•
•
1.24k
fishaudio/fish-speech-1.4
Text-to-Speech
•
Updated
Nov 5, 2024
•
1.13k
•
448
nyrahealth/CrisperWhisper
Automatic Speech Recognition
•
Updated
Dec 19, 2024
•
39.2k
•
224
fishaudio/fish-speech-1.5
Text-to-Speech
•
Updated
Dec 3, 2024
•
8.36k
•
432
hexgrad/Kokoro-82M
Text-to-Speech
•
Updated
7 days ago
•
248k
•
2.94k
NexaAIDev/OmniAudio-2.6B
Audio-Text-to-Text
•
Updated
Dec 13, 2024
•
1.33k
•
242
m-a-p/YuE-s1-7B-anneal-en-cot
Text Generation
•
Updated
9 days ago
•
30.5k
•
351
Upvote
-
Share collection
View history
Collection guide
Browse collections