Amphion Text-to-Audio Pretrained Models
We provide the following pretrained checkpoints for you to use, specifically:
Two AudioLDM pretrained checkpoints with corresponding AutoencoderKL checkpoints trained on AudioCaps.
Quick Start
To utilize the pretrained models, just run the following commands:
Step1: Download the checkpoint
git lfs install
git clone https://huggingface.co/amphion/text_to_audio
Step2: Clone the Amphion's Source Code of GitHub
git clone https://github.com/open-mmlab/Amphion.git
Step3: Specify the checkpoint's path
Use the soft link to specify the downloaded checkpoint in the first step:
cd Amphion
mkdir -p ckpts
ln -s ../../../text_to_speech/tta ckpts/
Step4: Inference
You can follow the inference part of this recipe to generate audio from text.
We also provided an online demo, feel free to try it!
Inference Providers
NEW
This model is not currently available via any of the supported third-party Inference Providers, and
HF Inference API was unable to determine this model's library.