Generate and edit audio from text prompts
a super consistent video depth model
Apply the motion of a video on a portrait