Chat Long COT model that uses tags
Audio Conditioned LipSync with Latent Diffusion Models
Gaze detection using Moondream