Taming Stable Diffusion for Lip Sync!
-
Updated
Jun 20, 2025 - Python
Taming Stable Diffusion for Lip Sync!
🤘 TT-NN operator library, and TT-Metalium low level kernel programming model.
An open source community implementation of the model from the paper: "Movie Gen: A Cast of Media Foundation Models". Join our community to help implement this model!
[ICCV 2025] FiVE-Bench: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models
Once the Adobe Sora API can be use, this repository will be updated soon. My other projects: https://videosora.app https://makeimage.ai
Create a Waveform Video (usable on Youtube, Tiktok, etc.) from a WAV or MP3 file. Two output options: ultrafast generation (static background with optional title) and standard generation (dynamic background).
Add a description, image, and links to the video-gen topic page so that developers can more easily learn about it.
To associate your repository with the video-gen topic, visit your repo's landing page and select "manage topics."