CONFERENCE / ICCAIS-2026
Generative AI Based Lip Sync Generator Using GAN for Photo-to-Talking Video Synthesis
Published Online: 2026
Pages: 175-177
Cite this article
↗ https://www.doi.org/10.59256/indjcst.20260501C029Abstract
In recent years, Generative Artificial Intelligence has shown remarkable progress in creating realistic multimedia content. One interesting application of this technology is talking face generation, where a still image of a person can be animated according to a given speech signal. The main challenge in such systems is to ensure that the lip movements correctly match the spoken words. If synchronization is not accurate, the generated video appears unnatural. In this work, we present a lip synchronization system based on Generative Adversarial Networks (GANs). The proposed model takes a static facial image and a speech audio file as inputs and produces a talking video in which the lip movements are aligned with the audio. The adversarial learning strategy helps improve both synchronization accuracy and visual quality of the generated frames.
Related Articles
2026
Design and Implementation of Bit Swapping and Reversible Logic Based Numeric Data Encryption and Decryption
2026
Smart Crop Advisory and Disease Detection System with Cloud-Connected Irrigation Using IoT
2026