Generative AI Based Lip Sync Generator Using GAN for Photo-to-Talking Video Synthesis

T. Sireesha; M. Gopinath Reddy; Ch. Ramya Sri; N. Divya Prasanna

doi:https://www.doi.org/10.59256/indjcst.20260501C029

CONFERENCE / ICCAIS-2026

Research Article

Generative AI Based Lip Sync Generator Using GAN for Photo-to-Talking Video Synthesis

T. Sireesha¹ M. Gopinath Reddy² Ch. Ramya Sri³ N. Divya Prasanna⁴

¹ ² ³ ⁴ Department of Artificial Intelligence and Machine Learning, Sasi Institute of Technology and Engineering Tadepalligudem, Andhra Pradesh, India.

Published Online: 2026

Pages: 175-177

Cite this article

↗ https://www.doi.org/10.59256/indjcst.20260501C029

Abstract

View PDF

In recent years, Generative Artificial Intelligence has shown remarkable progress in creating realistic multimedia content. One interesting application of this technology is talking face generation, where a still image of a person can be animated according to a given speech signal. The main challenge in such systems is to ensure that the lip movements correctly match the spoken words. If synchronization is not accurate, the generated video appears unnatural. In this work, we present a lip synchronization system based on Generative Adversarial Networks (GANs). The proposed model takes a static facial image and a speech audio file as inputs and produces a talking video in which the lip movements are aligned with the audio. The adversarial learning strategy helps improve both synchronization accuracy and visual quality of the generated frames.

Quick Links

Download

Manuscript Template Copyright Form

Policies

Share Article

X

Facebook

Or copy link

https://indjcst.com/conference/10.59256/indjcst.20260501C029

CONFERENCE / ICCAIS-2026

Generative AI Based Lip Sync Generator Using GAN for Photo-to-Talking Video Synthesis

Cite this article

Abstract

Related Articles

Design and Implementation of Bit Swapping and Reversible Logic Based Numeric Data Encryption and Decryption

Smart Crop Advisory and Disease Detection System with Cloud-Connected Irrigation Using IoT

Develop A Real-Time Closed Captioning Solution with Simplified Captions in Multiple Indian Languages for Accessibility and Inclusivity of Deaf and Hard-Of-Hearing Individuals

PlumX Metrics

Dimension

Quick Links

Download

Policies

Share Article