CONFERENCE / ICCAIS-2026

Research Article

Generative AI Based Lip Sync Generator Using GAN for Photo-to-Talking Video Synthesis

T. Sireesha1 M. Gopinath Reddy2 Ch. Ramya Sri3 N. Divya Prasanna4
1 2 3 4 Department of Artificial Intelligence and Machine Learning, Sasi Institute of Technology and Engineering Tadepalligudem, Andhra Pradesh, India.

Published Online: 2026

Pages: 175-177

Abstract

In recent years, Generative Artificial Intelligence has shown remarkable progress in creating realistic multimedia content. One interesting application of this technology is talking face generation, where a still image of a person can be animated according to a given speech signal. The main challenge in such systems is to ensure that the lip movements correctly match the spoken words. If synchronization is not accurate, the generated video appears unnatural. In this work, we present a lip synchronization system based on Generative Adversarial Networks (GANs). The proposed model takes a static facial image and a speech audio file as inputs and produces a talking video in which the lip movements are aligned with the audio. The adversarial learning strategy helps improve both synchronization accuracy and visual quality of the generated frames.

Related Articles

2026

Design and Implementation of Bit Swapping and Reversible Logic Based Numeric Data Encryption and Decryption

2026

Smart Crop Advisory and Disease Detection System with Cloud-Connected Irrigation Using IoT

2026

Develop A Real-Time Closed Captioning Solution with Simplified Captions in Multiple Indian Languages for Accessibility and Inclusivity of Deaf and Hard-Of-Hearing Individuals

Share Article

X
LinkedIn
Facebook
WhatsApp

Or copy link

https://indjcst.com/conference/10.59256/indjcst.20260501C029