Deep Fake Audio Detection Using a Hybrid CNN-BiLSTM Model with Attention Mechanism
DOI:
https://doi.org/10.3126/injet.v2i2.78619Keywords:
: Deepfake Audio Detection, CNN-BiLSTM Hybrid Model, Synthetic Speech, Audio AuthenticityAbstract
The increasing sophistication of deepfake technologies has raised significant concerns regarding the authenticity of audio files. To address this challenge, we propose a deepfake audio detection system that employs a CNN-BiLSTM hybrid model for identifying synthetic speech. The system processes audio files by converting them into Mel-Spectrograms, which effectively capture the unique spectral features distinguishing real voices from fake ones. The processed data is then fed into the CNN-BiLSTM model, which leverages the strengths of Convolutional Neural Networks (CNNs) for spatial pattern recognition and Bidirectional Long Short-Term Memory Networks (BiLSTMs) for capturing long-term dependencies in the temporal sequence of the audio data. The model, trained on dataset, achieves an accuracy of 95%, effectively detecting subtle irregularities indicative of deepfake audio. The system provides users with a comprehensive analysis, including a confidence score and detailed insights into the authenticity of the audio, offering an effective tool for distinguishing real from fake audio. Our system combines cutting-edge machine learning technology with a user-friendly interface, making it both highly effective and accessible for practical applications in deepfake detection. Unlike existing systems that rely solely on either CNN or RNN architectures, our approach integrates both to enhance detection accuracy, particularly for complex and subtle manipulations. Additionally, we introduce a confidence scoring mechanism and insight analysis that provides users with transparent reasoning behind each detection.
Downloads
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 International Journal on Engineering Technology

This work is licensed under a Creative Commons Attribution 4.0 International License.
This license enables reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.