CNN-Based Bird Sound Detection: A Comparative Performance Study

Authors

  • Gaurav Giri Department of Computer and Electronics Engineering, Kantipur Engineering College, Dhapakhel, Lalitpur, Nepal
  • Iza KC Department of Computer and Electronics Engineering, Kantipur Engineering College, Dhapakhel, Lalitpur, Nepal
  • Prajwal Khatiwada Department of Computer and Electronics Engineering, Kantipur Engineering College, Dhapakhel, Lalitpur, Nepal
  • Samrat Kumar Adhikari Department of Computer and Electronics Engineering, Kantipur Engineering College, Dhapakhel, Lalitpur, Nepal
  • Subarna Shakya Department of Electronics and Computer Engineering, IOE Pulchowk Campus, Pulchowk, Lalitpur, Nepal

DOI:

https://doi.org/10.3126/injet.v2i2.78615

Keywords:

Bird Sound Detection, Deep Learning, Convolutional Neural Network, Audio Processing, Mel spectrogram

Abstract

The use of automated systems for biodiversity monitoring has positioned bird sound detection as a crucial   technology for modern ecological research. This study introduces a deep learning framework for bird sound detection, demonstrating strong performance in challenging acoustic environments. By leveraging feature extraction techniques such as Mel spectrograms and comparing multiple convolutional neural network architectures, this approach achieves competitive results. In our experiments, the best-performing model ResNet50 achieved an accuracy of 90.75%, with a recall of 0.89, a precision of 0.91, and an F1-score of 0.90 on the test set. These results highlight the method’s potential for real-time biodiversity monitoring, providing a reliable tool for ecological research, citizen science initiatives, and conservation efforts.

Downloads

Download data is not yet available.
Abstract
212
PDF
219

Downloads

Published

2025-05-19

How to Cite

Giri, G., KC, I., Khatiwada, P., Adhikari, S. K., & Shakya, S. (2025). CNN-Based Bird Sound Detection: A Comparative Performance Study. International Journal on Engineering Technology, 2(2), 176–187. https://doi.org/10.3126/injet.v2i2.78615

Issue

Section

Articles