Bird-Song-Detector

Bird-Song-Detector is part of a research to improve bird vocalization identification. The Bird Song Detector is designed to detect bird vocalizations in audio files using a YOLO-based model from the BIRDeep project. The system converts audio files into spectrogram images, performs bird song detection on these images, and transforms the predictions into time segments.

⚠️ Note: The model has been trained on the BIRDeep dataset, which consists of audio recordings from Doñana National Park (Huelva, Spain), located in Huelva, Spain. As such, the detector is particularly well-suited for identifying bird songs from this region and has not been tested on data from other areas.

For more information, visit the full BIRDeep Bird Song Detector repository.

📄 Citation

This repository supports the research article:

Márquez-Rodríguez, A., Mohedano-Munoz, M. Á., Marín-Jiménez, M. J., Santamaría-García, E., Bastianelli, G., Jordano, P., & Mendoza, I.
A Bird Song Detector for improving bird identification through Deep Learning: a case study from Doñana
Ecological Informatics, 2025, 103254. https://doi.org/10.1016/j.ecoinf.2025.103254

@article{marquez2025bird,
  title={A Bird Song Detector for improving bird identification through Deep Learning: a case study from Doñana},
  author={Márquez-Rodríguez, Alba and Mohedano-Munoz, Miguel Ángel and Marín-Jiménez, Manuel J. and Santamaría-García, Eduardo and Bastianelli, Giulia and Jordano, Pedro and Mendoza, Irene},
  journal={Ecological Informatics},
  volume={90},
  pages={103254},
  year={2025},
  publisher={Elsevier},
  doi={10.1016/j.ecoinf.2025.103254}
}

📄 Read the article

Project Structure

The project is structured as follows:

App/                        # Application code
    assets/
        images/             # Images for README and documentation
Code/                       # Core code files
    audio_processing.py     # Functions for processing audio files
    predict_on_audio.py     # Script to predict bird songs in a single audio file
    predict_on_folder.py    # Script to predict bird songs in all audio files in a folder
Data/                       # Data directory
    Audios/                 # Sample audio files (place your own here)
    Images/                 # Spectrogram images generated from audio files
    Segments/               # Directory for detected audio segments
flagged/                    # Generated files from the Gradio app
Models/                     # Model directory
    Bird Song Detector/     # YOLO model
    BirdNET FineTuned BIRDeep/  # Fine-tuned YOLO model
README.md                   # This README file
environment.yml             # Conda environment specification

Setup

Clone the repository:

git clone https://github.com/yourusername/Bird-Song-Detector.git
cd Bird-Song-Detector

Create and activate conda environment from environment.yml:

conda env create -f environment.yml
conda activate bird-song-detector

Install dependencies:
```
pip install -r requirements.txt
```

Usage

Detect Bird Songs in a Single Audio File

Run the script predict_on_audio.py to detect bird songs in a single audio file:

python Code/predict_on_audio.py

Detect Bird Songs in Multiple Audio Files

Run the script predict_on_folder.py to detect bird songs in all audio files in a folder:

python Code/predict_on_folder.py

Web Interface

For a more interactive experience, you can run the Gradio web interface (app.py) by executing the following:

python App/app.py

The web interface will be available at http://127.0.0.1:7860/ by default. On the main page, drag and drop your audio file into the upload box and click Detect Bird Song to process it.

Once the detection is complete, you will see the following:

A list of predicted bird song segments with their start time, end time, and confidence score.
The corresponding spectrogram image with bounding boxes indicating detected bird songs.

You can then click on the Generate Segments button to download a ZIP file containing the individual detected audio segments in WAV format.

The generated ZIP file will contain the predictions WAV format with the name of original audio file followed by the start and end time of the detection and the confidence score.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Funding

This work has received financial support from the BIRDeep project (TED2021-129871A-I00), which is funded by MICIU/AEI/10.13039/501100011033 and the ‘European Union NextGenerationEU/PRTR

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Bird-Song-Detector

📄 Citation

Project Structure

Setup

Usage

Detect Bird Songs in a Single Audio File

Detect Bird Songs in Multiple Audio Files

Web Interface

License

Funding

About

Uh oh!

Releases 4

Packages

Languages

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
App		App
Code		Code
Data		Data
Models		Models
__pycache__		__pycache__
assets/images		assets/images
flagged		flagged
runs/detect/predict		runs/detect/predict
LICENSE		LICENSE
README.md		README.md
citation.cff		citation.cff
environment.yml		environment.yml

License

GrunCrow/Bird-Song-Detector

Folders and files

Latest commit

History

Repository files navigation

Bird-Song-Detector

📄 Citation

Project Structure

Setup

Usage

Detect Bird Songs in a Single Audio File

Detect Bird Songs in Multiple Audio Files

Web Interface

License

Funding

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 4

Packages 0

Languages

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Packages