audio_classification
audio_classification copied to clipboard
Audio classification using CNN and LSTM
Data Visualization
MFCC Features
Spectrogram
Raw Audio
Results
| CNN | ||||
|---|---|---|---|---|
| Spectrogram | MFCC | |||
| Dataset | Train | Validation | Train | Validation |
| urbansound8k | 99.914 | 97.252 | 100 | 84.544 |
| CNN-LSTM | ||||
|---|---|---|---|---|
| Spectrogram | MFCC | |||
| Dataset | Train | Validation | Train | Validation |
| urbansound8k | 99.928 | 96.451 | 99.985 | 82.369 |
Training Accuracy Plot
Validation Accuracy Plot
Training Error Plot
Validation Error Plot
Installation
Use the package manager pip to install foobar.
pip install requirements.txt
or
conda create --name <env> --file requirements.txt
Usage
Dataset
Pre-process Data
python codes/pre_processing/pre_processing_urbansound.py
Train and Test
python codes/baseline/main.py
Contributing
Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.
Please make sure to update tests as appropriate.