semetrics icon indicating copy to clipboard operation
semetrics copied to clipboard

Speech Enhancement Metrics (PESQ, CSIG, CBAK, COVL)

Speech Enhancement Metrics

This implementation of Matlab was originally taken from https://ecs.utdallas.edu/loizou/speech/software.htm

REQUIREMENTS

Octave must be installed

apt install octave
pip install oct2py Cython # install Cython before installing pesq
pip install pesq

COMPOSITE MEASURE

Usage:

[Csig,Cbak,Covl,segSNR]=composite(cleanfile.wav,enhanced.wav)

where Csig is the predicted rating of speech distortion, Cbak is the predicted rating of background distortion, Covl is the predicted rating of overall quality.

You may run example files included in the zip file. In MATLAB, type:

>> [c,b,o,s]=composite('sp09.wav','enhanced_logmmse.wav')

c = 3.3050

b = 2.6160

o = 2.7133

s = 3.9917

where sp09.wav is the clean file and enhanced_logmmse.wav is the enhanced file.

References

MATLAB Source: https://ecs.utdallas.edu/loizou/speech/software.htm

@ARTICLE{4389058,
  author={Y. {Hu} and P. C. {Loizou}},
  journal={IEEE Transactions on Audio, Speech, and Language Processing}, 
  title={Evaluation of Objective Quality Measures for Speech Enhancement}, 
  year={2008},
  volume={16},
  number={1},
  pages={229-238},
}