output = joergensen2013(x, y, fs, IO_param)
output = joergensen2013(x, y, fs, IO_param) calculates the signal-to-noise envelope-power (SNRenv) ratio using the multi-resolution speech-based envelope spectrum model (mr-sEPSM) described in Joergensen et al. (2013).
'',x | noisy speech mixture |
'',y | noise alone |
'',fs | sample rate in Hz |
'',IO_param | (optional) vector with parameters for the ideal observer that converts the SNRenv to probability of correct, assuming a given speech material. It contains four parameters of the ideal observer formatted as [k q m sigma_s]. |
The model is based on the model from Joergensen et al. (2011), which consists of the following stages:
5) A decision mechanism based on a statistically ideal observer (output.P_correct)
The main difference between to the Joergensen et al. (2011) model is that the present model estimates the envelope power using multi-resolution segmentation of the envelope. The segment duration depends on the modulation filter center-frequency. In addition, the modulation filter bank includes filters up to modulation frequencies of 256 Hz in contrast to the 64 Hz considered by the model from Joergensen et al. (2011).
S. Joergensen and T. Dau. Predicting speech intelligibility based on the signal-to-noise envelope power ratio after modulation-frequency selective processing. J. Acoust. Soc. Am., 130(3):1475--1487, 2011.
S. Jørgensen, S. D. Ewert, and T. Dau. A multi-resolution envelope power based model for speech intelligibility. J. Acoust. Soc. Am., 134(1):436--446, 2013.