Enhancing the magnitude spectrum of speech features for robust speech recognition

EURASIP Journal on Advances in Signal Processing

Table 2 Recognition accuracy (%) achieved by various approaches for Aurora-2 clean-condition training task averaged across the SNRs between 0 and 20 dB, where AVG (%) and RR (%) are the averaged accuracy rate and the relative error rate reduction over the baseline

Method	Set A	Set B	Set C	AVG	RR
MFCC baseline	59.24	56.37	67.53	59.75	–
	Spectral-domain methods
SS_Boll	61.81	64.09	60.09	62.38	6.53
SS_Berouti	69.76	70.47	69.39	69.97	25.40
SS_Kamath	66.91	67.50	67.19	67.20	18.52
WF_PSNR	71.78	73.66	70.37	72.25	31.05
WF_TSNR	51.12	55.90	45.64	51.94	-19.41
WF_HRNR	56.20	59.65	53.47	57.03	-6.74
MMSE log-STSA	72.71	73.58	71.99	72.91	32.71
MSE	77.76	79.89	69.42	76.94	42.72
	Cepstral-domain methods
MVN	73.81	75.02	75.08	74.55	36.77
HEQ	81.42	83.34	81.51	82.21	55.80
MVA	78.15	79.17	79.12	78.75	47.21