Skip to main content

Table 1 Optimal embedding context size ( K ) and model sizes ( N M of GMM and N H of MLP)

From: Audio visual speech source separation via improved context dependent association model

 

GMM(K/N M )

 

MLP(K/N H )

k a

k v :2

4

6

8

 

2

4

6

8

2

1/4

1/16

1/28

1/8

 

4/20

4/16

4/20

6/28

4

1/24

1/24

1/24

1/20

 

4/20

3/16

2/24

3/16

6

1/28

1/20

1/24

1/24

 

4/8

2/24

3/16

2/8

8

1/8

1/8

1/28

1/28

 

2/20

2/16

2/28

2/16

10

1/24

1/28

1/28

1/24

 

2/16

2/28

2/16

2/16

12

1/8

1/16

1/20

1/20

 

4/4

2/16

2/20

4/8

  1. Optimal values are selected using cross validation for different values of k a and k v .