Fig. 8From: Model-based optimal action selection for Dyna-Q reverberation suppression cognitive sonarAverage loss curve for different action selection probabilities \(\varepsilon\)Back to article page