Fig. 6. The Dyna-Q-Max-Action algorithm with \(\varepsilon = 1\)
From: Model-based optimal action selection for Dyna-Q reverberation suppression cognitive sonar