Fig. 7
From: Model-based optimal action selection for Dyna-Q reverberation suppression cognitive sonar
The Dyna-Q-Max-Action algorithm with \(\varepsilon = 0.6\)