Fig. 3From: Accurate visual localization with semantic masking and attentionThe architectures of the proposed modules. a The bias network, composed of three convolutional layers. b The attention network, composed of a residual network with one Convolutional Block Attention ModuleBack to article page