Fig. 3From: Transfer restless multi-armed bandit policy for energy-efficient heterogeneous cellular networkTransfer learning for Transfer EEM-UCB (TLEEM-UCB) policy. The figure illustrates the principle of transfer learning between two consecutive days on the learning convergence rateBack to article page