Fig. 1From: Transfer restless multi-armed bandit policy for energy-efficient heterogeneous cellular networkReinforcement learning (RL) framework for BS switching operation. The figure illustrates the principle of the multi-armed bandit reinforcement learning model for ON/OFF base stationsBack to article page