Fig. 6
From: Transfer restless multi-armed bandit policy for energy-efficient heterogeneous cellular network
Network EE obtained with several learning algorithms with respect to the number of BSs. The figure compares the network energy efficiency (EE) achieved by several learning policies as the number of base stations (BSs) increases from 4 to 16, after 3000 iterations of each algorithm and with a target traffic rate Λ_target = 0.05 × 10⁻⁴.