From: Transfer restless multi-armed bandit policy for energy-efficient heterogeneous cellular network

Improvement gain of the TLEEM-UCB policy with respect to the EEM-UCB policy for different target arrival rates. The figure shows the percentage improvement in CEER obtained with transfer learning relative to the algorithm without transferred knowledge. The bars, read against the left Y-axis, show the gain in CEER, while the right Y-axis shows the difference Λ_target − Λ_source. The performance gain is plotted for several numbers of iterations, i.e., 100, 500, 1500, and 3000.

