Abstract
Physical machines in modern datacenters are routinely upgraded due to their maintenance requirements, which involve migrating all the virtual machines they currently host to alternative physical machines. For this kind of datacenter upgrades, it is critical to minimize the time it takes to upgrade all the physical machines in the datacenter, so as to reduce disruptions to cloud services. To minimize the upgrade time, it is essential to carefully schedule the migration of virtual machines on each physical machine during its upgrade, without violating any constraints imposed by virtual machines that are currently running. Rather than resorting to heuristic algorithms as existing work, we propose a new scheduler, Raven, that uses an experience-driven approach with deep reinforcement learning to schedule the virtual machine migration. With our design of the state space, action space and reward function, Raven trains a fully-connected neural network using the cross-entropy method to approximate the policy of choosing a destination physical machine for each virtual machine before its migration. We compare Raven with state-of-the-art algorithms in the literature, and our results show that Raven can effectively shorten the time to complete the datacenter upgrade under different datacenter settings.









Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Beloglazov A, Abawajy J, Buyya R (2012) Energy-Aware Resource Allocation Heuristics for Efficient Management of Data Centers for Cloud Computing. Future generation computer systems 28(5):755–768
Maurya K, Sinha R (2013) Energy conscious dynamic provisioning of virtual machines using adaptive migration thresholds in cloud data center. International Journal of Computer Science and Mobile Computing 2 (3):74–82
Luo J, Fan X, Yin L (2019) Communication-Aware and Energy Saving Virtual Machine Allocation Algorithm in Data Center. In: 21st IEEE International Conference on High Performance Computing and Communications. IEEE, pp 819–826
Ji S, Li M D, Ji N, Li B (2018) An online virtual machine placement algorithm in an over-committed cloud. In: 2018 IEEE International Conference on Cloud Engineering, IC2E 2018, Orlando, FL, USA, April 17-20, 2018, pp 106–112
Sutton R S, Barto A G (1998) Reinforcement learning: An introduction. MIT Press Cambridge, MA, USA.
Mnih V, Kavukcuoglu K, Silver D, Rusu A A, Veness J, Bellemare M G, Graves A, Riedmiller M, Fidjeland A K, Ostrovski G, Petersen S, Beattie C, Sadik A, Antonoglou I, King H, Kumaran D, Wierstra D, Legg S, Hassabis D (2015) Human-level control through deep reinforcement learning. Nature 518:529–533
Mao H, Alizadeh M, Menache I, Kandula S (2016) Resource management with deep reinforcement learning. In: Proceedings of the 15th ACM Workshop on Hot Topics in Networks, ACM, pp 50–56
Silver D, Huang A, Maddison C J, et al. (2016) Mastering the game of go with deep neural networks and tree search. nature 529(7587):484–489
Silver D, Schrittwieser J, Simonyan K, Antonoglou I, et al. (2017) Mastering the game of go without human knowledge. Nature 550:354–359
Rubinstein RY, Kroese DP (2004) The cross-entropy method. Springer New York, NY, USA.
Ahmad R W, Gani A, Hamid S H A, Shiraz M, Xia F, Madani S A (2015) Virtual Machine Migration in Cloud Data Centers: a Review, Taxonomy, and Open Research Issues. The Journal of Supercomputing 71(7):2473–2515
Borgetto D, Maurer M, et al. (2012) Energy-Efficient and SLA-Aware Management of IaaS Clouds. In: Proceedings of the 3rd International Conference on Energy-Efficient Computing and Networking, Madrid, Spain, May 9-11, 2012. ACM, p 25
Garg S K, Toosi A N, Gopalaiyengar S K, Buyya R (2014) SLA-Based Virtual Machine Management for Heterogeneous Workloads in a Cloud Datacenter. J. Netw. Comput. Appl. 45:108–120
Hieu N T, Francesco M D, et al. (2014) A Virtual Machine Placement Algorithm for Balanced Resource Utilization in Cloud Data Centers. In: 2014 IEEE 7th International Conference on Cloud Computing, Anchorage, AK, USA, June 27 - July 2, 2014. IEEE Computer Society, pp 474–481
Zhang X, Shae Z-Y, Zheng S, Jamjoom H (2012) Virtual Machine Migration in an Over-Committed Cloud. In: Proc. IEEE Network Operations and Management Symposium (NOMS)
Dabbagh M, Hamdaoui B, Guizani M, Rayes A (2015) Efficient Datacenter Resource Utilization Through Cloud Resource Overcommitment. In: Proc. IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS)
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Ying, C., Li, B., Ke, X. et al. Raven: Scheduling Virtual Machine Migration During Datacenter Upgrades with Reinforcement Learning. Mobile Netw Appl 27, 303–314 (2022). https://doi.org/10.1007/s11036-020-01632-1
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11036-020-01632-1