Virtual machines of high availability using hardware-assisted failure detection

Wei Jen Wang, Hung Lin Huang, Shan Hao Chuang, Shao Jui Chen, Chia Hung Kao, Deron Liang

研究成果: 書貢獻/報告類型會議論文篇章同行評審

摘要

The virtualization technology has been widely used in today's doud computing datacenters. With the virtualization technology, each physical machine in a datacenter can be logically divided into several virtual machines, on which different types of software services can host. However, many reasons may decrease the availability of the whole system. For example, a failed physical machine automatically fails all virtual machines on the physical machine, and consequently fails every software service on the virtual machines. It is difficult to detect failures efficiently in a general-purpose computer architecture because the hardware cannot provide enough information for fast failure detection. On the contrary, the ATCA (Advanced Telecommunications Computing Architecture) physical machines provide high hardware availability, and support IPMI (Intelligent Platform Management Interface) that can quickly detect the hardware status. In this paper, we developed a novel failure model and designed a symmetric fault-tolerant mechanism using ATCA physical machines and KVM to provide a solution for high system availability. The proposed fault-tolerant mechanism divides ATCA physical machines into pairs, such that each machine of a pair supports fault tolerance for each other. Once a failure is detected in the physical machine layer or the virtualization layer, the failed virtual machines are then recovered on the other physical machine. We have compared the proposed fault-tolerance mechanism with another prior VM-based fault-tolerance tool. The results show that the proposed mechanism significantly reduces the service downtime. That is, it provides better system availability for software services running on the virtual machines.

原文???core.languages.en_GB???
主出版物標題ICCST 2015 - The 49th Annual IEEE International Carnahan Conference on Security Technology
發行者Institute of Electrical and Electronics Engineers Inc.
ISBN(電子)9781479986910
DOIs
出版狀態已出版 - 21 1月 2016
事件49th Annual IEEE International Carnahan Conference on Security Technology, ICCST 2015 - Taipei, Taiwan
持續時間: 21 9月 201524 9月 2015

出版系列

名字Proceedings - International Carnahan Conference on Security Technology
2015-January
ISSN(列印)1071-6572

???event.eventtypes.event.conference???

???event.eventtypes.event.conference???49th Annual IEEE International Carnahan Conference on Security Technology, ICCST 2015
國家/地區Taiwan
城市Taipei
期間21/09/1524/09/15

指紋

深入研究「Virtual machines of high availability using hardware-assisted failure detection」主題。共同形成了獨特的指紋。

引用此