Recent Posts
Most Viewed Posts
- Exchange 2007 Availability Tool Kit
- Forrester Research on High Availability
- Q & A for the June 24th Webinar: SQL Availability - Protecting Your Database and Applications
- everRun and Exchange 2007 Mailbox Servers
- Q & A for the January 2009 Webinar - Customer Spotlight: How the Sullivan Group Got Reliable High Availability without Breaking the Bank
Tuesday, September 16th, 2008 - 2:30 pm EDT
VMware FT – The Top Four Reasons it’s Kinda Sorta Fault Tolerance
Marathon’s team at VMworld got to sit in on the VMworld session Tuesday morning that covered VMware’s newly announced technology for fault tolerant VMs. While not bad for FT rookies, from what we saw, it’s a less than perfect solution for a lot companies that want to run business critical and mission critical applications in VMs. Let’s look at the four primary reasons why.
1. No component-level fault tolerance. The most common failures that result in unplanned downtime are component failures such as storage, NIC or controller failures. Yet VMware Fault Tolerance doesn’t do anything to protect against I/O, storage or network failures. By not addressing these primary sources of failures, VMware appears to be saying that you/the customer are on your own do figure out how to protect your storage and network connections. This may be okay for the very largest IT staffs in the world, but for the other 98%; it will not be sufficient.
2. Complexity on top of complexity. In order to use VMware Fault Tolerance, you’ll first have to install both VMware HA and DRS. No small feat in and of themselves. Then, because VMware FT requires NIC teaming, you’ll also have to manually install paired NICs. Then you’ll need to manually setup dual storage controllers (with the software to manage them) because it requires multi-pathing. And to top it all off, you’re required to use an expensive, and often complicated, SAN.
3. Limited CPU fault tolerance. With VMware FT, you’ll need to setup what VMware refers to as a “record/replay” capability on both a primary and secondary server. If something happens to the primary server, the record is stored on the SAN and then restarted on the secondary server. Two things to point out here. First, the whole thing depends on the quality of the SAN. Second, in the words of the VMware engineer who presented at VMworld, “this can take a couple of seconds.” So what happens to your application state in those couple of seconds?
4. For VMware virtual environments only. VMware FT will only work in VMware environments. It won't work with other hypervisors, and most importantly, you can't use for business critical and mission critical applications that you want to keep on physical server platforms (i.e., non-virtualized environments which still represent the vast majority of customer use cases). Oh well, only the vast majority of critical applications run in physical environments anyway.
It’s great to see VMware recognizing the need for fault tolerance, but we’re puzzled why they decided not to address the biggest source of failures – component failure. And we wonder how many mid-market companies will be able to justify the cost and complexity of getting VMware FT setup and keep it running.
If you had a chance to attend the session what were your thoughts?
Show Discussion / Comments (6)
Fault Tolerant
Marathon
VMware
VMworld
| More