Recent Posts
Most Viewed Posts
- Exchange 2007 Availability Tool Kit
- Forrester Research on High Availability
- Q & A for the June 24th Webinar: SQL Availability - Protecting Your Database and Applications
- everRun and Exchange 2007 Mailbox Servers
- Q & A for the January 2009 Webinar - Customer Spotlight: How the Sullivan Group Got Reliable High Availability without Breaking the Bank
Blog Entries in downtime
Thursday, June 17th, 2010 - 1:37 pm EDT
How to Cut Risks and Costs with a Downtime Analysis & Action Plan
Earlier this week, we hosted a webinar on the topic of “How to Cut Risks and Costs with a Downtime Analysis & Action Plan.” We know from our experience in application availability that many companies avoid these types of assessments – they either don't know where to start or decide that they don’t have the time or experience to conduct an assessment, so they just live with the unknowns and hope that nothing bad happens. (We’ve seen the consequences of downtime at many companies and don’t recommend this method!)
Our VP of Services & Support, Beth Shea, explored this topic in detail and provided a simple framework that companies can use today to uncover their risks and put measures in place to minimize the impact of downtime. To learn more, be sure to watch the 30-minute webinar. You can also check out the Q&A session from the webinar, summarized below.
Q: When looking at the impact of downtime, it is just unplanned downtime, or should you include planned downtime as well?
You absolutely need to plan for both planned and unplanned downtime, as there’s a real cost and business impact to both. They both need to be included in your impact assessment.
Q: What about branch offices – should they be included in a downtime assessment?
According to Forrester Research, about 20% of a company’s business is tied up in branch and remote offices, and IT needs to include these offices in any assessment that they are conducting. You shouldn’t overlook these offices when putting together your downtime and business impact assessments. They have to be factored in.
Q: How often should I conduct a business impact and risk assessment?
What we’ve found with our customers is that conducting an annual assessment is sufficient, or in some cases, twice a year, depending on the type of business. You can then use these as your benchmark going forward to determine the success of the initiative and ensure that you have the key metrics to report to your management team.
Q: How do you determine when to use local high availability vs. a disaster recovery solution?
Fault tolerance, high availability, disaster recovery - all of these different terms can be confusing and they can have different meanings to different people. The way we think of this is that when you’re implementing high availability or fault tolerance this is to ensure that locally you are protected against the everyday, nuisance failures that cause downtime. If you lose a fan or a drive for example, you would automatically route to another server within the same building or local area. Disaster recovery solutions are really for recovery from catastrophes (fire, flood) or other events where you need to failover to a much more distant location. You don’t want to use this type of solution for everyday failures, as it can be very time consuming to failover and failback, and you can potentially lose some data. For local protection, you want high availability/fault tolerant solutions.
Q: What about hosted applications like salesforce.com, how do I account for those in this type of assessment?
In today’s world, so many applications are offered as Software-as-a-Service (SaaS) or sometimes called hosted applications, where they are no longer hosted at your site. However, they are still important to your business and need to be included as part of your overall assessment. Our approach is to conduct the assessment for your SaaS applications as if all they were onsite. Then use your tiered analysis and make sure that your SaaS vendor is meeting your availability requirements for that application, and that they have the necessary protections in place to protect that application to the same level that you would protect if it were in-house.
Q: Does Marathon offer any services to conduct this type of assessment?
Yes – this is a service that we provide for our customers. Most customers are very satisfied with the service, because it usually has an immediate ROI for their business. If you are interested in this type of service, please feel free to us at 978-489-1100.
Q: Does Marathon have any templates available to build a framework for this type of assessment?
Absolutely. From our 16+ years of working with customers on the assessment and prevention of downtime, we’ve put together an extensive list of questions to ask about the business risks and impact of downtime. Please feel free to contact us if you would like more information.
Q: How do you measure or put a price on the intangible impacts of downtime?
This can be tough to nail down, but what we recommend is developing some basic estimates. This isn’t meant to be an exact number, what we are really trying to achieve here is to prioritize applications, put them into the tiers that we discussed and make sure that you are putting the right amount of resources against the right applications. From a productivity perspective, one metric you could use is to look at the cost of employee salaries and how much it would cost in salary costs to have employees not be able to work for a certain amount of time. This is just one example.
Q: Does everRun handle quick switch over to back up site if the main site goes down?
Yes, within seconds.
Q: What are the requirements for the backup site?
The machines at the backup site are in the same pool as the primary site, so the backup machines must meet the requirements to be in the same pool as the primary site machines.
Q: How about regular data sync between main site and backup site?
Since the primary and backup site are running in lockstep mode, the application and the data are always in sync between the primary and backup sites.
Show Discussion / Comments (0)
Downtime
Availability
Disaster Recovery
Fault Tolerance
High Availability
Interview
Webinar
| More
Tuesday, March 9th, 2010 - 10:52 am EST
Q&A with Craig Resnick of ARC Advisory Group
Next week Craig Resnick, research director and automation expert at ARC Advisory Group will be the guest speaker for our webinar "Best Practices for Preventing Downtime in Automation Systems." We recently sat down with Craig to discuss some of the recent trends in the manufacturing and automation industries.
Q: What are some of the newer trends that you are seeing in the automation space?
Craig Resnick: A primary trend that we see at ARC is the convergence of automation and IT systems. Nearly every manufacturing company uses a variety of plant automation and enterprise IT systems to manage its operations. Plant floor systems, such as distributed control systems (DCS), programmable automation and logic controllers (PACs/PLCs), and a wide range of plant floor applications provide a wealth of real-time information regarding productivity, efficiency, equipment health, capability, and quality. Business systems, in turn, provide information on raw material costs, product orders and inventories, manufacturing resources, production schedules, etc. This wide range of information often remains isolated in systems such as manufacturing execution systems (MES), laboratory systems, maintenance systems, scheduling systems, enterprise resource planning (ERP) systems, supply chain management (SCM) systems, and customer relationship management (CRM) systems. Decisions based on data from any one of these system will always be less than optimal because, without the corresponding information from the other systems, the information will be incomplete.
To close this gap between automation and IT systems, and to address the trend of the plant floor becoming more IT-centric, ARC has defined a new space, defined as Collaborative Production Systems. These new systems consist of platforms in which the controls layer domains of process, logic, motion, building automation, and power control systems converge with the information layer domains of production management and MES systems. These converged systems enable, for example, the required data and information to be directly tied into applications such as corporate reporting and manufacturing compliance. Collaborative Production Systems will become the industrial blade server that provides full monitoring and control of the enterprise, from the office to the plant floor, sharing that information with the supply chain to, for example, procure materials and resources and purchase or sell power at the optimal times and prices from the smart grid, while providing full financial metrics and KPIs to ERP systems to maximize profitability.
Q: Now that corporate reporting and systems are heavily tied into the “factory floor”, how is that changing the need for system availability and data protection?
Craig Resnick: The need for system availability and data protection continues to expand, driven by a combination of issues ranging from global competition to regulatory requirements. Process safety and critical control are primarily focused on system availability and process uptime. As a specific example, take the Pharmaceutical industry, where data and batch information can never be lost or interrupted. System availability and data protection needs are also forcing E-records regulations to evolve across the globe. In the US, this includes 21 CFR Part 11, as well as the FDA’s Good Manufacturing Practice (GMP) and Process Analytical Technology (PAT) initiatives. In Europe, this includes Annex 11 of the EU GMPs, electronic Signatures Directive 1999/93/EC, and Data Protection Directive 95/46/EC. The European Data Protection Directive requires even more protection on data than the current FDA regulations and extends this requirement to clinical trials patients, as all clinical trials data requires maximum protection to remain compliant with regulations.
Unscheduled downtime is expensive. It often impacts production’s ability to meet its schedule and may cause missed customer commitments. Unplanned downtime, which also includes unexpected stoppages resulting from equipment failure, operator error, or nuisance trips, is the nemesis of all manufacturers. Statistics on the impact of unplanned downtime on plant operations show that it accounts for 2 to 5 percent of production lost in, for example, the petrochemical industry. Unscheduled downtime is also costly in terms of equipment damage, environmental harm, and worker safety. The cost of downtime is reflected in a primary key performance indicator (KPI) used by manufacturers known as Dynamic Overall Equipment Effectiveness (OEE), which helps determine the real-time impact of the performance of any individual process or piece of equipment on the overall efficiency of the plant. Unscheduled downtime is a primary factor that significantly lowers Dynamic OEE, which translates to the manufacturer decreasing both its efficiency and profitability.
Q: What are some of the basic steps that companies can implement to ensure the availability of their systems?
Craig Resnick: The first step that companies can implement to ensure the availability of their systems is to maximize their operator’s effectiveness in the control room, which is essential to minimize the risks of accidents, eliminate unscheduled downtime, and maximize production quality. The global process industry loses $20 billion, or five percent of annual production, due to unscheduled downtime and poor quality. ARC estimates that almost 80 percent of these losses are preventable and 40 percent of those preventable losses are primarily the result of human or operator error. Maximizing operator effectiveness requires automating as many functions as technology will allow, as well as reducing complexity wherever possible. For example there are still many plants where operators monitor the processes and collect data manually or semi-automatically using chart recorders. This process is both tedious and error prone, and does not provide appropriate process insight or instill a sense of ownership among the control room operators.
The Abnormal Situation Management Consortium (ASM) points out that most incidences occur from multiple modes of failure. Preventable human error is a contributing factor to these losses, but is hardly the only cause. Preventing abnormal situations requires a multilayered multi-discipline approach focused on maximizing production throughput, efficiency and quality while minimizing lost production time and preventing damage to assets and endangerment to personnel. This approach requires deploying collaborative production systems designed and implemented to be able to deliver high levels of availability and fault-tolerance expected from any other mission critical industrial system. This typically requires effective data backup mechanisms, redundant controllers for critical applications, plus industrial grade software. Manufacturers are also deploying more fault tolerant server technology to ensure continuous availability of these mission critical applications; the continuous flow of vital products to the market; and the avoidance of the potentially negative financial, social, or environmental impact that operating without high availability fault-tolerant systems might bring.
To learn more about preventing downtime in your automation applications, be sure to attend next week's webinar where Craig will provide expert info on steps for reducing the human error that leads to downtime, how to protect your hardware, storage and networks for complete availability coverage, and how to protect against a complete site failure. You can register here.
Show Discussion / Comments (0)
Manufacturing
Downtime
Fault Tolerance
High Availability
Interview
Webcast
Webinar
| More
Wednesday, October 14th, 2009 - 11:19 am EDT
4 Simple Steps to Reducing Downtime
We had a fantastic presentation last week from IT expert and author Niel Nickolaisen. Niel shared his proven methods for reducing downtime and improving the alignment of IT resources to better support business goals. If you weren’t able to attend the live event, you can watch the recorded version here.
If you prefer a white paper format, Niel’s strategies and best practices have also been summarized in a brand-new 8-page white paper, “Reduce Downtime by 70% - Without Spending a Dime” which you can download here.
The Q&A session from the live webinar with Niel Nickolaisen and Michael Bilancieri of Marathon has been summarized below:
Q: Can you give some tips on how I can educate my branch offices about my business continuity plan?
Niel Nickolaisen, CIO: At Headwaters, Inc., we have 120 remote sites. We approached this from an SLA perspective. We translated how the SLAs affected the operations at our branch locations. Then we communicated it and got them to buy into the SLAs and the things we were doing and suggested that they followed our lead.
Q: How often should you update your disaster recovery plans?
Niel Nickolaisen, CIO: In our case at Headwaters, Inc., we have Sarbanes-Oxley regulatory requirements. We do an annual formal risk assessment both for our business and for IT. When we’re done with that assessment we update our disaster recovery plans, which are based on the risks. Our disaster plan is designed to mitigate or recover from the risks that we’ve identified.
Q: How does everRun work?
At a high-level, everRun takes your entire Windows environment and protects it as a whole. Most protect from within the OS but we protect from underneath the OS. We clone to a second system for redundancy in a synchronous fashion. A good way to understand how everRun works is to watch our product demos videos and flash demos available on our website.
Q: How does everRun fit into a virtual environment?
everRun allows the ability to create multiple workloads on a single server. Our technology is based on virtualization technology – we’re virtualizing two instances to appear as one. You can create multiple workloads and put them on the same server and protect them. It’s based on Citrix XenServer.
Q: Will this work in conjunction with SAN offhost backups using Vertias Netbackup and FlashSnap option?
We are agnostic to the storage. If you’re using back-up right from the SAN, that’s fine. You can also use a mirrored option, where we can mirror the entire system in a synchronous fashion. That allows you to have SAN on one side and NAS on the other, or direct-attached, or both. It’s your choice, which gives you greater flexibility. You can separate the servers as well between buildings. The other option is a single copy of storage, not mirrored and both systems can connect to that storage, but the SAN device will then have to protect the data.
Q: How can Marathon contribute to companies considering a move to SAP?
everRun can provide availability and fault tolerant protection to that SAP environment. If you’re considering a move to SAP, I would assume you have had some discussions about how to protect that—the SLA, the data, availability and disaster recovery. everRun can protect and provide disaster tolerance disaster recovery, and high availability for that application, as well as data protection. We don’t cause any changes to the application.
Q: Should Marathon be brought in as a consultant before SAP is contracted?
Sometimes it’s a good idea to have a joint discussion with vendors. A lot of times when you look at availability and redundancy or data replication, it’s doing things to the applications and data and can cause interaction issues. Sometimes the application has to be configured in a certain way, so you want to know up front how your high availability solution could affect the data and application. We can certainly do a call with any other software vendors to have that conversation up front.
Q: What version of Windows does everRun support?
everRun supports Windows Server 2003 32-bit and 64-bit and Windows Server 2008 64-bit.
Q: What kind of performance impact does the synchronous lock-step have on the system?
That varies by application, users, data, I/O, and other factors. In general, it can range from 10-20% on your application – we’ve seen less than that and more than that, depending on the system.
Q: Do you recommend WAN optimization to be used?
Our requirements are around bandwidth between the two systems if you want to separate the systems. WAN optimization tools don’t always help. It’s really a latency requirement to maintain good performance.
If you'd like to learn more about Niel's best practices for aligning business and IT resources, be sure to check out his new book, Stand Back and Deliver: Accelerating Business Agility.

Show Discussion / Comments (0)
Downtime
Availability
EverRun
High Availability
Webcast
Webinar
| More
Friday, October 2nd, 2009 - 10:17 am EDT
Using a Gap Analysis to Reduce Downtime
Congratulations to Thomas Burgdorf of Mii Management Group, the winner of a $100 American Express gift card from our recent everRun 2G demo webinar. If you weren’t able to attend the live event, a recording the everRun 2G demo webinar is now available for on-demand viewing here.
Be sure to join us for our next webinar on Oct. 8th, featuring IT process expert and author Niel Nickolaisen. We're really excited to have Niel as our guest speaker for this webinar. In addition to his 25+ years of IT experience, Niel is the CIO and Director of Strategic Planning at Headwaters, Inc. and also writes regular columns for the CIO Leadership Network and TechTarget's Search CIO. Niel is going to share his proven methods for reducing downtime, including:
* Conducting a gap analysis of your current IT processes
* Identifying weaknesses that can lead to downtime
* Simplifying IT processes so that your entire staff can understand and follow them
We're expecting a large group for this webinar, so be sure to register today to reserve your spot.
Show Discussion / Comments (0)
Webinar
Downtime
EverRun
High Availability
| More
Wednesday, September 23rd, 2009 - 10:09 am EDT
Protecting SQL Server from Downtime
In recent months, Marathon has put together a series of toolkits with materials on reducing downtime and data loss, including toolkits for Citrix XenApp and Microsoft Exchange 2007.
Our latest toolkit is now available, this time for Microsoft SQL Server. Protecting SQL Server from downtime has become even more critical in recent years, as businesses run more of their critical systems, including electronic commerce, online banking, just-in-time manufacturing and streaming media (just to name a few) on SQL.
This toolkit includes materials on SQL Server high availability in both physical and virtual environments.
White paper: 5 Secrets to SQL Server Availability This paper reviews five proven secrets to affordable SQL high availability that will help IT managers implement a SQL Server environment with little or no downtime - and zero data loss.
White paper: The SSWUG.org Increasing Reliability and Availability in a Virtualized SQL Server Environment white paper, authored by Microsoft SQL Server MVP Stephen Wynkoop, provides IT professionals with best practices and considerations for designing and implementing a virtualized SQL environment including:
• Potential pitfalls to avoid when virtualizing SQL Server
• How to increase reliability and availability of a virtualized SQL Server environment
• A SQL Server virtualization case study (Sullivan Group)
On-Demand Webinar: SQL Availability: Protecting your Database and Applications featuring Microsoft SQL Server MVP Stephen Wynkoop, helps IT administrators understand SQL back-up and restore options. Wynkoop also presents his Concentric Rings of Recovery plan, which covers the four levels of preparedness for local, alternate, off-site and remote locations.
Also, be sure to check out some addtional SQL Server resources, including SQL user groups, SQL Server job boards, SQL MVP blogs and Twitter feeds, and other SQL-related info.
Show Discussion / Comments (0)
SQL
Availability
Downtime
High Availability
| More
Tuesday, July 21st, 2009 - 5:35 pm EDT
Q&A with David Hanna of Microsoft
If you’ve been thinking about upgrading to Windows Server 2008, be sure to attend our July 30th webinar featuring guest speaker David Hanna, Information Architect at Microsoft. David will review the new Web tools, virtualization technologies, security enhancements, and management utilities available in Windows Server 2008. You’ll also have a chance to ask David any specific questions you have about Windows Server 2008 during the live Q&A portion of the webcast.
In preparation for the webinar, we asked David to answer a few of the common questions that we have been hearing from our customers in recent months.
Q: One of the biggest concerns we hear from our customers and partners is that in this current economy, IT departments are being asked to do a lot more with less people. How can Windows Server 2008 help with this issue?
Across all of my customers, everyone is talking about cutting costs, and getting more out of their current investments. When we start digging into the features of Windows Server 2008, customers are finding tremendous opportunity to optimize their environments. A few of the major areas of cost savings I’m seeing are:
- Reduced deployment time and costs with Windows Deployment Services
- Reduced management cost and effort with PowerShell and Server Manager
- Hardware and Workload Consolidation with Hyper-V
- Licensing consolidation with Enterprise and Datacenter models for virtual environments.
Q: What about the challenge of managing remote and branch office locations?
Branch offices have consistently been a challenge to manage, primarily due to lack of on-site staff. Windows Server 2008 brings some major new components to the picture that will greatly ease branch office management. These features include the Read-Only Domain controller, which makes the remote DC secure, and replaceable, Distributed File System, Windows Remote Management, Server Core (lower surface attack area), and improved Terminal Services for application delivery.
Q: A lot of our customers work in “always-on” industries like manufacturing, healthcare and broadcast media, where server downtime can be very disruptive to their business. How does Windows Server 2008 support these demanding environments?
Windows Server has always addressed high availability with Clustering Services. Windows Server 2008 has brought some huge enhancements to the Cluster Service that will reduce the complexity of clustering, while increasing availability. Failover Clustering in Server 2008 has a new validation wizard that will validate hardware and software configurations, resulting in easier, more reliable cluster deployments. The reliance on a quorum drive has also been removed, so there is no longer a single point of failure in the cluster. Also, Failover Clustering has been enhanced to support multi-site clusters to support organizations that need site-to-site failover. And, as always, when organizations need to take availability to the next level, Microsoft continues to work with partners like Marathon to extend the native capabilities of Windows Server.
***********************************************************************************************
During the webinar, Michael Bilancieri, Sr. Director of Products for Marathon, will discuss how to extend the high availability features of Windows Server 2008 to fault tolerant protection with Marathon’s everRun software and how organizations can now confidently migrate mission critical applications from Unix or proprietary platforms to realize big cost savings.
Registrations for this webinar are limited and we are expecting a large turnout, so be sure to save your spot by registering today.
Show Discussion / Comments (0)
Webinar
Availability
Clustering
Clusters
Downtime
EverRun
Fault Tolerance
Fault Tolerant
High Availability
Webcast
| More
Wednesday, April 1st, 2009 - 7:51 am EDT
everRun and Exchange 2007 Mailbox Servers
When planning your VM workloads, you should be aware of what level of availability each server will need. By splitting the amount of users across multiple VM’s you can provide a level of availability to each set of users based upon your SLA with each business unit in your company. Looking back to the availability pyramid you can choose which level of availability for each mailbox server is needed. For example if you have an executive group that needs to be up with a 24/7 uptime and only limited downtime then level 3 should be your selection on a separate mailbox server. If all of your business units require the same level of availability and have the same SLA in place then you will split your mailbox servers according to usage. Using the chart from section one we can split the users based upon the type of user. For example if you have 1,000 heavy users we would assign 2 vCPU’s to the virtual machine. Always follow Microsoft best practices when deploying the amount of users per core or vCPU.

If we look at Figure 1 we can see that that we have 4 active VM’s spread across two servers with 2 vCPU’s assigned to each. Looking at the example chart above and using figure one we can see that this design example would support 4,000 “Heavy users”. We achieve this by allowing our storage groups on each mailbox VM to support 1,000 “Heavy” users.

Let’s take a look at a basic design with 3 separate types of users spread across 4 servers. We have an executive mailbox store, a mid-management store, and a general user store. In looking over what each teams HA requirement is we have come to the following, the executive team needs 24/7 up time with no downtime except for a maintenance window once a month. The mid-management team can handle some downtime, but only a few minutes each week. The general users have no HA requirement they can be down for an hour a week if needed. So how do we decided what level of availability we would like to use, it’s easy we simply look at the application availability pyramid and we put the appropriate mailbox store at each level:

By using this simple plan you can simplify you’re HA strategy for Exchange. By distributing the mailbox stores across multiple servers on the same hardware you can save rack space as well as provide individual levels of availability based upon different business unit needs.
Show Discussion / Comments (0)
Availability
Downtime
EverRun
Exchange
Exchange 2007
Marathon
Virtual Machine
| More
Wednesday, December 10th, 2008 - 7:11 am EST
Webinar: Assessing the Impact of Planned and Unplanned Downtime in the Contact Center
Business continuity planning ranks among the top trends in a recent Dimension Data report on contact center technology. Yet many call centers aren’t equipped to deal with unexpected downtime from a system failure. These centers would lose productivity and sacrifice service levels when mission-critical tools like real-time reporting systems go dark.
Real-time reporting provider Inova Solutions, along with new partner Marathon Technologies, will host a webinar to discuss best practices for business continuity and high availability in the contact center. Presenter Scott Thompson from Marathon Technologies will discuss how to protect your real-time reporting investment from costly downtime and data loss.
Participants can register for the webinar here. Details are below:
What: Webinar: “Assessing the Impact of Planned and Unplanned Downtime in the Contact Center”
When: Wednesday, December 10, 2008, 2:00 pm EST
Show Discussion / Comments (2)
Availability
Business Continuity
Downtime
High Availability
Marathon
Partners
Webinar
| More
Monday, November 24th, 2008 - 3:11 pm EST
UNDERSTANDING DIALABLE AVAILABILITY

As many of you know, one of the key components of everRun VM is the ability to dial up or dial down the level of availability needed to protect business-critical applications. With buzz surrounding the release of Citrix’ XenServer 5, we have been approached with questions like “what should I use to protect my low-priority applications” and “how do I know when something should or shouldn’t be protected with the lockstep option?” To help explain the three levels of availability and when they would be used, we’ve put together these tips:
LEVEL 1: BASIC FAILOVER WITH XENSERVER HA
The first level of availability, basic failover and recovery, is appropriate for applications where recovery is not absolutely critical, and where manual intervention, while not desirable, is acceptable. These may include infrastructure applications or dev and test systems.
XenServer HA provides:
- Basic failover to another host within the same Xen pool, with resource calculation to determine whether adequate resources are available within the pool to handle a defined number of simultaneous host failures (XenServer HA does not check the health of available devices, such as network and storage)
- Monitoring of health of the hosts within a pool (Network and storage health are not monitored)
- No storage or data protection – using this level requires a shared-storage configuration
LEVEL 2: COMPONENT-LEVEL FAULT TOLERANCE WITH everRun VM
For applications with business-critical roles, everRun VM provides component-level fault tolerance: the ability to withstand the loss of an individual network or storage component without interruption or downtime.
The attributes of Level-2 availability include:
- Automated setup and fault management: policies handle system, network and disk I/O failures without IT intervention
- Assured recovery of virtual machines
- Zero downtime due to I/O failures and zero data loss
- Synchronous data mirroring between hosts; no need for shared storage
- Continuous active validation of all components on production and standby system to ensure complete redundancy at all times for recovery in the event of a failure
- Comprehensive availability including system, network, and data availability, all in one integrated solution
LEVEL 3: SYSTEM-LEVEL FAULT TOLERANCE WITH everRun VM AND LOCKSTEP OPTION
For the most mission-critical systems, Marathon everRun VM with Lockstep Option provides system-level fault tolerance, with continuous availability in the face of component or system-wide failures. Level 3 will be available in 2009 and offers protection for systems that cannot experience any downtime and must maintain transaction state at all costs. everRun VM with Lockstep Option offers all of the benefits of everRun VM (Level 2), together with:
- Zero downtime even for complete host failures
- Application state maintained during failures
- Memory state maintained during failures
For more information on the different levels of availability please visit here.
Show Discussion / Comments (0)
Availability
Citrix
Continuous Availability
Downtime
EverRun
EverRun VM
Fault Management
Fault Tolerance
Marathon
Virtual Machine
XenServer
XenServer HA
| More
Thursday, November 13th, 2008 - 8:46 am EST
How A Large Furniture Retailer Benefitted From Protecting MS Exchange
Every day, companies around the world rely on the features of Microsoft Exchange for their business-critical applications like email, calendaring, contacts, mobile support, web-based information accessing and data storage support. While we’ve discussed the importance of maintaining Microsoft Exchange high availability and steps to simpler Exchange HA, we thought this would be a good opportunity to share a case study from one of our customers.
Connecting Employees, Vendors and Customers Without Interruption
A large U.S. furniture retailer was becoming increasingly dependent on Microsoft Exchange 2003 for internal communication and collaboration, and for communication with both vendors and customers. Since the retailers primary revenue-generating activities relied on e-mail, downtime would have resulted in serious consequences. As a result, Exchange protection became a requirement and top priority for senior leaders.
everRun –An Alternative to Clustering
The IT staff had previous experience with traditional clustering and was looking for an easier, more robust solution. They selected everRun and implemented a solution using a pair of IBM servers with local boot disks and a fibre channel SAN for the datastore. Currently, the system supports close to 1200 users.
No Exchange Failures = Increased Competitive Advantage
In over a year of operation, they have not experienced a single unplanned disruption of their Exchange system. In turn, this has allowed them to keep their revenue-generating activities operating at full speed. As a result, they are currently looking at adding the everRun SplitSite option to allow geographical separation of their systems for additional protection.
Do you have a story when protecting Exchange would have been a better option than what resulted? How did it affect you or your company?
Show Discussion / Comments (0)
Case Study
Clustering
Downtime
EverRun
Exchange
High Availability
Marathon
SplitSite
Virtualization
| More
Wednesday, October 29th, 2008 - 7:21 am EDT
FIVE STEPS TO SIMPLER EXCHANGE HIGH AVAILABILITY
As we noted in our last post, Exchange High Availability has become increasingly important to businesses of all sizes. To help you get started, we’ve put together these five tips, which are easily-digestible pieces from our “Protecting Microsoft Exchange in Physical and Virtual Environments” white paper.
STEP ONE – PROTECT AGAINST SERVER FAILURES WITH QUALITY HARDWARE AND COMPONENT REDUNDANCY
Server core components include power supplies, fans, memory, CPUs and main logic boards. Purchasing robust, name brand servers, performing recommended preventative maintenance, and monitoring server errors for signs of future problems can all help reduce the chances of Exchange downtime due to catastrophic server failure.
Downtime caused by server component failures can be significantly reduced by adding redundancy at the component level. Examples are: redundant power and cooling, ECC memory, with the ability to correct single-bit memory errors, and combining Ethernet cards with RAID.
STEP TWO – GET RID OF STORAGE FAILURES WITH STORAGE DEVICE REDUNDANCY AND RAID
Storage protection relies on device redundancy combined with RAID storage algorithms to protect data access and data integrity from hardware failures. There are distinct issues for both local disk storage and for shared, network storage.
For local storage, it is quite easy to add extra disks configured with RAID protection. A second disk controller is also required if you want to protect against controller failures.
Access to shared storage relies on either a fibre channel or Ethernet storage network. To assure uninterrupted access to shared storage, these networks must be designed to eliminate all single points of failure. This requires redundancy of network paths, network switches, and network connections to each storage array.
STEP THREE – PREVENT NETWORK FAILURES WITH REDUNDANT NETWORK PATHS, SWITCHES AND ROUTERS
The network infrastructure itself must be fault-tolerant, consisting of redundant network paths, switches, routers and other network elements. Server connections can also be duplicated to eliminate failovers caused by the failure of a single server or network component. Take care to ensure that the physical network hardware does not share common components. For example, dual-ported network cards share common hardware logic, and a single card failure can disable both ports. Full redundancy requires either two separate adapters or the combination of a built-in network port along with a separate network adapter.
STEP FOUR – FORGET SITE FAILURES WITH DATA REPLICATION TO ANOTHER SITE
Site failures can range from an air conditioning failure or a leaking roof that affects a single building, a power failure that affects a limited local area, or a major hurricane that affects a large geographic area. Site disruptions can last anywhere from a few hours to days or even weeks.
There are two methods for dealing with Site Disasters. One method is to tightly couple redundant servers across high speed/low latency links, to provide zero data-loss and zero downtime. The other method is to loosely couple redundant servers over medium speed/higher latency/greater distance lines, to provide a disaster recovery (DR) capability where a remote server can be restarted with a copy of the application database, which only misses the last few updates. In the latter case, asynchronous data replication is used to keep a backup copy of the data.
Data replication is combined with error detection and failover tools to help get a disaster recovery site up and running in minutes or hours, rather than days.
STEP FIVE – CONSIDER VIRTUALIZING EXCHANGE FOR BETTER HIGH AVILABILITY
The latest server virtualization technologies, while not required for protecting Exchange, do offer some unique benefits that can make Exchange protection both easier and more effective. Virtualization makes it very easy to set up evaluation test and development environments without the need for additional, dedicated hardware. Virtualization also allows resources to be adjusted dynamically to accommodate growth or peak loads.
To help you make the business case for virtualization Exchange, we’re producing a live webinar with Citrix on November 11th: Virtualizing Exchange - The Cold, Hard Numbers on Why Citrix XenServer + everRun VM is the Best Platform. Register for the webinar here.
Show Discussion / Comments (0)
Downtime
EverRun VM
Exchange
High Availability
Virtualization
Webinar
| More
Monday, October 27th, 2008 - 6:46 am EDT
The Importance of Maintaining Microsoft Exchange High Availability
For most organizations, email is single-handedly the most important tool for accomplishing business objectives. Without access to email, companies are at an immediate disadvantage in today’s “I want it now” marketplace. For example, let’s look at the impact email downtime has on productivity: Assuming that your employees are 25% less productive when email is unavailable, and their annual salary is $60,000, then every hour of downtime for an organization of 500 people results in more than $7,200 in lost employee productivity. Can your organization bare a $7,200/hour loss? In today’s economy? Probably Definitely not.
Avoiding the aforementioned consequence is an option, but in order to do so you need to guarantee continuous availability for your organizations email server. According to Paul Rubens at ServerWatch, 2007 forecasts from Gartner revealed that Microsoft Exchange 2007 will own 70% of the email market share by 2010. Now, whether Microsoft will actually return those results, it’s still too early to tell. However, as more and more companies rely on Exchange servers to run business functions, all potential causes of unplanned downtime need to be identified and eliminated.
Over the next month, we will be providing you with some recommendations on how to improve Exchange high availability through planned and unplanned downtime – starting with a webinar on November 11 titled “Virtualizing Exchange – The Cold, Hard Numbers on Why Citrix XenServer and everRun VM is the Best Platform.” For this webinar, Jerry Melnick, Marathon CTO, and Matt Fairbanks, VP of Product Marketing for Citrix Virtualization and Management Division, will team up to discuss how the latest server virtualization technologies keep users continuously connected to Microsoft Exchange servers in the easiest and most effective manner. We encourage you to register online for the webinar if you haven’t already.
Is there anything in particular related to protecting your Exchange severs you would like us to address in the next few weeks? Leave us a comment below and we will be sure to put it on our radar.
Show Discussion / Comments (0)
Availability
Citrix
Continuous Availability
Downtime
EverRun
EverRun VM
Exchange
High Availability
Marathon
Virtualization
Webinar
XenServer
| More
Friday, October 24th, 2008 - 11:38 am EDT
Asking the right questions to ensure the right solution
As a result of economic turbulence, companies of all sizes continue to explore virtualization as an option for shedding costs. With the growing number of virtualization options available, it’s important not to let your organization fall victim to virtualization buzzwords. Not all vendors offer the benefits of virtualization, yet many claim they do.
With that being said, when deciding which solution to implement within your organization, IT decision makers should be prepared with an arsenal of questions to ask each provider – doing so will eliminate the typical “fluff” vendors use to sell their supposed virtualization solutions. Having all your questions answered will ensure that you get the most appropriate and highest quality solution for the applications you wish to protect. The following is a list of questions that might assist IT professionals in making their virtualization-related decisions, and some other considerations we offer:
- Should I start to deploy on a small scale or implement everything at once? Answers will vary depending on the size and flexibility or your organization. It’s important that the vendor understand the nature of your business and the value of your critical data before making a suggestion. Whether you are a small, nimble organization with the ability to deploy on all critical apps, or a large enterprise with procedural requirements that prevent you from total deployment, the implementation strategy should be tailored to your needs. There is not a “one size fits all” virtualization strategy.
- How much should I consolidate? We usually suggest phased deployment – start from scratch with the applications and environments that aren’t so mission critical, and then continue deploying as you see appropriate. It’s important to make sure that the vendor you have chosen can support your initiative.
- If I do decide to consolidate, does the server virtualization option I have chosen also meet my application availability needs? Since the implications of downtime in virtual environments have become greater, understanding solutions used to protect business critical applications is crucial. Some important things to consider are:
- Is the solution a “one-size-fits-all” approach, or does it offer flexible protection?
- Does it support different levels of availability for your applications?
- What will my cost savings be?
- How am I going to manage the virtualization solution? The manageability of virtual machines is a different dynamic, especially if this is your first time dealing with virtual machines. The IT processes and management needs are very different. A plan for management must be in place in order to have a successful solution – otherwise you’ll find a lot of redundancy and the need for unnecessary maintenance.
- What do I need for security? As higher applications are moved to the virtualization environment, security disciplines need to move as well.
These are just a few examples that should help get the conversation going. Has anyone deployed a virtual environment that wasn’t the right fit? What were the repercussions and what needed to be done to correct it?
If anyone has any questions they wish they had asked prior to purchasing, please leave them in the comments below and we will be sure to add them to the list.
Show Discussion / Comments (1)
Availability
Downtime
Marathon
Virtual Machine
Virtualization
Virtualization.info
| More
Tuesday, August 26th, 2008 - 11:54 am EDT
Vehicle Manufacturing Executives Talk About everRun
In the vehicle manufacturing industry, companies want an efficient and economical way to ensure smooth operation of all servers, software and applications. Any instance of unscheduled downtime could lead to a loss of data, or in a worst case scenario, to a complete disruption of production and services.
Serve customer needs online without interruption
One European vehicle manufacturer, who understands the importance of protection against downtime, has been using Marathon solutions since 2000. As their security needs as an organization have grown, so has their relationship with Marathon. They began by using the Endurance 4000 system to help protect their forklift management system. Three years later, they upgraded to everRun FT to further safeguard files and applications and to ensure continuous server availability.
The implementation of everRun FT gave the company the opportunity to undertake other IT projects to maximize efficiency and reliability. They were able to establish a centralized network to allow the entire staff to access all applications and system updates remotely.
Defend 24/7 operations with Marathon everRun FT software
With these new initiatives in place, one company executive says that it is now more important than ever for applications and servers to be accessible 24/7 – no matter what. “A disruption to the provision of data and applications would affect every employee, and in the worst case scenario, halt operations altogether,” said the executive.
The company uses both Marathon’s everRun FT and SplitSite to allow two servers to operate simultaneously in 100 percent lock-step. SplitSite provides an additional layer of protection against larger scale failures and disasters. This means that the two servers create a single virtual environment and if one fails, no downtime will occur and all software, applications and data will continue to run on the remaining server.
The organization utilizes several levels of security, including a single server, a Windows cluster, and a Marathon System, but all of their most important and mission-critical applications are operated on everRun FT.
Elimination of system failure and increased competitive advantage
Any instance of unscheduled downtime would impact not only the company’s main factory, but also their several hundred other outlets. If an employee was unable to connect to the network because the server was down, all data on customers and products would become unavailable – this could mean a stall on productivity and unhappy customers. The company executive maintains that this is no longer a concern, thanks to Marathon. “With everRun FT, we no longer have to worry about downtime.”
Show Discussion / Comments (0)
Availability
Case Study
Downtime
EverRun
Manufacturing
Marathon
| More
Wednesday, July 30th, 2008 - 11:56 am EDT
Preventing Disaster Rather than Recovering from It
We all like to think that we will be prepared in the event of an emergency, or a disaster. Hospitals exist if we fall sick; fire stations surround us if flames break loose; we are constantly preparing so if a catastrophe strikes, we are ready.
Preparing for a system’s disaster is no different. However, how to go about preparing for an event like this can be confusing. There are many options out there when it comes to protecting your system, each best suited for specific requirement. Unfortunately, many vendors use terms like disaster recovery and high availability interchangeably to describe their solutions when in fact they are usually designed for one or the other.
Disaster Recovery (DR) is the way to recover applications and from a system failure. DR is a reactive solution where if a failure occurs, IT relocates the data, builds the system over, and brings everything back up to working order. This takes time, a precious commodity that typically businesses relying on critical applications don’t have. In addition, recovering applications could bring about a number of side effects which you really don’t want to endure every time some minor failure happens.
But what if I could tell you that instead of worrying about how to recover from a computer system failing, you could simply prevent it from occurring at all?
Disaster tolerance (DT) is a proactive way to prevent system failure from impacting application and data availability. A disaster tolerant solution isn’t going to recover the data if there’s a disaster. Instead it will tolerate the fault if a disaster occurs – keeping an organization’s critical applications up and running at all times. It is not recovery, but rather prevention. And with solutions like our everRun SplitSite, separate servers don’t even need to be in the same building – they can be up to 100 miles apart with fault-tolerant protection between the two locations.
DR solutions are good for applications that can afford some downtime while you recover them. But for essential applications like Microsoft Exchange, SQL, and SharePoint, which need to be available all the time, disaster tolerance is often the best way to go.
So what combination of DT and DR protection would work best for your company’s applications?
Show Discussion / Comments (0)
Availability
CIO
Disaster Recovery
Disaster Tolerance
Downtime
EverRun
Exchange
Fault Tolerance
High Availability
Marathon
Sharepoint
| More
Tuesday, June 17th, 2008 - 6:41 am EDT
Current HA Solutions Fail to Deliver What Customers Want
A research report by IDC’s virtualization guru, John Humphreys, The Future of Virtualization: Leveraging Mobility to Move Beyond Consolidation highlights the fact that the automatic restart used by most high availability solutions for virtualization fails to deliver what most customers really want and need. Here is what John has to say:
“To address unplanned downtime today virtualization companies are providing an automatic restart capability if the hypervisor or host go down for whatever reason. While this is a good start to trying to combat the lost revenue associated with unplanned outages, ultimately knowing what is happening at the hypervisor and hardware layers fails to deliver customers what they most want — application-level awareness and action. In this way, current HA solutions in the virtualization market are "blind from the waist up." That is, they do not know what is happening inside the virtual machine. They do not know if the operating system or application has stopped working, and that is ultimately what IT professionals charged with delivering application services most care to know.”
If you would like to learn more about high availability for virtualization, how to get application-level awareness and what that can buy you, we encourage you to join the webinar Thursday, June 26 at 11:30 EST. with John Humphreys (IDC), Simon Crosby (Citrix) and Jerry Melnick (Marathon).
For more information or to register visit here.
Show Discussion / Comments (0)
Availability
Citrix
Downtime
High Availability
Hypervisor
IDC
Marathon
Simon Crosby
Virtual Machine
Virtualization
| More
Wednesday, April 23rd, 2008 - 1:30 pm EDT
The greener side of everRun VM
Do you know what capacity of your server is currently being utilized by your organization? No? Well you should. If you can recognize where the gaps in your current server environments are, then it’s easier to analyze the potential savings and opportunities for deploying green technology.
Each year on average an organizations data center only outputs 10-15% of its total processing capacity. By deploying a virtualization strategy, organizations can raise their hardware use to 70-80% resulting in a much more efficient use of corporate capital.
We’ve seen for quite some time now that executives are still iffy on whether or not to deploy a virtualization strategy. For the most part they understand the benefits: reducing energy expenditures, consolidating physical server space, decreasing their environmental footprint, etc. What seem to be the common causes for concern are the effects suffered during the deployment – fear of data loss and the possibility of downtime.
If VARs introduce their customers to high availability technology, such as everRun VM, and offer it alongside their virtualization solutions, then customers will increase their willingness to virtualize their applications – knowing they will be protected from downtime and data loss.
This week, Steve Keilen, VP of Marketing for Marathon, discussed the green side of everRun with eWeek Channel Insider and how it has aided VARs.
What green strategies does your organization currently have in place to maximize productivity and cut energy costs?
Show Discussion / Comments (0)
Availability
Downtime
EverRun
High Availability
Marathon
Virtualization
| More
Monday, April 14th, 2008 - 5:57 am EDT
Top Tips to Know Before Getting Started with Server Virtualization
As demand rises for IT managers and C-level executives to embrace virtualization, we thought it would be worthwhile to share our insights into adopting server virtualization. The tips below are designed to help companies determine if server virtualization is right for them and how they should prepare to ensure a successful initial implementation.
Tip #1 – Make the business case for server virtualization
Before implementing server virtualization, IT executives should assess whether the technology will provide a reasonable return on investment. They should first take a look at how they are using servers today and ask themselves:
- Do they have common applications running on a number of different servers?
- Do they have enough servers that could and should be consolidated? Is the number of applications increasing and the capacity required for the applications continuing to expand?
- Do they expect the number of servers they have to buy each year to increase? If so, by how much?
- Is the business planning to undertake other large-scale technology implementations, and if so, how will this fit with a possible virtualization development?
In short, the IT department has to make a strong business case and justify the virtualization investment to ensure executive management support. To help conduct a quick assessment, ROI calculators are available from VMware and Marathon. Marathon also has a webinar available with John Humphreys, Program Vice President, Virtualization Software for IDC, “Making the Business Case for Server Virtualization,” in which John provides essential guidance in making your business case.
Tip #2 – Consider the license and support implications
IT managers need investigate what, if any, impact virtualization will have on their application licenses and support. Depending on the application, the original licensing terms and conditions may no longer apply after the applications have been migrated to the virtualized environment. In addition, it may be that the providers of some of the software applications do not support virtualization systems, and are unwilling to offer technical support for the applications after the migration to a virtualized environment.
Tip #3 – Afford to spend the time to plan
As any IT professional knows, implementing a new system requires dedicated resources, budget and time. Industry experts have estimated that the planning stage constitutes 90% of a virtualization implementation project. The actual migration is relatively simple to undertake provided that the implementation has been well-planned. Any system information to be migrated should be collated and backed-up up to six months before the start of the migration. IT managers should remember to assess how much hardware each virtual machine needs in order to operate efficiently. They also need to ensure that the number of virtual environments residing in a single hardware does not sprawl out of control - this could have serious consequences on the stability of the environment and application availability. A thorough implementation plan will help businesses minimize any hiccups that might arise.
Tip #4 – Assess levels of application availability and risk to business continuity
Despite the many benefits of virtualization, businesses are beginning to realize that there are risks associated with the technology. While virtualization is useful for protecting applications from planned downtime, protecting virtual environments from unplanned downtime is a different matter. Today, the cost of just a few minutes of unplanned downtime can be hugely detrimental and with virtual environments the risk is greater because server consolidation often results in a single point of failure for multiple applications. Businesses should therefore consider a solution that combines virtualization technology with the high availability protection necessary to keep the business going through disruptions.
Tip #5 – Demonstrate that virtualization won’t impact end users
After the implementation, it is critical to demonstrate to executive management that virtualization can be accomplished without hindering applications performance and without diminishing service to end users. IT managers should monitor closely the performance of initial deployments, and if necessary, modify hardware and networking configurations to ensure that the virtual environment is completely transparent to end-users.
Emerging virtualization technologies are opening doors by removing existing barriers of entry such as cost and complexity. More and more businesses of different sizes are starting to reap the benefits of server virtualization. But, before embarking on a virtualization project, organizations should assess their needs carefully, choose the right technology, make sure the implementation doesn’t impact end user performance - and start small.
If you have additional tips or recommendations that have helped you and your organization successfully deploy server virtualization, please leave a comment here and join the discussion!
Show Discussion / Comments (2)
Availability
Downtime
High Availability
Marathon
Virtual Machine
Virtualization
VMware
| More
Wednesday, February 13th, 2008 - 11:53 am EST
How BlackBerry’s outage could have been prevented
This past Monday millions of BlackBerry users were hindered with email downtime for three hours due to the company’s second major outage in less than a year. Customers immediately jumped into online discussions and on blogs like here and here to find the root of the problem. The outage is believed to have been caused by the failure of one of two Internet addresses that relay e-mail from corporate servers. Jack Gold, a technology analyst from J.Gold Associates stated something we are very familiar with:
“Any time you got a system that's got a NOC, a Network Operations Center, you have the potential for a single point of failure.”
A great point Gold raises (where our expertise in high-availability comes into play) is that if a company isn’t able to have enough redundancy in the NOC, then why don’t they have a technology in place to make sure there isn’t a single point of failure?
There’s no way of knowing how much business was lost for BlackBerry or BlackBerry cell phone carriers during the downtime; however one thing is for sure, if RIM had implemented a fault tolerant solution, then “routine upgrades” would not create such a fiasco amongst customers.
Hopefully they recognized this mishap as a lessoned learned.
Show Discussion / Comments (0)
Availability
Downtime
Fault Tolerant
| More
Wednesday, December 19th, 2007 - 9:16 am EST
The fate of Christmas lies in the hands of everRun FT
Twas the night before Christmas and all through the warehouse not a computer was working not even the mouse (OK we’ll stop there)….While everyone else is sipping on “spiked” egg nog and enjoying a hearty Christmas meal, we at Marathon are working to ensure that children worldwide are met with rewards for their yearly good behavior. Here’s the “official” press release for your enjoyment:
X December 2007— Saint Nick Enterprises (SNE) today announces the results of a new implementation using Marathon Technologies’ business continuity software. The technology has enabled SNE, the world’s busiest toy manufacturing and delivery business, to attain less than one hour’s downtime in the past 365 days.
SNE CEO, Santa Claus, employs 2,500 full-time elves at its company headquarters in Lapland to supply its multi-billion customer base from over xxx countries globally. As such, the company relies on the maximum possible number of operating-days per annum to avoid delivery disappointments.
Saint Nick Enterprises is using Marathon everRun FT to protect the IT infrastructure of its gift factory and global distribution centre. everRun FT is providing fault-tolerance across the SNE mission-critical document management system, for letter registry. This ensures that letters and other communications received are forwarded for processing to Claus’ helpers, who assess each child’s wishes against whether they have been well-behaved in the past 12 months. Since implementing everRun FT, the letter registration system, gift factory and distribution centre have suffered no downtime at all.
Claus, also known as Père Noël and Father Christmas around the world, comments, “We need to be able to not only produce quality toys, but also ensure that all presents are delivered on time to children and child-minded adults around the world. That is why it is absolutely crucial that our IT systems won’t let us down during the busiest time of the year in the lead up to Christmas.”
All data is also safeguarded by Marathon’s continuous data protection tool, everRun CDP, which captures and replicates data on children and their Christmas wish lists in real-time, allowing elves to quickly and simply roll back and recover information from any point in time in case of a catastrophic failure.
To protect other important systems from disruptions, including the automated reindeers’ Nourishment Satisfaction Platform (NSP), SNE uses everRun HA high-availability technology. In the six months that the technology has been in place, the NSP has achieved 99.9 per cent availability, ensuring that the world famous high-performance delivery system, led by Rudolph and his team, have not missed a single meal because of unplanned downtime.
Marathon SplitSite provides a further level of protection for the factory. even if the entire site fails, all IT systems will remain operational on a remote site located approximately 100 miles from the main site, in the north-eastern corner of Lapland. Claus comments: “Mrs Claus is an avid baker but on occasion, she does forget to turn off all the industrial cooking machinery. A few weeks ago after having spent hours preparing hundreds of mice pies for the elves, she went to bed having left the oven on. The whole kitchen and many nearby rooms, including our server room, went up in flames. But thanks to SplitSite, operations in the factory were not disrupted for even a second.”
Nick Turnbull, Director of Sales, Marathon Technologies, adds: “Any application downtime would be disastrous for Saint Nick Enterprises. If data was lost, Mr Claus might deliver the wrong presents to the children: well-behaved children might not receive any presents, or badly-behaved children might get the best toys. Any such mistake would not only ruin Christmas for millions of families around the world but also severely compromise Santa’s reputation.”
Happy Holidays from all of us here at Marathon! :)
Show Discussion / Comments (1)
Downtime
EverRun
High Availability
Marathon
| More
