Zoho Outage on Sep 26, 2013

We experienced an outage yesterday, September 26, that affected all Zoho services. The outage started around 2:32 pm, PST. This was due to the network disturbance caused by a misbehaving access switch in our primary datacenter LAN. This switch actually has redundancy built in. The switch was losing packets, but didn’t fail fully, and so the backup switch didn’t take over.

Our team took the troublesome switch off the network, and had the backup switch take over. Most of the Zoho services including Zoho CRM which were hosted in a different network were up within the first hour. Actual downtime of Zoho CRM and other services was 52 minutes. Zoho Mail, which was hosted in the same network as the failed switch, was the most affected, and it took around three hours to restore full service.

We are still analyzing the root cause of this issue, and we will post our observations, corrective actions, as we get more insights into the events that led to the outage.

Any downtime is painful, and we are investing in both infrastructure and R&D to avoid downtime. We apologize for letting you down yesterday.

Comments

3 Replies to Zoho Outage on Sep 26, 2013

  1. Sridhar, with all due respect. The level of service we receive from Zoho Corporation has fallen short of expectation. The product is getting buggier. I am not surprised you are having outages. The same processes that Service Desk brings to the table (ITIL) are not followed by your own organization. Its time for a change and improvement.

Leave a Reply

Your email address will not be published.

The comment language code.
By submitting this form, you agree to the processing of personal data according to our Privacy Policy.

Related Posts