As you’ve undoubtedly noticed, OSNews has been down most of the day. We’re back up now, running like we always do. We rarely have this kind of extensive downtime, so in the interest of full disclosure, here’s what happened. If you’re expecting some sexy story – think again.
After I notified our sysadmin that OSNews was dishing out 324s, he got back to me they were already working on it. We had the same issue for a really short while a few days ago, but it got sorted then. Our sysadmin, Jon Jensen, over at Endpoint checked remotely what the heck was going on, and confirmed it wasn’t actually a software issue – it was a hardware issue, more specifically, a RAID issue. Had the RAID controller failed?
Since this obviously couldn’t be addressed remotely, Jon contacted the datacentre, and when they got back to him, the true extent of the problem was revealed. The quote from the datacenter tech? “We have fixed the cabling issue with your server and your server is now back online.”
Believe it or not, the connections of the disk cables to the controller had come loose. After the first outage happened a few days ago, they were apparently still somewhat connected, only to fully come loose today. Quite a feat, for an immobile server in an immobile rack.
The funny coincidence is that this all happened while we were also working to set up a playground subdomain for the next version of OSNews we’re currently working on. Maybe it’s a sign…