Saturday 27 December 2014

4613 errors

My drive monitor just reported 4613 errors on one of my servers. Investigation revealed several problems.

1) The RAID array had "dropped out" and couldn't be accessed. That was generating all the errors.

2) Another drive was away with the fairies, and I still don't know why.

and after I rebooted ...

3) A third drive was reporting that it was a zero-gigabyte drive. This is a known problem with Seagate 1TB drives (and others).

I gave that third drive the usual treatment for this problem, but that didn't fix it, so the drive has been replaced and has joined my pile of drives for putting in geocaches. The second drive decided to return to the land of the living, and the RAID array worked again after a reboot; it just needs a good fscking to get it back into action.

The drive that is now pushing up daisies was one of my backups. I think I described my backup system in another blog post; this is the backup that is done on the 11th to the 20th of each month. So all I need to do is redo that backup, and the server will be ready for full use again.
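
For the curious, here is a rough Python sketch of how a date could be mapped to a backup drive in a scheme like this. Only the 11th-to-20th window is taken from the description above; the three-way split of the month, the slot numbers and the name backup_slot are assumptions made up for illustration, not the actual scripts.

```python
import datetime


def backup_slot(day: datetime.date) -> int:
    """Return the (hypothetical) backup slot, 1 to 3, in use on a given date.

    Assumed split: days 1-10 -> slot 1, days 11-20 -> slot 2,
    day 21 to the end of the month -> slot 3.
    """
    # Integer-divide the zero-based day by 10, and cap it so that
    # the 31st still lands in the third slot.
    return min((day.day - 1) // 10, 2) + 1


if __name__ == "__main__":
    today = datetime.date.today()
    print(f"{today}: backup slot {backup_slot(today)}")
```

With that split, the dead drive would be slot 2, so redoing the backup just means running the slot 2 job against the replacement drive.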

