Starting at May 20 14:45:12 and lasting until 15:05 CDT today, ipHouse
experienced problems from a NetApp filer that stopped responding on
one of its two redundant controllers. To the redundancy system,
everything appeared up, but to the outside systems mounting this
filer, things weren't responding from the first controller, but the
second controller was normal.
This affected various forms of webhosting & mailboxes at ipHouse that
stopped and started at random, but nothing for managed colo, or
virtual servers of any kind. Since all the backend systems were
blocking waiting for disk access, they started falling out of the load
balancers almost all at once.
Service was restored by having the working controller take over all
the duties for both, which automatically had the trouble controller
reboot and take it back when done.
Everything is running fine now, and all health checks pass on both
controllers with nothing in the logs as to what happened. :-[
Any email pending delivery during this period would have been queued
and the sending servers will try again for delivery.
If you still have any problems or questions please let us know at
[log in to unmask], or call us up at 612-337-6340.
Doug McIntyre <[log in to unmask]>
-- ipHouse/Goldengate/Bitstream/ProNS --
Network Engineer/Provisioning/Jack of all Trades