I have a 3 node cluster with a iSCSI / MPIO CSV. This has been running for about 1 1/2 years with no issue.
Host 1 is the current owner of the CSV. I verified that by going to Disk Management on host 2 and host 3, and seeing that for the CSV disk, they both have the 'disk is offline because of a policy set by an administrator' message. Host 1 does not have that message which makes it the owner.
Recently, whenever any VM function is attempted on host 2 (like start a VM, live migrate a VM, shutdown a VM, etc..) I get a non-stop flow of event id 153 and whatever process was started takes longer than normal to complete. If the VM does start after a long delay, access to the VM is slow and choppy from the end user's standpoint.
If I migrate or shutdown all VMs on host 2, the 153 messages stop.
Host 2 itself is never slow or laggy. Only the VM operations are slow.
Host 1 and host 3 DO NOT have ANY event id 153.
Does anyone have any ideas why this single node is displaying this behavior?
Thanks in advance!