Quantcast
Channel: High Availability (Clustering) forum
Viewing all 2306 articles
Browse latest View live

Unable to correct cluster quorum settings

$
0
0

I have a Windows Server 2012 cluster with 2 servers, a witness disk, and a storage disk. The quorum is set to Node majority but the recommended setting is Node and Disk majority. This can't be set however because the wizard says "an appropriate disk could not be found".

How do I correct this?


Jonathan


Windows 2008 R2 SP1 Exchange DAG event 1196 and 1579 Bad DNS Key

$
0
0

Hi,

I'm running an Exchange 2010 SP1 DAG on Windows 2008 R2 SP1.  We are using an INFOBLOX appliance for DNS resolution.

DNS resolution is working and SRV records are created.

On one of the two clusternodes I get warning 1579 and error 1196 every 15 minutes.  After looking around on the internet and technet, I found kb 977158 stating the problem and resolution.  However this hotfix is not applicabel on Windows 2008 R2 SP1 .

How can I resolve these annoying events?

 

Event 1196 failoverClustering:

Cluster network name resource 'Cluster Name' failed registration of one or more associated DNS name(s) for the following reason:

DNS bad key.

.

Ensure that the network adapters associated with dependent IP address resources are configured with at least one accessible DNS server.

 

Event 1579 Failover Clustering

Cluster network name resource 'Cluster Name' failed to update the DNS record for name 'DAG.domain.local' over adapter 'NIC1-LAN'. The error code was 'DNS bad key. (9017)'. Ensure that a DNS server is accessible from this cluster node and contact your DNS server administrator to verify the cluster identity can update the DNS record 'DAG.domain.local'.

 

 

Frederik

Cluster Drive letter missing after failover

$
0
0

Hi Everyone,

I have a 2 node file cluster on windows server 2008 R2 SP1, was well working before. However since few days its giving me trouble. 

Node1=FS01

Node2=FS02

when ever there is a failover from FS01 to FS02 the E Drive (shared SAN drive) doesn't appear on the server however the Quorum is available, the disk management does not allow me to bring the E Drive online as the option is degraded also there is an error "disk offline due to policy set by administrator" something like that. When the cluster is moved back to FS01 the E Drive works fine.

Regards

Abdul Majeed.

SoFS multi-site

$
0
0
Is there any MS documentation on SoFS in a multi-site architecture?

Clustering in Windows Server 2008 R2 SP1- Running Applications

$
0
0

We have 6 Servers each Having .net based applications and SQL 2008 R2 Databases.

We want to make all these applications High Available.

We have about 6 Free Servers; so my questions are:

1- Can we make ready one server and make cluster with the Running Server to achieve the High Availability?

2- What other Possible options to gain lowest level of High Availability in current setup?

Thanks

Windows 2003 Cluster issue

$
0
0

HI I am running windows 2003 OS which is configured clustering ( 2node ) All the while it was running successfully today after reboot one of the node does not come up, I verified the disk signature, hidden drivers where the cluster drive is started , Cluster Netwrok drive had an ! yellow exclamatory mark which i fixed.

Now on the faulty server i see the disk as "unreadable disk"

i tried to run /fixquroum and the cluster server starts and non of resource or disk is online.

I will be pasting the cluster log shortly.

Thanks

Dexter

2012 R2 - Clustering a High Availability File Server (File Server for General Use)

$
0
0

As per the post title.

I have a new 2012 R2 cluster on a Dell VRTX (2 Blades and local shared storage)

Quorum created plus large CSV for the Hyper-V's etc.

I run the wizard to create a High Availability File Server, then choose 'File Server for General use', enter server name and IP address. Then get a window asking for me to select storage - it empty and tells me I don;t have any storage.

I have a 2.7TB CSV, is that not available to use?

What do I need to make available for this?

Am I missing something obvious?

Any help, assistance, suggestions appreciated, thanks!


Is it OK to modify a CSV directly now in Windows Server 2012?

$
0
0

I have opened a case with MS Support and went back and forth for about 2 months trying to get them to answer this question and ultimately they said they could not and directed me to post here.

Here is the simple question:

In Windows Server 2008 R2, there was a pop-up that said basically not to make changes to a CSV through non-VMM tools. I interpreted this to mean, as did many others including OEMs like NetApp, do NOT make changes through Explorer or CMD line tools without bringing the CSV offline in maintenance mode. This is a real issue for uptime, because as you know, when deleting a VM in Hyper-V, the VM's directory and associated files are not deleted. This leaves orphaned files and takes up space on the CSV drives. We would like to delete these files, but in 2008 R2 we did not because we were warned by OEMs and the pop-up; so we had to open maintenance mode each time.

In Server 2012, is this issue still true? Can we open Explorer or a CMD line and create or delete files or directories on a CSV without bringing the CSV into maintenance mode? We need a response from the MS Product Group, as this is going to become our new standard.

Thank you,



DPR



Destroying/Re-creating a Failover Cluster - Server 2012

$
0
0

Hi all,

Some background:

We have a 4-node cluster hosting our Hyper-V environment, and for some or other reason we keep having issues where nodes will just become unresponsive, with the logs indicating an issue on the cluster.

Short of destroying the cluster, we've tried everything to resolve this, i.e. remove and re-install Hyper-V and Failover Clustering, checked Windows updates, double-checked the configuration on the hardware, etc.

Now, my question:

If I destroy a cluster through cluster manager, and then re-create it, adding back the CSV storage, will the "new" cluster pick up that there are Hyper-V servers in those CSVs and recover those roles? Is there a way to back up and restore the existing roles prior to destroying the cluster?

Thanks in advance for any guidance.

Sebastian

Moving a VM

$
0
0
How do you move a VM server from one hyper V manager to another?

ClusterPrepareSharedVolumeForBackup() fails on Window 2012 CSV

$
0
0

I've a Windows sever 2012 RC Hyper-V cluster with VM using CSV. There I'm seeing ClusterPrepareSharedVolumeForBackup() failing with error ERROR_INVALID_PARAMETER. It works fine on 2008 r2. Any idea what could be the issue here.

Unexplained redirected IO activity / Excessive network traffic on private cluster network

$
0
0

Hello,

I'm seeing a lot of traffic going over my cluster network that may be causing stability issues with my cluster.  Communication to nodes would be randomly lost and kicked out of the cluster; Access to fail over cluster manager would stop working on certain nodes due to rpc errors; CSV time out issues. 

All member nodes have been fully patched with windows updates and recommended fail over cluster hotfixes.

I noticed on one of my nodes that a large number of network packets were going through a network interface that was designated for cluster heartbeat traffic only. About 7-10k packets/s read and write.

After further investigation and looking at some CSV perfmon counters, I noticed there was a significant amount of redirected WRITE IO occuring even though my CSV volumes were not in redirect mode.    Does anybody have any thoughts as to why there is so much redirect IO, and why it's going over my cluster network?

Any help would be appreciated.

Kind Regards,

Norman Phengvath

Cluster enabler software requirement

$
0
0

I need to implement Microsoft Geo cluster across two different data centers. I am using EMC stroage. Do I need to Install Cluster enabler software ?

If yes, What is the need for storage vendor cluster enabler software(SRDF) ?

One my friends have installed MS geo clusters with emc storage with out cluster enabler software.

Please help me to understand the need for cluster enabler software.


Extend CSV size after extending Lun

$
0
0

I have a Hyper-v cluster with Netapp Storage. After enlarging the LUN on the netapp is ran the diskpart and did an extend. This is showing up in diskmanager but the CSV stays the same size. How can i extend that.


Windows 2008 R2 Failover

$
0
0
We have a 2 node Failover cluster created in a Node and Disk Majority (recommended) condition.  The nodes have shared Quorum and Data Disks on iSCSI SAN.   The only application on these nodes is SQL Server.   When we tested the failover by moving the application, we found that the Quorum disk did not failover and the services did not start automatically.    My readings on Windows 2008 R2 failover suggests that the first situation with the Quorum is normal but I don't know about the services.   Can anyone assist?   This is our first go at Failover.

Cluster shared volume disappear... STATUS_MEDIA_WRITE_PROTECTED(c00000a2)

$
0
0

Hi all, I am having an issue hopefully someone can help me with. I have recently inherited a 2 node cluster, both nodes are one half of an ASUS RS702D-E6/PS8 so both nodes should be near identical. They are both running Hyper-V Server 2008 R2 hosting some 14 VM's.

Each node is hooked up via cat5e to a PromiseVessRAID 1830i via iSCSI using one of the servers onboard NICs each, whose cluster network is setup as Disabled for cluster use (the way I think it is supposed to be not the way I had originally inherited it) on it's own Class A Subnet and on it's own private physical switch...

The SAN hosts a 30GB CSV Witness Disk and 2 2TB CSV Volumes, one for each node labeled Volume1 and Volume2. Some VHD's on each.

The Cluster Clients connect to the rest of the company via the Virtual ExternalNIC adapters created in Hyper-V manager but physically are off of Intel ET Dual Gigabit adapters wired into our main core switch which is set up with class c subnets.

I also have a crossover cable wired up running to the other ports on the Intel ET Dual Port NICs using yet a third Class B Subnet and is configured in the Failover Cluster Manger as internal so there are 3 ipv4 Cluster networks total.

Even though the cluster passes the validation tests with flying colors I am not convinced all is well. With Hyperv1 or node 1, I can move the CSV's and machines over to hyperv2 or node 2, stop the cluster service on 1 and perform maintenance such as a reboot or install patches if needed. When it reboots or I restart the cluster service to bring it back online, it is well behaved leaving hyperv2 the owner of all 3 CSV's Witness, Volume 1 and 2. I can then pass them back or split them up any which way and at no point is cluster service interrupted or noticed by users, duh I know this is how it is SUPPOSED to work but...

if I try the same thing with Node 2, that is move the witness and volumes to node 1 as owner and migrate all VM's over, stop cluster service on node 2, do whatever I have to do and reboot, as soon as node 2 tries to go back online, it tries to snatch volume 2 back, but it never succeeds and then the following error is logged in cluster event log:

Hyperv1

Event ID: 5120

Source: Microsoft-Windows-FailoverClustering

Task Category: Cluster Shared Volume

The listed message is:Cluster Shared Volume 'Volume2' ('HyperV1 Disk') is no longer available on this node because of 'STATUS_MEDIA_WRITE_PROTECTED(c00000a2)'. All I/O will temporarily be queued until a path to the volume is reestablished.

Followed 4 seconds later by:

Hyperv1

event ID: 1069

Source: Microsoft-Windows-FailoverClustering

Task Catagory: Resource Control Manager

Message: Cluster Resource 'Hyperv1 Disk in clustered service or application '75d88aa3-8ecf-47c7-98e7-6099e56a097d' failed.

- AND -

2 of the following:

Hyperv1

event ID: 1038

Source: Microsoft-Windows-FailoverClustering

Task Catagory: Physical Disk Resource

Message: Ownership of cluster disk 'HyperV1 Disk' has been unexpectedly lost by this node. Run the Validate a Configuration wizard to check your storage configuration.

Followed 1 second later by another 1069 and then various machines are failing messages.

If you browse to\\hyperv-1\c$\clusterstorage\ or\\hyperv-2\c$\Clusterstorage\, Volume 2 is indeed missing!!

This has caused me to panic a few times as the first time I saw this I thought everything was lost but I can get it back by stopping the service on node 1 or shutting it down, restarting node 2 or the service on node 2 and waiting forever for the disk to list as failed and then shortly thereafter it comes back online. I can then boot node 1 back up and let it start servicing the cluster again. It doesn’t pull the same craziness node 2 does when it comes online; it leaves all ownership with 2 unless I tell I to move.

I am very new to clusters and all I know at this point is this is pretty cool stuff but basically if it is running don’t mess with it is the attitude I have taken with it but there is a significant amount of money tied up in this hardware and we should be able to leverage this as needed, not wonder if it is going to act up again. 

To me it seems for a ‘failover’ cluster it should be way more robust than this...

I can go into way more detail if needed but I didn’t see any other posts on this specific issue no matter what forum I scoured. I’m obviously looking for advice on how to get this resolved as well as advice on whether or not I wired the cluster networks correctly. I am also not sure about what protocols are bound to what nics anymore and what the binding order should be, could this be what is causing my issue?

I have NVSPBIND and NVSPSCRUB on both boxes if needed.

Thanks!

-LW

Failover cluster node - You do not have administrative privileges on the server 'servername' ?

$
0
0
Hi Hello & Good morning Technet's,

I would like to post a question which i really expecting a solution.

I got 2 domain in one single forest.

Domain 1hg.corp
Domain 2iac.corp (iac.corp is a tree domain under hg.corp forest)
Trust : Transitive trust between hg.corp and iac.corp

Domain controller 1dc.hg.corp (for hg.corp)
Domain controller 2iacdc.iac.corp (for iac.corp)

I want to make a Fail-over cluster between this 2 domain controllers ( But both are in different domain literally, but in same forest )

Process Validate cluster in dc.hg.corp
dc.hg.corp can added, but iacdc.iac.corp failed (Error: You do not have administrative privilages on the server 'iacdc')

Process Validate cluster in iacdc.iac.corp
iacdc.iac.corp can added, but dc.hg.corp failed (Error: You do not have administrative privilages on the server 'dc.hg.corp')

Technet please provide me a solution for this issue, So i can reduce server box counts.

Thank you & Have a nice day.
Shamil Mohamed

Remove a hyper-v cluster network

$
0
0

Hi,

I accidently enabled cluster communication on network cards that were intended to be solely for Hyper-V VM guests. I disabled that, but one node is still showing the nic as failed, and the cluster network is still present as well.

ipconfig/all shows the nic as "Media disconnected" (it is not).

I want to get rid of the cluster network. What can I do?

Thanks in advance!

Tim

Server 2012 cluster - virtual machine live migration does not work

$
0
0

Hi,

We have a hyper-v cluster with two nodes running Windows Server 2012. All the configurations are identical.

When I try to make a Live migration from one node to the other I get an error message saying:

Live migration of 'Virtual Machine XXXXXX' failed.

I get no other error messages, not even in event viewer. This same happens with all of our virtual machines.

A normal Quick migration works just fine for all of the virtual machines, so network configuration should not be an issue.

The above error message does not provide much information.

how to delete a failed file share witness

$
0
0

I have a 2 node 2008SP2 (Geographically separated) cluster. I use file and node majority for quorum. There is a file share at third site and both nodes can ping/open the share and the nodes have full controll of the share. Also, there are other cluster/nodes using the same share for quorum. This has worked for several years. After the last round of MS patches the file share witness resource would not come back online for 1 cluster. I tried forcing it online several times but I cancelled after 10 minutes or so. I found that I can add another folder in the original location and reran the qurom wizard, and it came online. I am guessing that maybe the multiple clusters using the same single folder is not good, not sure why it worked for so long though. Anyway, the delete function is gone from the failed file share witness, how can I get rid of the old failed resource? 

 

Thanks

~M

Viewing all 2306 articles
Browse latest View live


<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>