Channel: High Availability (Clustering) forum
Viewing all 2306 articles

Hyper-V 2019 Update-ClusterFunctionalLevel = MAJOR OUTAGE!

Help!!!

We have just finished upgrading all our Hyper-V nodes from 2016 to 2019, mainly because of all the bugs in 2016; we have found 2019 to be much, much better. Each node was evicted, given a full format and reinstall, and then added back into the cluster. There were eight nodes in total, SERVER01-08.

Yesterday we ran the Update-ClusterFunctionalLevel command and all hell broke loose. VMs went offline, blue screens, disk corruption, you name it. It took us hours to get everything back. It looks like everything on SERVER08 had the problem.

We evicted SERVER08 and rebuilt it again.  The problem now is that it won't rejoin the cluster.

Error 0x5b4

The image we are using has been perfect, fully tested and rock solid.  Network connectivity is good, as is the connection to the SAN.  No issues with pinging the cluster name or any of the other nodes.  Running the validation checks comes back all green.

Config and settings on all nodes are 100% identical, as everything is run from scripts, so no human error.

Everything was done by the book, following Microsoft's instructions to the letter.

The functional level is still at 9, which I believe is 2016.
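
For reference, the functional level numbers can be read off a small lookup table. This is only a sketch in Python: the values are the commonly documented ones (they are not taken from this thread), and the names `FUNCTIONAL_LEVELS` and `describe_level` are made up for illustration.

```python
# Commonly documented ClusterFunctionalLevel values.
# Assumption: these come from general documentation, not from this thread.
FUNCTIONAL_LEVELS = {
    8: "Windows Server 2012 R2",
    9: "Windows Server 2016",
    10: "Windows Server 2019",
}


def describe_level(level: int) -> str:
    """Return the OS release for a functional level, or a marker if it is not in the table."""
    return FUNCTIONAL_LEVELS.get(level, "unknown level")


if __name__ == "__main__":
    # A cluster still reporting 9 has not been raised to the 2019 level.
    print(describe_level(9))
```

If the table is right, a value of 9 after running the update command would suggest the level change did not actually complete.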

Any ideas?








Hyper V cluster

Hello. 

I am trying to add a Hyper-V cluster to the Virtual Machine Manager console. However, it says that it cannot connect to the cluster, with error 25300 / ID 13805.

I have other Hyper-V clusters that were added successfully.

Network/IP address wise, neither the successfully added Hyper-V cluster nor the failed cluster is in the same segment as the Virtual Machine Manager console.

Both the successful and the failed Hyper-V clusters can telnet to SCVMM on port 5985, and SCVMM can telnet back to them.

I am not sure whether this is an SCVMM issue or a cluster issue. I would like to find out which settings I might have missed by comparing the settings of the two Hyper-V clusters.

I have posted the same question on the SCVMM thread.

to check high availability of a VM in failover cluster which powershell command should I run

I have a Windows Server 2012 R2 Hyper-V cluster, and I manage the virtual environment using Microsoft SCVMM. I need to find out whether the failover cluster is properly configured to support highly available VMs. Which PowerShell cmdlet would you use?

Test-Cluster

or

Test-ClusterResourceFailure


Multi domain VM of Hyper-v Host in CSV of failover Clustering

My company has one AD DS forest that contains 2 domains. All servers run Windows Server 2012 R2. My company uses iSCSI and Fibre Channel storage. I plan to deploy a single Hyper-V cluster that will use Cluster Shared Volumes (CSV). The cluster must include VMs from both domains. What should I do?

Which option should I follow?

Join each Hyper-V host server to the same AD DS domain

Deploy clustered Storage Spaces

Deploy Serial Attached SCSI (SAS)

Join each Hyper-V host server to different AD DS domains


Hyper-V Clustering failed with Cluster Shared Volume

Hello, All

I deployed and configured a Hyper-V cluster environment with Windows Server 2016 Standard Evaluation edition.

It worked normally, without any sort of issue or problem.

Later, I changed the edition to Datacenter using the dism command.

The weird thing is that an error occurred while changing the primary node's edition; somehow, though, the edition did get changed.

On the second node there were no issues, and it was successfully changed to Datacenter.

After that, on the primary node, all storage kept cycling between Online (No Access) and Pending status.

Eventually, all VMs had to be migrated to the second node.

What I have done so far to try to solve this:

Verified the firewall; all firewalls are disabled now.

Shut down and started up both nodes.

Copied a registry key: there was no Parameters key in the registry under HKLM - System - CurrentControlSet - ClusSvc.

So I copied the one the second node has and pasted it into the primary. After that the cluster service was able to start; before I did this, the cluster service could not start at all.

At this point the Rhs key was also copied and pasted in its entirety. I suspect it shouldn't be the same on both nodes, but I don't know for sure.

Here are screenshots showing the event IDs and messages.

If any of you knows about this or has solved it before, please let me know what I should do.

Thanks.

ReFS for CSV HyperV

I just built a brand new Server 2016 (1607) failover cluster for use as Hyper-V nodes. I am trying to figure out if I should use ReFS or NTFS. Most of the information seems old and points to this webpage as proof that you should not use ReFS:

https://docs.microsoft.com/en-us/windows-server/storage/refs/refs-overview

However, this page states that you can use ReFS for CSV:

The following features are available on ReFS and NTFS:

Functionality: Cluster Shared Volume (CSV) support 
ReFS: Yes  
NTFS: Yes

Any guidance here?


*EDIT*

OK, I am coming to the same conclusion as everyone else: use NTFS. The reason is that ReFS runs in FileSystemRedirected mode. I tested this on my current cluster, and the result matches the information I am reading everywhere.

http://www.itprotoday.com/windows-8/ntfs-or-refs-cluster-shared-volumes-windows-server-2016

I am eager to see this limitation lifted. 




MultiSubnetFailover and cluster parameters

Please clarify whether I would still need to change the HostRecordTTL value if MultiSubnetFailover=True is set in the Additional Connection Parameters when connecting to the server in SSMS 2017.

SQL 2016 Standard is installed on Server 2012 R2.

A cluster with 2 nodes was created across 2 subnets.

The failover cluster validation report shows this warning: "The HostRecordTTL property for network name 'Name: ClusterNAME' is set to 1200 ( 20 minutes). For multi-site clusters the suggested value is 300 (5 minutes)."

I wonder whether I should change the HostRecordTTL value or set MultiSubnetFailover=True. Please advise.

thanks

John

Create one shared volume from three physical LUNs

Hi community,

I have 3 LUNs of 1 TB each. Can I create a single 3 TB shared volume on them?

To my knowledge, a disk must be basic to be added to the cluster, but we cannot merge 3 basic disks, and Failover Clustering doesn't support dynamic disks.

Any help, please?

Regards,


Storage Spaces Direct, server specs for SSDs

Hi All,

I'm looking to build an R&D VDI platform across two nodes using local disks.

I'm planning on buying two servers, each with 4 x 1.92 TB 6 Gbps SATA SSDs. My research tells me this:

2 servers, meaning 2-way mirror

all SSDs, so no caching required

auto-calculated reserve space

usable capacity = 6.9 TB

file share witness hosted away from the cluster
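
As a sanity check on the capacity figure, here is a rough Python sketch of the arithmetic. It assumes a plain two-way mirror (two copies of everything) and ignores any reserve capacity, and it guesses that the 6.9 figure is the binary (TiB) view of 7.68 decimal TB. The function name `mirror_usable_bytes` is only for illustration.

```python
TB = 10**12   # decimal terabyte, as printed on drive labels
TIB = 2**40   # binary tebibyte, the unit Windows typically reports in


def mirror_usable_bytes(nodes: int, drives_per_node: int,
                        drive_tb: float, copies: int = 2) -> float:
    """Raw pool size divided by the number of mirror copies (no reserve subtracted)."""
    raw = nodes * drives_per_node * drive_tb * TB
    return raw / copies


if __name__ == "__main__":
    usable = mirror_usable_bytes(nodes=2, drives_per_node=4, drive_tb=1.92)
    # Show the same number in both unit systems.
    print(f"{usable / TB:.2f} TB, {usable / TIB:.2f} TiB")
```

Under those assumptions the two servers give 15.36 TB raw and about 7.68 TB mirrored, which is just under 7 TiB; holding back reserve space would lower that further.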

This is the first time I've looked into Storage Spaces Direct, as I've always gone the traditional route of Compellent SANs. My servers have an HBA330 card, which is needed for this technology (i.e., no RAID at the hardware level). I'm confused right from the off about installing Windows on each server. Usually I go with 2 x SSD in RAID 1 for the OS, then map my iSCSI targets for the storage. How do I set up the disks so I can get Windows installed before installing the roles to support storage? Is it simply a case of speccing the server with, say, 2 x 250 GB NVMe (RAID 1) on its own controller card?

I'm going with two network cards. The first will give me dual 25 Gbps for storage (dedicated fibre switch for storage only), and the second is dual 40 Gbps to the LAN. We have plenty of ports available on our fibre core switch, so we might as well make use of it all. Does this sound like a good idea, or should I look into swapping the disks for 12 Gbps SAS ones and upgrading the storage network card from 25 Gbps to 40 Gbps?

The two nodes will also be running Hyper-V failover clustering so we can live migrate critical desktop VMs (although not all will need to fail over).

Also, when I add a third (and maybe fourth) server, can I change to a 3-way mirror on the fly?

Thanks!!













How can we move the Quorum Disk from Node1 to Node2 ? - Windows 2012 R2 - Hyper-V Clustering

Hello,

We have created a cluster with 2 nodes and created a role for a file share. There are 3 disks in total in the cluster, and we have allocated one of the three as the quorum disk.

When Node1 is powered off, all 3 disks move to Node2 automatically. But I would like to know how we can move the quorum disk from Node1 to Node2 while both nodes are active.

We can move the other 2 disks from Node1 to Node2 while both nodes are powered on, but I am unable to move the quorum disk from Node1 to Node2 using the same option (right-click on the disk -> Move -> Select Node).

Kindly advise on this!

Thanks & Regards,

Anoop Nair.



Performance Issue on Storage Space Direct Server 2019 - Getting high read and write Latency

Hello All,

I am seeing a performance issue on S2D, with high read and write latency. Over the last few days it has gotten worse: IOPS are no longer constant. In one second IOPS reach the thousands and in the next they drop to the hundreds, and the same thing is happening with read and write throughput. Earlier we also had performance issues, but IOPS were at least constant. In Admin Center this shows up as peaks in IOPS and throughput, and because of this the hosted VPSes hang and run slowly.

I have configured S2D with 4 nodes, using NVMe for caching and SSD for storage, as below:

Node 1 : 1x250 NVMe, 3x1TB SSD, no Hyper-V role

Node 2 : 1x500 NVMe, 3x1TB SSD, no Hyper-V role

Node 3 : 2x250 NVMe, 4x1TB SSD, has the Hyper-V role

Node 4 : 2x250 NVMe, 4x1TB SSD, no Hyper-V role

Node 5,6,7 : no SSD or NVMe for storage, Hyper-V role only

All servers are connected with 10 Gb Ethernet and use CSV to store the VM files.

Please suggest how to resolve the issue.

Cluster network name resource failed to find the associated computer object in Active Directory.

We have set up a cluster on Windows Server 2016. Initial validation succeeded. However, I moved the computer object generated by the cluster in Active Directory from its default location to the AD Computers OU, and I am now seeing this error:

"Cluster network name resource failed to find the associated computer object in Active Directory. This may impact functionality that is dependent on Cluster network name authentication.

Network Name: Cluster Name
Organizational Unit: OU=Windows DSC,DC=XXXXXXX,DC=Local"

Guidance:

Restore the computer object for the network name from the Active Directory recycle bin.

(domain blanked for security reasons) 

Log Name:      System
Source:        Microsoft-Windows-FailoverClustering
Date:          7/06/2019 12:51:57 PM
Event ID:      1685
Task Category: Network Name Resource
Level:         Error
Keywords:      
User:          SYSTEM
Computer:      XXXXXXXXX.XXXXXXX.Local
Description:
Cluster network name resource failed to find the associated computer object in Active Directory. This may impact functionality that is dependent on Cluster network name authentication.

Network Name: Cluster Name
Organizational Unit: OU=Windows DSC,DC=XXXXXXX,DC=Local

Guidance:

Restore the computer object for the network name from the Active Directory recycle bin.
Event Xml:
<Event xmlns="http://schemas.microsoft.com/win/2004/08/events/event">
  <System>
    <Provider Name="Microsoft-Windows-FailoverClustering" Guid="{BAF908EA-3421-4CA9-9B84-6689B8C6F85F}" />
    <EventID>1685</EventID>
    <Version>0</Version>
    <Level>2</Level>
    <Task>19</Task>
    <Opcode>0</Opcode>
    <Keywords>0x8000000000000000</Keywords>
    <TimeCreated SystemTime="2019-06-07T02:51:57.836490300Z" />
    <EventRecordID>10392</EventRecordID>
    <Correlation ActivityID="{C0BF5C0C-E484-4BDC-A006-D7B5895DE02C}" />
    <Execution ProcessID="4572" ThreadID="7176" />
    <Channel>System</Channel>
    <Computer>XXXXXXX.XXXXXX.Local</Computer>
    <Security UserID="S-1-5-18" />
  </System>
  <EventData>
    <Data Name="ResourceName">Cluster Name</Data>
    <Data Name="OrganizationalUnit">OU=Windows DSC,DC=XXXXXX,DC=Local</Data>
  </EventData>
</Event>

It was fine until I moved it to the Computers OU. My question is: does it need to be in its default location to work?

Hyper-V 2012 R2 - Cannot bring CSV online

All -

Thanks for making this great forum available to us!

I have a 2-node Windows Server 2012 R2 Hyper-V Failover Cluster.  Node 2 of the cluster experienced an unexpected power outage.  After the outage, the two Cluster Shared Volumes that are part of the Failover Cluster will not come online.

After looking at the Event Logs and Cluster logs, I believe the issue is that the Cluster cannot locate the CNO since Active Directory is fully virtualized and (obviously) offline.

Is there a way to force these CSVs to come back online without an AD to talk to?  I read that the AD dependency was removed in 2012.  Or, based on the information given, could this be caused by something else?

I'll be glad to run any commands and provide output back to the thread.

Thanks for your help!


Syst3m32 https://www.sysadminsoup.com

Adding a Node with a Newer OS Version Than the Existing Cluster

Hi,

I have to add a new node running Server 2016 to a cluster running Server 2012.

Can I add it using the Failover Cluster tools from 2012, or do I have to use the Failover Cluster tools from 2016?

Thank you.

Votes in SQL Server Always On Availability Groups based on WSFC

Hi,

In my lab environment I have 3 SQL Servers participating in an Always On AG based on WSFC.

I set up a file share witness, so I have 4 potential votes.

From what I understood, if there are 3 nodes and 1 file share, the file share's vote is not taken into account; only if one of the nodes fails will the cluster reorganize the voting and give 1 vote to the file share and 1 vote to each of the 2 nodes that are still alive.

My question is this: when I opened the Cluster Quorum Information in the AO AG Dashboard (in SSMS), I saw 4 members, 3 nodes and 1 file share witness, but under the vote count all 4 members had 1 vote. That makes an even number of votes, which does not make sense to me. How is it that there is an odd number of nodes and the file share vote is still counted?

Illustration picture (imagine that there is a "file share witness" with a "vote count" of 1, in addition to those 3 members):
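
To make the expectation concrete, here is a simplified Python model of the dynamic witness rule as I understand it: the witness only gets a vote when the number of voting nodes is even. It assumes a cluster version that supports dynamic witness (2012 R2 or later), and the function names are hypothetical; it does not model what the SSMS dashboard chooses to display.

```python
from typing import Tuple


def witness_votes(node_votes: int) -> int:
    """Simplified dynamic witness rule: the witness votes only when node votes are even."""
    return 1 if node_votes % 2 == 0 else 0


def quorum_state(voting_nodes: int) -> Tuple[int, int]:
    """Return (total votes, votes needed for a majority) for the given number of voting nodes."""
    total = voting_nodes + witness_votes(voting_nodes)
    return total, total // 2 + 1


if __name__ == "__main__":
    # All 3 nodes up, then one node lost.
    for nodes in (3, 2):
        total, needed = quorum_state(nodes)
        print(f"{nodes} voting nodes -> {total} total votes, {needed} needed")
```

Under this model the total stays odd in both cases (3 votes with 3 nodes, 3 votes with 2 nodes plus the witness), which is why a dashboard showing 4 votes looks surprising.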


Cluster

Dear Friends,

I have configured the cluster successfully, but when I add a disk in Failover Cluster Manager I get the error below.

[Window Title]
Information

[Content]
No disks suitable for cluster disks were found.  For diagnostic information about disks available to the cluster, use the Validate a Configuration Wizard to run Storage tests.

[OK]


ITandIT

Alternatives to Guest Clustering?

Hello,

We currently use guest clustering for a file server. It's great when we don't have to do maintenance on it, but when we have to perform maintenance such as live storage migrations or expanding storage, it's not so great. Does anyone do anything differently for file servers? Currently this is being used to serve up a share for IIS Shared Configuration and other applications. We thought SOFS might be able to help, but it also depends on having shared disks.

Always on Cluster Error

Hi;

I had a 3-node Always On cluster. The DRC node had a problem, so I evicted it. I then joined the DRC node to the domain again, and after that I was able to add it back to the Windows cluster. However, the cluster disks cannot come online on the DRC server, and the cluster disk resources cannot come online either.

Thanks

Cluster resource 'Virtual Machine VMNAME' of type 'Virtual Machine' in clustered role 'VMNAME' failed.

Hello!

I have a Hyper-V failover cluster with 3 nodes.

NODE1: Windows SRV2016

NODE2: Windows SRV2016

NODE3: Windows SRV2019

There are 30 VMs in the failover cluster. I can live migrate all of them to every node, except for one VM. That VM can be live migrated from NODE2 to NODE1 and from NODE1 to NODE2, but I can't move it from NODE1 or NODE2 to NODE3, and I get the following error:

Event id: 1069

Cluster resource 'Virtual Machine VMNAME' of type 'Virtual Machine' in clustered role 'VMNAME' failed.

Based on the failure policies for the resource and role, the cluster service may try to bring the resource online on this node or move the group to another node of the cluster and then restart it.  Check the resource and group state using Failover Cluster Manager or the Get-ClusterResource Windows PowerShell cmdlet.

Event id: 1205

The Cluster service failed to bring clustered role 'VMNAME' completely online or offline. One or more resources may be in a failed state. This may impact the availability of the clustered role.

What could be the problem?

Thank You.

ADAM (AD LDS) sync issue

Dear All

I am facing an issue syncing an ADAM server. The sync was started and has been running for 1 day now. LDP.exe also shows the state as "running". The log size has grown to around 12 GB. My question is: have you faced this issue, and is it normal for the sync to take this long with a log of this size? Another thing: using LDP I tried to find some users, but no records have been found so far. I do not know whether it will return results once the sync is complete, or whether it is not bringing anything over at all and is just running and expanding the log while getting nothing from AD.
