SQL Cluster node1 Failing

  • Hi All,

    Thanks in advance guys, hopefully somebody can help me, well here's the picture,

    1.- Two nodes (Active, Passive).

    2.- Windows Server 2003 E.E.

    3.- SQL 2000 SP4 EE.

    Well basically last week i received an alert from my instance was down and i found the following entries in the windows event viewer,

    Event Type: Warning

    Event Source: ClusSvc

    Event Category:Node Mgr

    Event ID: 1096

    Date: 7/16/2012

    Time: 5:45:51 PM

    User: N/A

    Computer: CLU02

    Description:

    Cluster service cannot use network adapter Broadcom NetXtreme Gigabit Ethernet because it does not have a valid IP address assigned to it.

    Event Type: Warning

    Event Source: ClusSvc

    Event Category: Node Mgr

    Event ID: 1124

    Date: 7/16/2012

    Time: 5:45:52 PM

    User: N/A

    Computer: CLU02

    Description:

    The node determined that its interface to network 'Local Area Connection' failed.

    Event Type: Error

    Event Source: ClusSvc

    Event Category: Resource Monitor

    Event ID: 1145

    Date: 7/16/2012

    Time: 5:50:08 PM

    User: N/A

    Computer: CLU01

    Description:

    Cluster resource SQL Server timed out. If the pending timeout is too short for this resource, consider increasing the pending timeout value.

    Event Type: Warning

    Event Source: Dhcp

    Event Category: None

    Event ID: 1007

    Date: 7/16/2012

    Time: 5:45:51 PM

    User: N/A

    Computer: CLU02

    Description:

    Your computer has automatically configured the IP address for the Network Card with network address 001143D930DA. The IP address being used is XXX.XXX.XXX.XXX.

    Event Type: Error

    Event Source: W32Time

    Event Category: None

    Event ID: 29

    Date: 7/16/2012

    Time: 5:45:51 PM

    User: N/A

    Computer: CLU02

    Description:

    The time provider NtpClient is configured to acquire time from one or more time sources, however none of the sources are currently accessible. No attempt to contact a source will be made for 1 minutes. NtpClient has no source of accurate time.

    Then after my Node2 (CLU02) was down and the SQL Group was in the Node1(CLU01) but down also. I noticed both Nic's with errors.

    I contacted the windows guys and they fixed the Nic's, and tada!!!! the Node2(CLU02) was able to bring online the SQL Group. But the Node1 still failing when i try to do a failover.

    Event Type: Error

    Event Source: MSSQLSERVER

    Event Category: (3)

    Event ID: 19019

    Date: 7/20/2012

    Time: 12:47:09 PM

    User: N/A

    Computer: CLU01

    Description:

    The description for Event ID ( 19019 ) in Source ( MSSQLSERVER ) cannot be found. The local computer may not have the necessary registry information or message DLL files to display messages from a remote computer. You may be able to use the /AUXSOURCE= flag to retrieve this description; see Help and Support for details. The following information is part of the event: [sqsrvres] ODBC sqldriverconnect failed.

    I hope anybody can point me in the right way, thanks all.

    Regards.

  • I just found a couple entries in my cluster error log i think it's related to disk failures.....not quite sure.

    334:8f8.07/20[11:57:31.314](000334) INFO [FM] FmpRmOfflineResource: RmOffline() for a13f3122-5c69-46ed-9b8a-f453754fe090 returned error 997

    334:8f8.07/20[11:57:31.314](035307) INFO [FM] FmpRmOfflineResource: RmOffline() for 31d434b7-f555-4d8e-acc8-5a4af1d934f2 returned error 997

    334:8f8.07/20[11:57:31.314](035308) INFO [FM] FmpRmOfflineResource: RmOffline() for 814e2fc7-0c66-421d-a5a4-91db19b059f9 returned error 997

    334:8f8.07/20[11:57:31.330](035309) INFO [FM] FmpRmOfflineResource: RmOffline() for cd04ffe4-cec5-4e7c-8b7a-f60bbded2f7d returned error 997

    3c4:73c.07/20[11:57:31.330](035310) WARN Network Name <SQL Network Name(SQLCLU)>: Failed to delete server name MIASQLCLU, status 2114.

    3c4:73c.07/20[11:57:31.330](035310) WARN Network Name <SQL Network Name(SQLCLU)>: Failed to delete server name MIASQLCLU, status 2114.

    3c4:768.07/20[11:57:31.345](035318) ERR IP Address <Cluster IP Address>: WorkerThread: GetClusterNotify failed with status 6.

    3c4:d70.07/20[11:57:35.392](035321) INFO Physical Disk <G Disk>: DiskCleanup returning final error 0

    3c4:17c.07/20[11:57:35.392](035321) INFO Physical Disk <J Disk>: DiskCleanup returning final error 0

    3c4:17c.07/20[11:57:35.392](035321) INFO Physical Disk <J Disk>: Offline, Returning final error 0.

    3c4:d70.07/20[11:57:35.392](035323) INFO Physical Disk <G Disk>: Offline, Returning final error 0.

    3c4:66c.07/20[11:57:37.392](035325) INFO Physical Disk <I Disk>: DiskCleanup returning final error 0

    3c4:66c.07/20[11:57:37.392](035325) INFO Physical Disk <I Disk>: Offline, Returning final error 0.

    3c4:3a8.07/20[12:03:56.309](035415) WARN Physical Disk <Disk Q:>: [DiskArb] Assume ownership of the device.

    3c4:814.07/20[12:03:56.465](035418) ERR Physical Disk <Disk Q:>: DisksMountDrives: error creating default share Q$. Error: 2114.

    3c4:814.07/20[12:03:56.465](035419) INFO Physical Disk <Disk Q:>: Online, returning final error 0 ResourceState 2 Valid 1

    334:4dc.07/20[12:03:56.465](035419) INFO [Qfs] QfsFindFirstFile Q:\MSCS\ => ffffffff, error 2

    334:4dc.07/20[12:03:56.465](035419) INFO [DM] DmpQuoObjNotifyCb: FindFirstFile on path Q:\MSCS\ failed, Error=2 !!!

    3c4:754.07/20[12:03:57.122](035430) WARN [ClNet] Tcpip is not bound to adapter 89D0A5FB-3E0A-472F-A15A-1AC3EA0D581F.

    3c4:754.07/20[12:03:57.122](035430) WARN [ClNet] Tcpip is not bound to adapter 4FB682B4-3CFA-4F65-926D-1E33F52F9F91.

    3c4:754.07/20[12:03:57.137](035430) WARN [ClNet] Tcpip is not bound to adapter 04947411-B047-467A-B824-AEED05ABFA3D.

    3c4:754.07/20[12:03:57.137](035430) WARN [ClNet] Tcpip is not bound to adapter CF36BD27-DE0B-41D4-9571-B4172D0CAC56.

    3c4:754.07/20[12:03:57.153](035430) WARN [ClNet] Tcpip is not bound to adapter 7361038C-017E-4C2B-A477-273121965DED.

    3c4:754.07/20[12:03:57.153](035430) WARN [ClNet] Tcpip is not bound to adapter 892ABEB4-1211-48CD-8193-E7BF70698EF5.

    3c4:754.07/20[12:03:57.153](035430) WARN [ClNet] Tcpip is not bound to adapter AD9BE1CA-531A-41C2-AADA-4F0487B62C59.

    3c4:e78.07/20[12:03:59.653](035434) WARN Network Name <SQLCLU>: Unable to read ResourceData parameter, error=2

    3c4:e78.07/20[12:03:59.653](035434) WARN Network Name <SQLCLU>: Unable to read CreatingDC parameter, error=2

    3c4:e78.07/20[12:03:59.809](035436) WARN [ClNet] Tcpip is not bound to adapter 89D0A5FB-3E0A-472F-A15A-1AC3EA0D581F.

    3c4:e78.07/20[12:03:59.809](035436) WARN [ClNet] Tcpip is not bound to adapter 4FB682B4-3CFA-4F65-926D-1E33F52F9F91.

    3c4:e78.07/20[12:03:59.809](035436) WARN [ClNet] Tcpip is not bound to adapter 04947411-B047-467A-B824-AEED05ABFA3D.

    3c4:e78.07/20[12:03:59.825](035436) WARN [ClNet] Tcpip is not bound to adapter CF36BD27-DE0B-41D4-9571-B4172D0CAC56.

    3c4:e78.07/20[12:03:59.825](035436) WARN [ClNet] Tcpip is not bound to adapter 7361038C-017E-4C2B-A477-273121965DED.

    3c4:e78.07/20[12:03:59.825](035436) WARN [ClNet] Tcpip is not bound to adapter 892ABEB4-1211-48CD-8193-E7BF70698EF5.

    3c4:e78.07/20[12:03:59.840](035436) WARN [ClNet] Tcpip is not bound to adapter AD9BE1CA-531A-41C2-AADA-4F0487B62C59.

    3c4:e78.07/20[12:03:59.997](035439) WARN Network Name <SQLCLU>: Unable to enumerate server tranports, error 2114.

    3c4:e78.07/20[12:03:59.997](035439) WARN Network Name <SQLCLU>: Unable to verify that server name SQLCLU does not already exist.

    3c4:e78.07/20[12:03:59.997](035439) WARN Network Name <SQLCLU>: Failed to delete server name SQLCLU, status 2114.

Viewing 2 posts - 1 through 1 (of 1 total)

You must be logged in to reply to this topic. Login to reply