RESET Forums (homeservershow.com)

SBS2011 Freezes during client backups


tojoski

jmwills

They are open Mon - Sat, but I wasn't available over the weekend to do any troubleshooting.

 

The overnight memtest I ran last night came out squeaky clean, 14 passes with no issues.

 

I did find some interesting errors in the event log which I initially thought were unrelated, but I researched them and fixed them and so far the backups are working again.

 

The errors I found were:

 

In the Application log:

Log Name:		 Application
Source:		 VSS
Date:			 3/6/2012 1:33:48 PM
Event ID:		 8193
Task Category: None
Level:		   Error
Keywords:		 Classic
User:			 N/A
Computer:		 SERVER.AUDIOEXPRESS.local
Description:
Volume Shadow Copy Service error: Unexpected error calling routine RegOpenKeyExW(-2147483646,SYSTEM\CurrentControlSet\Services\VSS\Diag,...).  hr = 0x80070005, Access is denied.

 

This was corrected by granting the "Network Service" account Full Control permission on the registry key "HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\services\VSS".
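
For anyone reading the numbers in that VSS error, here is a small Python sketch that decodes them. It only uses the standard HRESULT bit layout and the two values from the event text; the function names are made up for illustration and are not part of any Windows tool.

```python
# Decode the numeric values that appear in the VSS event 8193 description.
# Helper names here are hypothetical, chosen only for this illustration.

HKEY_LOCAL_MACHINE = 0x80000002   # predefined registry root handle
FACILITY_WIN32 = 7                # HRESULT facility for wrapped Win32 errors
ERROR_ACCESS_DENIED = 5           # Win32 error code 5


def to_unsigned32(value: int) -> int:
    """Reinterpret a signed 32-bit integer as unsigned."""
    return value & 0xFFFFFFFF


def decode_hresult(hr: int) -> dict:
    """Split an HRESULT into its severity bit, facility, and error code."""
    hr = to_unsigned32(hr)
    return {
        "failure": bool(hr >> 31),        # top bit set means failure
        "facility": (hr >> 16) & 0x7FF,   # 11-bit facility field
        "code": hr & 0xFFFF,              # low 16 bits are the error code
    }


root = to_unsigned32(-2147483646)     # first argument to RegOpenKeyExW
fields = decode_hresult(0x80070005)   # hr value from the event

print(hex(root), root == HKEY_LOCAL_MACHINE)
print(fields)
```

So the first argument is just HKEY_LOCAL_MACHINE printed as a signed integer, and the hr value is a Win32 "access denied" (code 5) wrapped as an HRESULT, which fits a permissions problem on that key.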

 

Then there was this in the System log:

 

Log Name:	  System
Source:		Microsoft-Windows-WinRM
Date:		  3/6/2012 1:33:48 PM
Event ID:	  10154
Task Category: None
Level:		 Warning
Keywords:	  Classic
User:		  N/A
Computer:	  SERVER.AUDIOEXPRESS.local
Description:
The WinRM service failed to create the following SPNs: WSMAN/SERVER.AUDIOEXPRESS.local; WSMAN/SERVER.

 

This was corrected by adding the "Validated Write to Service Principal Name" permission for the "Network Service" account to the server's computer account in Active Directory.

 

I'm cautiously optimistic at this point..

 

Seems like I do remember that from a couple of years ago, but in my case they were just failing, or timing out.

 

Here is a good resource for SBS:

 

http://blog.mpecsinc.ca/

jmwills

If I remember correctly, there is an update that breaks that permission. I found it on Susan Bradley's site.

Link to post
Share on other sites

Well, it turned out to be a short-lived victory. It froze again about 30% into the 3rd client backup.

 

Yesterday evening before I left I added a 2TB drive (attached to one of the extra ports on the RAID controller) and moved the client backups over to it. At the same time I re-enabled the 2nd NIC and disabled the one we had been using prior.

 

I remoted into it last night and was able to do a manual backup of all the machines, and then at 2am each client did another backup without issue.

 

I woke back up at 4:30am and checked on it and it was still ok, but apparently shortly after they got there at about 8:30 it froze and they had to restart it.

 

So it's looking more and more like it's not really an issue with the backups at all. More likely the stress that the backup process puts on the server is what causes it to freeze.

 

Looking at the event viewer, the server was restarted at 9:07am, and the last thing in the log before that was at 8:17am, and that was only an informational event:

Log Name:      Application
Source:        MSSQL$SQLEXPRESS
Date:          3/7/2012 8:17:54 AM
Event ID:      17137
Task Category: Server
Level:         Information
Keywords:      Classic
User:          SYSTEM
Computer:      SERVER.AUDIOEXPRESS.local
Description:
Starting up database 'ReportServer$SQLEXPRESSTempDB'.

So at this point the event viewer really isn't all that helpful.

My gut says this is a RAID controller / disk issue, but if that's the case I would have thought that it would have frozen while writing to the 2TB drive as well, as it was also attached to that controller.

I can also see the SMART data for the disks from the RAID controller's interface, and everything looks peachy there:

Disk 1

Device Type  SATA(5001B4D419635010)
Device Location  Enclosure#1 Slot#1
Model Name  WDC WD5003ABYX-01WERA0
Serial Number  WD-WMAYP1405383
Firmware Rev.  01.01S01
Disk Capacity  500.1GB
Current SATA Mode  SATA300+NCQ(Depth32)
Supported SATA Mode  SATA300+NCQ(Depth32)
Disk APM Support  Yes
Device State  Normal
Timeout Count  0
Media Error Count  0
Device Temperature  30 ºC
SMART Read Error Rate  200(51)
SMART Spinup Time  139(21)
SMART Reallocation Count  200(140)
SMART Seek Error Rate  200(0)
SMART Spinup Retries  100(0)
SMART Calibration Retries  100(0)

 

Disk 2

Device Type  SATA(5001B4D419635011)
Device Location  Enclosure#1 Slot#2
Model Name  WDC WD5003ABYX-01WERA0
Serial Number  WD-WMAYP1315290
Firmware Rev.  01.01S01
Disk Capacity  500.1GB
Current SATA Mode  SATA300+NCQ(Depth32)
Supported SATA Mode  SATA300+NCQ(Depth32)
Disk APM Support  Yes
Device State  Normal
Timeout Count  0
Media Error Count  0
Device Temperature  31 ºC
SMART Read Error Rate  200(51)
SMART Spinup Time  141(21)
SMART Reallocation Count  200(140)
SMART Seek Error Rate  200(0)
SMART Spinup Retries  100(0)
SMART Calibration Retries  100(0)

 

Disk 3

Device Type  SATA(5001B4D419635012)
Device Location  Enclosure#1 Slot#3
Model Name  WDC WD5003ABYX-01WERA0
Serial Number  WD-WMAYP1313026
Firmware Rev.  01.01S01
Disk Capacity  500.1GB
Current SATA Mode  SATA300+NCQ(Depth32)
Supported SATA Mode  SATA300+NCQ(Depth32)
Disk APM Support  Yes
Device State  Normal
Timeout Count  0
Media Error Count  0
Device Temperature  31 ºC
SMART Read Error Rate  200(51)
SMART Spinup Time  144(21)
SMART Reallocation Count  200(140)
SMART Seek Error Rate  200(0)
SMART Spinup Retries  100(0)
SMART Calibration Retries  100(0)

 

Disk 4

Device Type  SATA(5001B4D419635013)
Device Location  Enclosure#1 Slot#4
Model Name  WDC WD5003ABYX-01WERA0
Serial Number  WD-WMAYP1304942
Firmware Rev.  01.01S01
Disk Capacity  500.1GB
Current SATA Mode  SATA300+NCQ(Depth32)
Supported SATA Mode  SATA300+NCQ(Depth32)
Disk APM Support  Yes
Device State  Normal
Timeout Count  0
Media Error Count  0
Device Temperature  31 ºC
SMART Read Error Rate  200(51)
SMART Spinup Time  142(21)
SMART Reallocation Count  200(140)
SMART Seek Error Rate  200(0)
SMART Spinup Retries  100(0)
SMART Calibration Retries  100(0)

 

Disk 5 (Hot Spare)

Device Type  SATA(5001B4D419635014)
Device Location  Enclosure#1 Slot#5
Model Name  WDC WD5003ABYX-01WERA0
Serial Number  WD-WMAYP1315115
Firmware Rev.  01.01S01
Disk Capacity  500.1GB
Current SATA Mode  SATA300+NCQ(Depth32)
Supported SATA Mode  SATA300+NCQ(Depth32)
Disk APM Support  Yes
Device State  Normal
Timeout Count  0
Media Error Count  0
Device Temperature  31 ºC
SMART Read Error Rate  100(51)
SMART Spinup Time  142(21)
SMART Reallocation Count  200(140)
SMART Seek Error Rate  200(0)
SMART Spinup Retries  100(0)
SMART Calibration Retries  100(0)
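
If anyone wants to check dumps like these without eyeballing them, here is a rough Python sketch. It assumes the controller prints each attribute as a normalized value with the failure threshold in parentheses, and that an attribute counts as failing once the value drops to or below the threshold. That is the usual SMART convention, but I have not confirmed it for this controller's interface. The function names are just for illustration.

```python
import re

# Matches lines such as "SMART Read Error Rate  100(51)".
# Assumed format: normalized value followed by (threshold).
SMART_LINE = re.compile(
    r"^SMART (?P<name>.+?)\s+(?P<value>\d+)\((?P<threshold>\d+)\)$"
)


def parse_smart(report: str) -> dict:
    """Return {attribute name: (value, threshold)} for each SMART line."""
    attrs = {}
    for line in report.splitlines():
        match = SMART_LINE.match(line.strip())
        if match:
            attrs[match["name"]] = (int(match["value"]), int(match["threshold"]))
    return attrs


def failing(attrs: dict) -> list:
    """List attributes whose value is at or below the threshold."""
    return [name for name, (value, threshold) in attrs.items() if value <= threshold]


# SMART lines for Disk 5 (Hot Spare), copied from the dump above.
DISK5 = """\
SMART Read Error Rate  100(51)
SMART Spinup Time  142(21)
SMART Reallocation Count  200(140)
SMART Seek Error Rate  200(0)
SMART Spinup Retries  100(0)
SMART Calibration Retries  100(0)
"""

attrs = parse_smart(DISK5)
print(attrs["Read Error Rate"])   # (100, 51)
print(failing(attrs))             # []
```

Under that assumption nothing on the hot spare is failing. Its Read Error Rate value is 100 where the other four disks show 200, but it is still well above the threshold of 51.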

 

I think the next step might be to do a restore back to a single hard drive attached directly to the motherboard. At this point I'm up for any ideas.

 

Thanks guys

Joe_Miner

My apologies if this has been covered before:

  • Have you looked at overheating of the CPU?
    • Air filters plugged?
    • Vents dusty/plugged?
  • Have you checked into power quality issues?
    • Underpowered PSU?
    • Failing UPS, or no UPS and inadequate circuit protection/filtering?
    • Low/high voltage?
    • What new devices have been added to the circuit that feeds your server?
      • Anything with rotating machines will add LOTS of harmonics to your circuit
      • Fluorescent and energy-saving lights?
  • Grounding issues? Floating grounds?

Edited by Joe_Miner
Link to post
Share on other sites

I'm here now working on it, and I'm now convinced that it's the RAID controller after reading similar stories around the net about this exact controller.

Temps are fine. In any case, I should know for sure about the controller in a few minutes.
