Chkdsk.exe infinite loop revisited...

  • Question

  • Summary: SMART reports 0 bad sectors, but chkdsk /r disagrees.


    Searching around turned up some similar threads, but none had a conclusive answer, and I believe my circumstances may provide a bit more insight into this relatively rare problem.

    My setup is WHS with Power Pack 3, with no add-ins installed except Diskeeper. The storage pool has 3 Hitachi Deskstar 2TB drives. Shortly after I installed the server and populated it with data, chkdsk began going into an endless loop at the index verification stage (stage 2 of 3) whenever it ran against the drives' mount points as part of the automatically scheduled nightly maintenance around midnight. The output is below:

     

    The type of the file system is NTFS.

    Volume label is DATA.



    WARNING! F parameter not specified.

    Running CHKDSK in read-only mode.



    CHKDSK is verifying files (stage 1 of 3)...

    0 percent complete. (0 of 6032 file records processed)

    1 percent complete. (604 of 6032 file records processed)

    2 percent complete. (1207 of 6032 file records processed)

    3 percent complete. (1810 of 6032 file records processed)

    4 percent complete. (2413 of 6032 file records processed)

    5 percent complete. (3016 of 6032 file records processed)

    6 percent complete. (3620 of 6032 file records processed)

    7 percent complete. (4223 of 6032 file records processed)

    8 percent complete. (4826 of 6032 file records processed)

    9 percent complete. (5429 of 6032 file records processed)

    10 percent complete. (6032 of 6032 file records processed)

    6032 file records processed.

    File verification completed.

    0 large file records processed.

    0 bad file records processed.

    0 EA records processed.

    0 reparse records processed.

    CHKDSK is verifying indexes (stage 2 of 3)...

    11 percent complete. (290 of 23891 index entries processed)

    12 percent complete. (577 of 23891 index entries processed)

    13 percent complete. (855 of 23891 index entries processed)

    14 percent complete. (1137 of 23891 index entries processed)

    15 percent complete. (1424 of 23891 index entries processed)

    16 percent complete. (1705 of 23891 index entries processed)

    ...



    2452 percent complete. (456213 of 23891 index entries processed)

    Because the maintenance chkdsk writes a log, the loop continues until the log fills all free space on C:. This ONLY happens on 1 of the 3 identical Hitachi drives.
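For anyone wanting to catch this before C: fills up, here is a rough sketch of how the runaway condition could be spotted in a saved chkdsk log. It assumes the progress lines look exactly like the ones quoted above; the function name `find_runaway` and the sample list are made up for illustration, and the sketch says nothing about where WHS actually stores its maintenance log.

```python
import re

# Matches progress lines in the format quoted in this post, e.g.
# "16 percent complete. (1705 of 23891 index entries processed)"
PROGRESS = re.compile(
    r"(\d+) percent complete\. "
    r"\((\d+) of (\d+) (?:file records|index entries) processed\)"
)


def find_runaway(lines):
    """Return the first progress entry that overshoots its total, or None.

    Hypothetical helper: a healthy run never reports more than 100 percent
    or more processed entries than the stated total.
    """
    for line in lines:
        match = PROGRESS.search(line)
        if match is None:
            continue  # not a progress line
        percent, done, total = (int(g) for g in match.groups())
        if percent > 100 or done > total:
            return {"percent": percent, "done": done, "total": total}
    return None


# Two lines copied from the output above.
sample = [
    "16 percent complete. (1705 of 23891 index entries processed)",
    "2452 percent complete. (456213 of 23891 index entries processed)",
]
print(find_runaway(sample))
```

A check like this could run against the log periodically and flag the loop as soon as the count passes the total, rather than after the disk is full.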

    My initial thought was that the drive was going bad, so I migrated all the data off of it without problems, took it out of the pool, and formatted and mounted it as a standalone NTFS primary partition. Then I ran chkdsk /r on it. This turned up a ton of bad sectors, covering about 50% of the drive. Wanting a second opinion, I ran Hitachi's own drive diagnostics tool and used smartmontools to read the SMART info. Both reported 0 bad sectors and a 100% healthy drive. Below is the SMART output:

     

    SMART Attributes Data Structure revision number: 16
    Vendor Specific SMART Attributes with Thresholds:
    ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
    1 Raw_Read_Error_Rate 0x000b 100 100 016 Pre-fail Always - 0
    2 Throughput_Performance 0x0005 133 133 054 Pre-fail Offline - 101
    3 Spin_Up_Time 0x0007 118 118 024 Pre-fail Always - 613 (Average 612)
    4 Start_Stop_Count 0x0012 100 100 000 Old_age Always - 34
    5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail Always - 0
    7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail Always - 0
    8 Seek_Time_Performance 0x0005 112 112 020 Pre-fail Offline - 39
    9 Power_On_Hours 0x0012 100 100 000 Old_age Always - 311
    10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail Always - 0
    12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 33
    192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 36
    193 Load_Cycle_Count 0x0012 100 100 000 Old_age Always - 36
    194 Temperature_Celsius 0x0002 166 166 000 Old_age Always - 36 (Lifetime Min/Max 24/42)
    196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 0
    197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 0
    198 Offline_Uncorrectable 0x0008 100 100 000 Old_age Offline - 0
    199 UDMA_CRC_Error_Count 0x000a 200 200 000 Old_age Always - 0

    SMART Error Log Version: 0
    No Errors Logged

    Either chkdsk or the SMART readout is wrong about the bad sector count. The next thing for me to try is to put the affected drive back into the pool and see if I can reproduce the initial chkdsk infinite loop.
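To make the comparison with chkdsk easier to repeat, the sector-related counters can be pulled out of the smartmontools attribute table with a few lines of script. This is only a sketch: it assumes the ten-column layout shown above, and the choice of attributes 5, 196, 197 and 198 as the ones that reflect bad sectors is my own selection rather than anything a tool prescribes. The name `sector_counters` is hypothetical.

```python
# Attribute IDs assumed here to be the sector-health counters:
# 5 Reallocated_Sector_Ct, 196 Reallocated_Event_Count,
# 197 Current_Pending_Sector, 198 Offline_Uncorrectable
SECTOR_IDS = {5, 196, 197, 198}


def sector_counters(smart_text):
    """Map attribute name -> raw value for the sector-related attributes.

    Expects rows shaped like the table in this post:
    ID NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
    """
    counters = {}
    for line in smart_text.splitlines():
        fields = line.split()
        # Skip the header row and anything that is not an attribute row.
        if len(fields) < 10 or not fields[0].isdigit():
            continue
        if int(fields[0]) in SECTOR_IDS:
            # The raw value is the first token of the last column.
            counters[fields[1]] = int(fields[9])
    return counters


# Rows copied from the SMART output above.
sample = """\
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail Always - 0
9 Power_On_Hours 0x0012 100 100 000 Old_age Always - 311
196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 0
197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0008 100 100 000 Old_age Offline - 0
"""
print(sector_counters(sample))
```

If chkdsk /r had really found unreadable sectors on half the drive, I would expect at least one of these raw values to be nonzero, which is why the all-zero readout looks like a contradiction.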

     

     

    Tuesday, May 25, 2010 5:14 PM

All replies

  • Can you file a bug regarding this on Connect, please?
    I'm not on the WHS team, I just post a lot. :)
    Wednesday, May 26, 2010 1:52 PM
    Moderator
  • I will if I can reproduce this behavior.
    Wednesday, May 26, 2010 7:23 PM
  • I'm having this exact problem as well, but with a Hitachi 1TB drive. All of my diagnostics show that the drive is fully functional, and I'm deleting logs pretty much daily. The only solution I've seen is to "Reinstall WHS".
    Friday, June 11, 2010 9:19 PM