Jump to content

Patb

Members
  • Posts

    38
  • Joined

  • Last visited

Everything posted by Patb

  1. I am receiving the following error which appears to be a memory error but my server (IBM X3630 M4) is not reporting memory errors. I ran a memory test last night to try to pinpoint the problem via the IBM Dynamic System Analysis and no problems were found. Edit: I added the diagnostics file May 22 03:44:42 Tower kernel: mce: [Hardware Error]: Machine check events logged May 22 03:44:42 Tower kernel: EDAC sbridge MC1: HANDLING MCE MEMORY ERROR May 22 03:44:42 Tower kernel: EDAC sbridge MC1: CPU 6: Machine Check Event: 0 Bank 5: 8c00004000010092 May 22 03:44:42 Tower kernel: EDAC sbridge MC1: TSC 280bb741d6ca May 22 03:44:42 Tower kernel: EDAC sbridge MC1: ADDR 8f9f1a3c0 May 22 03:44:42 Tower kernel: EDAC sbridge MC1: MISC 40262686 May 22 03:44:42 Tower kernel: EDAC sbridge MC1: PROCESSOR 0:206d7 TIME 1558511082 SOCKET 1 APIC 20 May 22 03:44:42 Tower kernel: EDAC MC1: 1 CE memory read error on CPU_SrcID#1_Ha#0_Chan#2_DIMM#0 (channel:2 slot:0 page:0x8f9f1a offset:0x3c0 grain:32 syndrome:0x0 - area:DRAM err_code:0001:0092 socket:1 ha:0 channel_mask:4 rank:1) May 22 04:40:01 Tower root: Fix Common Problems Version 2019.05.18a May 22 04:40:06 Tower root: Fix Common Problems: Error: Machine Check Events detected on your server Thanks tower-diagnostics-20190523-1127.zip
  2. Thanks for the clarification. That's a little confusing under Memory but that makes more sense.
  3. I recently started using Unraid and upgraded to 6.7. I'm running a number of dockers and in the new version I see that my Docker memory usage is at 44%. Nothing to be alarmed at right now but given then I'm running a server with 64gb of RAM and that my overall usage is at only 7% I would like to understand the significance of this and how to allocate more RAM to docker. Much appreciated.
  4. I seem to be having the same problem. Did you ever find a solution? I'm running on an IBM server and the integrated management is not showing any errors.
  5. I'm just starting with Unraid and am wondering if the following process can be followed to upgrade the parity drive (i.e. swap the parity disk with a larger disk).? Based on the searches I've done I see that the basic steps are to power down, remove parity disk, install new disk and power back up and let parity rebuild. The question I have is can I make use of the second parity to maintain a safe(r) state? i.e.: power down, install new disk and set it as parity 2 and let it rebuild. Once done power down and remove parity 1 and reassign that to grow the array. Essentially my question comes down to: once I go to 2 parity disks can I go back to single parity? Obviously this would only be in a single parity environment. Thanks
  6. Patb

    CRC errors

    So the problem ended up being with the HBA controller... after replacing the backplane and the SAS cables I tried installing an M1015 flashed to IT mode and that seems to have put an end to the CRC errors for now. I'm preclearing the drives and the SSD has cleared without error and not seeing the CRC error count creeping up anymore. This is a relief. Thanks for the assistance. I will finally be able to start to test Unraid on this server
  7. Patb

    CRC errors

    Thanks, I was wondering about that. I'm not lacking in USB keys. I'm ok to start over since I didn't really get it off the ground yet.
  8. Patb

    CRC errors

    It took some work to get the firmware update working correctly but finally got it. I flashed the backplane but still getting errors. I'm now waiting to receive a replacement backplane from Ebay and some SAS cables... The CRC error count on one of my hard drives is now up to over 6,000,000 as a result :( As a result of the time I'm spending on this I have a question on the trial key... is it tied to the particular USB key or does it fingerprint the system? I'm wondering because at this rate I won't have been able to actually test anything before it runs out. I know that I'll be able to request an extension but I may burn through that as well if I don't identify the issue soon.
  9. Patb

    CRC errors

    ordered a replacement backplane on ebayu. I'm also going to try a different controller once I flash a m1015 I have lying around to IT mode.
  10. Patb

    CRC errors

    It seems to be all disks at different rate. I guess I'll have to find a replacement backplane
  11. Patb

    CRC errors

    Thanks. I'm going to start by reseating the cards and the SAS cables and see if that has an impact.
  12. I recently purchased an IBM X3630 M4 with 14 drive bays and I'm trying out Unraid for the first time. I tried different drives (some old, some recently shucked) in different slots I'm getting a ridiculous number of CRC errors. I ended up rebooting when I hit over 200,000 on one of the drives. I tried to search the forums but didn't find anything on point. Any assistance would be greatly appreciated. Thanks Patrick tower-diagnostics-20190402-0001.zip
  13. Ordered a shot glass today (March 19, 2019), those always come in handy.
×
×
  • Create New...