Jump to content

sagman

Members
  • Posts

    1
  • Joined

  • Last visited

Posts posted by sagman

  1. I have the same problem with the same hardware.

    Have you been able to fix it?

     

     

    On 2/8/2020 at 9:45 PM, plindberg said:

    I have the same setup as you,


    - Ryzen 7 2700

    - Asrock Rack X470D4U
    - 2x Kingston 16GB ECC DDR4 (KSM26ED8/16ME).

    OS is Debian.

    I assembled this system today and have seen three corrected ECC errors in about 4 hours of operation, seems to be the same errors as you've reported here. All three of mine are on different addresses, though. Currently trying to figure out if this is normal.

    I also noticed that my RAM is running at 2400 and not 2666. This is according to dmidecode.

    May be interesting to note that all three errors happened just as I was running a command in the terminal. One of them happened when I ran sensors-detect, another when I terminated a running stress-test. I don't recall what I was doing when the third one occurred.

    Since then I've been running memory stress tests using stress-ng to see if I can trigger more errors (going on for about one or two hours now), but haven't seen anything yet. I also ran a few loops of memtester and everything passed.

    The fact that we seem to have the exact same issue (if it even is an issue, I'm still holding out on that) on virtually identical systems makes me believe it could be related to some configuration issue. I have left all BIOS settings to their defaults for now - maybe the memory configuration is not optimal for these sticks? Maybe a BIOS update is in order.

    Log entries:

    
    [Hardware Error]: Corrected error, no action required.
    [Hardware Error]: CPU:0 (17:8:2) MC16_STATUS[-|CE|MiscV|-|AddrV|-|-|SyndV|-|CECC]: 0x9c2040000000011b
    [Hardware Error]: Error Addr: 0x00000000018f03c0
    [Hardware Error]: IPID: 0x0000009600150f00, Syndrome: 0x00000a400a400103
    [Hardware Error]: Unified Memory Controller Extended Error Code: 0
    [Hardware Error]: Unified Memory Controller Error: DRAM ECC error.
    [Hardware Error]: cache level: L3/GEN, tx: GEN, mem-tx: RD
    
    —————————
    
    [Hardware Error]: Corrected error, no action required.
    [Hardware Error]: CPU:0 (17:8:2) MC16_STATUS[Over|CE|MiscV|-|AddrV|-|-|SyndV|-|CECC]: 0xdc2040000000011b
    [Hardware Error]: Error Addr: 0x00000003e7eb9800
    [Hardware Error]: IPID: 0x0000009600150f00, Syndrome: 0x00000a400a400103
    [Hardware Error]: Unified Memory Controller Extended Error Code: 0
    [Hardware Error]: Unified Memory Controller Error: DRAM ECC error.
    [Hardware Error]: cache level: L3/GEN, tx: GEN, mem-tx: RD
    
    —————————————
    
    [Hardware Error]: Corrected error, no action required.
    [Hardware Error]: CPU:0 (17:8:2) MC16_STATUS[Over|CE|MiscV|-|AddrV|-|-|SyndV|-|CECC]: 0xdc2040000000011b
    [Hardware Error]: Error Addr: 0x0000000000c10980
    [Hardware Error]: IPID: 0x0000009600150f00, Syndrome: 0x00000a400a400102
    [Hardware Error]: Unified Memory Controller Extended Error Code: 0
    [Hardware Error]: Unified Memory Controller Error: DRAM ECC error.
    [Hardware Error]: cache level: L3/GEN, tx: GEN, mem-tx: RD


     

     

×
×
  • Create New...