• [6.8-RC1] ECC error with ryzen 3700x and ECC ram


    trott
    • Minor

    I just updated to 6.8-RC1, with ryzen 3700x and ECC ram, I got following error in log, it suppose to be a kernel issue

     

    root@Tower:~# dmesg | grep EDAC
    [    0.505230] EDAC MC: Ver: 3.0.0
    [   21.472853] EDAC amd64: Node 0: DRAM ECC enabled.
    [   21.473283] EDAC amd64: F17h detected (node 0).
    [   21.473715] EDAC amd64: Error: F0 not found, device 0x1460 (broken BIOS?)
    [   21.474154] EDAC amd64: Error: Error probing instance: 0
    [   21.496227] EDAC amd64: Node 0: DRAM ECC enabled.
    [   21.496659] EDAC amd64: F17h detected (node 0).
    [   21.497100] EDAC amd64: Error: F0 not found, device 0x1460 (broken BIOS?)
    [   21.497537] EDAC amd64: Error: Error probing instance: 0
    [   21.517185] EDAC amd64: Node 0: DRAM ECC enabled.
    [   21.517620] EDAC amd64: F17h detected (node 0).
    [   21.518067] EDAC amd64: Error: F0 not found, device 0x1460 (broken BIOS?)
    [   21.518504] EDAC amd64: Error: Error probing instance: 0
    [   21.537210] EDAC amd64: Node 0: DRAM ECC enabled.
    [   21.537643] EDAC amd64: F17h detected (node 0).
    [   21.538090] EDAC amd64: Error: F0 not found, device 0x1460 (broken BIOS?)
    [   21.538527] EDAC amd64: Error: Error probing instance: 0
    [   21.558282] EDAC amd64: Node 0: DRAM ECC enabled.
    [   21.558716] EDAC amd64: F17h detected (node 0).
    [   21.559166] EDAC amd64: Error: F0 not found, device 0x1460 (broken BIOS?)
    [   21.559603] EDAC amd64: Error: Error probing instance: 0
    [   21.579233] EDAC amd64: Node 0: DRAM ECC enabled.
    [   21.579666] EDAC amd64: F17h detected (node 0).
    [   21.580114] EDAC amd64: Error: F0 not found, device 0x1460 (broken BIOS?)
    [   21.580552] EDAC amd64: Error: Error probing instance: 0




    User Feedback

    Recommended Comments

    7 hours ago, johnnie.black said:

     

    Most likely a bios issue with new kernel, did you look for a bios update?

    Bios is hte newest, it is the EDAC bug for ryzen 2, there are some patch in kernel 5.4, I think I have to wait

    Link to comment
    On 10/15/2019 at 3:20 PM, trott said:

    Bios is hte newest, it is the EDAC bug for ryzen 2, there are some patch in kernel 5.4, I think I have to wait

    Please post link to the patch.

    Link to comment
    4 hours ago, trott said:

    thanks, I got information from below link, I'm not serach for those patch myself as I don't know how to patch it

    Thank you. It's actually 7 patches.  There is a slight chance those patches will make their way into 5.3 kernel.  Otherwise will have to wait until we move to 5.4.

    Link to comment

    Also seeing this same behavior, just to throw my hat into this mix. Same CPU, 3700X with ECC memory on ASRock X470D4U2-2T Motherboard.

    image.thumb.png.89f26e721f100f674651b6a3fabea854.png

    Edited by cybrnook
    Link to comment

    someone on reddit gave me the info needed to patch Linux 5.3 kernel

    https://www.reddit.com/r/Amd/comments/dw3p1l/confusion_about_am4_and_ecc/

    scroll down a little, hope this helps

    Link to comment
    3 minutes ago, cybrnook said:

    @limetech before 6.8 is feature frozen, do you think we could get these ecc patches added in?

    I doubt it, those patches are quite extensive, more than I want to do in an -rc.  Don't worry, 6.9-rc0 should be out about same time as 5.4 kernel.

    • Like 1
    Link to comment
    13 hours ago, limetech said:

    I doubt it, those patches are quite extensive, more than I want to do in an -rc.  Don't worry, 6.9-rc0 should be out about same time as 5.4 kernel.

     

    thanks, should the 5.4 kernel will be release in 2 days?

    Link to comment

    I was checking to see if this problem was fix.  My sever keeps crashing after I upgraded from 16G to 32G of ram.  Running unraid 6.8.2 now.  See my log file below.

     

     

    image.png.d3ad85a9e83baec2088fa14dcdb73490.png

      

     

     

    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: Node 0: DRAM ECC disabled.
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
    Feb  2 12:22:59 Mediaserv kernel: Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
    Feb  2 12:22:59 Mediaserv kernel: (Note that use of the override may cause unknown side effects.)
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: Node 1: DRAM ECC disabled.
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
    Feb  2 12:22:59 Mediaserv kernel: Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
    Feb  2 12:22:59 Mediaserv kernel: (Note that use of the override may cause unknown side effects.)
    Feb  2 12:22:59 Mediaserv kernel: Bluetooth: hci0: read Intel version: 370810225019140f34
    Feb  2 12:22:59 Mediaserv kernel: Bluetooth: hci0: Intel device is already patched. patch num: 34
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: Node 0: DRAM ECC disabled.
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
    Feb  2 12:22:59 Mediaserv kernel: Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
    Feb  2 12:22:59 Mediaserv kernel: (Note that use of the override may cause unknown side effects.)
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: Node 1: DRAM ECC disabled.
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
    Feb  2 12:22:59 Mediaserv kernel: Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
    Feb  2 12:22:59 Mediaserv kernel: (Note that use of the override may cause unknown side effects.)
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: Node 0: DRAM ECC disabled.
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
    Feb  2 12:22:59 Mediaserv kernel: Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
    Feb  2 12:22:59 Mediaserv kernel: (Note that use of the override may cause unknown side effects.)
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: Node 1: DRAM ECC disabled.
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
    Feb  2 12:22:59 Mediaserv kernel: Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
    Feb  2 12:22:59 Mediaserv kernel: (Note that use of the override may cause unknown side effects.)
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: Node 0: DRAM ECC disabled.
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
    Feb  2 12:22:59 Mediaserv kernel: Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
    Feb  2 12:22:59 Mediaserv kernel: (Note that use of the override may cause unknown side effects.)
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: Node 1: DRAM ECC disabled.
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
    Feb  2 12:22:59 Mediaserv kernel: Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
    Feb  2 12:22:59 Mediaserv kernel: (Note that use of the override may cause unknown side effects.)
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: Node 0: DRAM ECC disabled.
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
    Feb  2 12:22:59 Mediaserv kernel: Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
    Feb  2 12:22:59 Mediaserv kernel: (Note that use of the override may cause unknown side effects.)
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: Node 1: DRAM ECC disabled.
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
    Feb  2 12:22:59 Mediaserv kernel: Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
    Feb  2 12:22:59 Mediaserv kernel: (Note that use of the override may cause unknown side effects.)
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: Node 0: DRAM ECC disabled.
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
    Feb  2 12:22:59 Mediaserv kernel: Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
    Feb  2 12:22:59 Mediaserv kernel: (Note that use of the override may cause unknown side effects.)
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: Node 1: DRAM ECC disabled.
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
    Feb  2 12:22:59 Mediaserv kernel: Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
    Feb  2 12:22:59 Mediaserv kernel: (Note that use of the override may cause unknown side effects.)
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: Node 0: DRAM ECC disabled.
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
    Feb  2 12:22:59 Mediaserv kernel: Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
    Feb  2 12:22:59 Mediaserv kernel: (Note that use of the override may cause unknown side effects.)
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: Node 1: DRAM ECC disabled.
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
    Feb  2 12:22:59 Mediaserv kernel: Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
    Feb  2 12:22:59 Mediaserv kernel: (Note that use of the override may cause unknown side effects.)
    Feb  2 12:22:59 Mediaserv kernel: ata9: SATA link down (SStatus 0 SControl 300)
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: Node 0: DRAM ECC disabled.
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
    Feb  2 12:22:59 Mediaserv kernel: Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
    Feb  2 12:22:59 Mediaserv kernel: (Note that use of the override may cause unknown side effects.)
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: Node 1: DRAM ECC disabled.
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
    Feb  2 12:22:59 Mediaserv kernel: Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
    Feb  2 12:22:59 Mediaserv kernel: (Note that use of the override may cause unknown side effects.)
    Feb  2 12:22:59 Mediaserv kernel: ata10: SATA link down (SStatus 0 SControl 300)
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: Node 0: DRAM ECC disabled.
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
    Feb  2 12:22:59 Mediaserv kernel: Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
    Feb  2 12:22:59 Mediaserv kernel: (Note that use of the override may cause unknown side effects.)
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: Node 1: DRAM ECC disabled.
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
    Feb  2 12:22:59 Mediaserv kernel: Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
    Feb  2 12:22:59 Mediaserv kernel: (Note that use of the override may cause unknown side effects.)
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: Node 0: DRAM ECC disabled.
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
    Feb  2 12:22:59 Mediaserv kernel: Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
    Feb  2 12:22:59 Mediaserv kernel: (Note that use of the override may cause unknown side effects.)
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: Node 1: DRAM ECC disabled.
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
    Feb  2 12:22:59 Mediaserv kernel: Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
    Feb  2 12:22:59 Mediaserv kernel: (Note that use of the override may cause unknown side effects.)
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: Node 0: DRAM ECC disabled.
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
    Feb  2 12:22:59 Mediaserv kernel: Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
    Feb  2 12:22:59 Mediaserv kernel: (Note that use of the override may cause unknown side effects.)
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: Node 1: DRAM ECC disabled.
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
    Feb  2 12:22:59 Mediaserv kernel: Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
    Feb  2 12:22:59 Mediaserv kernel: (Note that use of the override may cause unknown side effects.)
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: Node 0: DRAM ECC disabled.
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
    Feb  2 12:22:59 Mediaserv kernel: Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
    Feb  2 12:22:59 Mediaserv kernel: (Note that use of the override may cause unknown side effects.)
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: Node 1: DRAM ECC disabled.
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
    Feb  2 12:22:59 Mediaserv kernel: Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
    Feb  2 12:22:59 Mediaserv kernel: (Note that use of the override may cause unknown side effects.)
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: Node 0: DRAM ECC disabled.
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
    Feb  2 12:22:59 Mediaserv kernel: Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
    Feb  2 12:22:59 Mediaserv kernel: (Note that use of the override may cause unknown side effects.)
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: Node 1: DRAM ECC disabled.
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
    Feb  2 12:22:59 Mediaserv kernel: Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
    Feb  2 12:22:59 Mediaserv kernel: (Note that use of the override may cause unknown side effects.)
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: Node 0: DRAM ECC disabled.
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
    Feb  2 12:22:59 Mediaserv kernel: Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
    Feb  2 12:22:59 Mediaserv kernel: (Note that use of the override may cause unknown side effects.)
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: Node 1: DRAM ECC disabled.
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
    Feb  2 12:22:59 Mediaserv kernel: Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
    Feb  2 12:22:59 Mediaserv kernel: (Note that use of the override may cause unknown side effects.)
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: Node 0: DRAM ECC disabled.
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
    Feb  2 12:22:59 Mediaserv kernel: Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
    Feb  2 12:22:59 Mediaserv kernel: (Note that use of the override may cause unknown side effects.)
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: Node 1: DRAM ECC disabled.
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
    Feb  2 12:22:59 Mediaserv kernel: Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
    Feb  2 12:22:59 Mediaserv kernel: (Note that use of the override may cause unknown side effects.)
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: Node 0: DRAM ECC disabled.
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
    Feb  2 12:22:59 Mediaserv kernel: Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
    Feb  2 12:22:59 Mediaserv kernel: (Note that use of the override may cause unknown side effects.)
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: Node 1: DRAM ECC disabled.
    Feb  2 12:22:59 Mediaserv kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
    Feb  2 12:22:59 Mediaserv kernel: Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
    Feb  2 12:22:59 Mediaserv kernel: (Note that use of the override may cause unknown side effects.)

     

    Link to comment

     

    Same errors here with NON ECC RAM : ASUS x570 Strix E, 3900x, 2x16GB Gskill Trident z Neo running on XMP 3600Mhz.

    Installed BIOS is newest Version: 1407 https://www.asus.com/Motherboards/ROG-Strix-X570-E-Gaming/HelpDesk_BIOS/

    This didn't happen on 6.8.x.

     

    Also I get some other CPU related hardware error. This one I also got on 6.8.x:

    mce: [Hardware Error]: Machine check events logged
    Mar 14 11:27:19 XXX kernel: [Hardware Error]: Corrected error, no action required.
    Mar 14 11:27:19 XXX kernel: [Hardware Error]: CPU:0 (17:71:0) MC27_STATUS[-|CE|MiscV|-|-|-|SyndV|-|-|-]: 0x982000000002080b
    Mar 14 11:27:19 XXX kernel: [Hardware Error]: IPID: 0x0001002e00000500, Syndrome: 0x000000005a020001
    Mar 14 11:27:19 XXX kernel: [Hardware Error]: Power, Interrupts, etc. Ext. Error Code: 2, Link Error.
    Mar 14 11:27:19 XXX kernel: [Hardware Error]: cache level: L3/GEN, mem/io: IO, mem-tx: GEN, part-proc: SRC (no timeout)

     

    Link to comment

    You can stop this by adding 

    blacklist amd64_edac_mod

    To the /lib/modprobe.d and /etc/modprobe.d/ create a new .conf, I named it amd64_edac_mod.conf then within that file place this blacklist amd64_edac_mod

     

    Source

     

     

    Edited by Fiservedpi
    Link to comment


    Join the conversation

    You can post now and register later. If you have an account, sign in now to post with your account.
    Note: Your post will require moderator approval before it will be visible.

    Guest
    Add a comment...

    ×   Pasted as rich text.   Restore formatting

      Only 75 emoji are allowed.

    ×   Your link has been automatically embedded.   Display as a link instead

    ×   Your previous content has been restored.   Clear editor

    ×   You cannot paste images directly. Upload or insert images from URL.


  • Status Definitions

     

    Open = Under consideration.

     

    Solved = The issue has been resolved.

     

    Solved version = The issue has been resolved in the indicated release version.

     

    Closed = Feedback or opinion better posted on our forum for discussion. Also for reports we cannot reproduce or need more information. In this case just add a comment and we will review it again.

     

    Retest = Please retest in latest release.


    Priority Definitions

     

    Minor = Something not working correctly.

     

    Urgent = Server crash, data loss, or other showstopper.

     

    Annoyance = Doesn't affect functionality but should be fixed.

     

    Other = Announcement or other non-issue.