  • [7.0.0-beta.1] kernel bug


    bpwats
    • Annoyance

    With the latest beta, I run into a kernel bug. UnRaid keeps running in the sense that it is not a total crash.
    However:

    • Loading the Docker page takes forever (but it ultimately shows).
    • Container update progress stops (see the popover window that opens on update), and after refreshing the Docker page the updates have not completed.
    • The Docker section on the Apps tab takes too long to load. Obviously, Apps and Docker are linked.
    • Reboot does not work. A terminal reports that the system is going down for reboot, but this does not happen. Physically pressing the on/off button is needed to restart and UnRaid 

     

    Jul  4 21:21:40 Fractal kernel: BUG: unable to handle page fault for address: 0000000200000002
    Jul  4 21:21:40 Fractal kernel: #PF: supervisor instruction fetch in kernel mode
    Jul  4 21:21:40 Fractal kernel: #PF: error_code(0x0010) - not-present page
    Jul  4 21:21:40 Fractal kernel: PGD 24af2f067 P4D 24af2f067 PUD 0 
    Jul  4 21:21:40 Fractal kernel: Oops: 0010 [#1] PREEMPT SMP NOPTI
    Jul  4 21:21:40 Fractal kernel: CPU: 5 PID: 1324 Comm: arc_prune Tainted: P           O       6.8.12-Unraid #3
    Jul  4 21:21:40 Fractal kernel: Hardware name: ASUS System Product Name/PRIME H510M-E, BIOS 2402 12/18/2023
    Jul  4 21:21:40 Fractal kernel: RIP: 0010:0x200000002
    Jul  4 21:21:40 Fractal kernel: Code: Unable to access opcode bytes at 0x1ffffffd8.
    Jul  4 21:21:40 Fractal kernel: RSP: 0018:ffffc9000098fd30 EFLAGS: 00010246
    Jul  4 21:21:40 Fractal kernel: RAX: 0000000200000002 RBX: ffff8884f4070000 RCX: 0000000000000011
    Jul  4 21:21:40 Fractal kernel: RDX: ffffffffa0cc54b8 RSI: ffffc9000098fd68 RDI: ffff8881c13ac580
    Jul  4 21:21:40 Fractal kernel: RBP: ffffc9000098fdcc R08: 0000000000000000 R09: 00000000001d001c
    Jul  4 21:21:40 Fractal kernel: R10: 0000000000000000 R11: ffffc9002186fee8 R12: 000000000000bbda
    Jul  4 21:21:40 Fractal kernel: R13: ffff8881c13ac580 R14: ffff8881c84bfc00 R15: ffff88811176a100
    Jul  4 21:21:40 Fractal kernel: FS:  0000000000000000(0000) GS:ffff88883e740000(0000) knlGS:0000000000000000
    Jul  4 21:21:40 Fractal kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    Jul  4 21:21:40 Fractal kernel: CR2: 0000000200000002 CR3: 0000000251da4006 CR4: 00000000003706f0
    Jul  4 21:21:40 Fractal kernel: Call Trace:
    Jul  4 21:21:40 Fractal kernel: <TASK>
    Jul  4 21:21:40 Fractal kernel: ? __die_body+0x1a/0x5c
    Jul  4 21:21:40 Fractal kernel: ? page_fault_oops+0x332/0x37f
    Jul  4 21:21:40 Fractal kernel: ? put_cpu_partial+0x62/0x8e
    Jul  4 21:21:40 Fractal kernel: ? spl_kmem_cache_free+0x3a/0x180 [spl]
    Jul  4 21:21:40 Fractal kernel: ? exc_page_fault+0xf9/0x116
    Jul  4 21:21:40 Fractal kernel: ? asm_exc_page_fault+0x22/0x30
    Jul  4 21:21:40 Fractal kernel: ? zfs_prune+0xec/0x2ec [zfs]
    Jul  4 21:21:40 Fractal kernel: ? zpl_prune_sb+0x32/0x50 [zfs]
    Jul  4 21:21:40 Fractal kernel: ? arc_prune_task+0x1b/0x2e [zfs]
    Jul  4 21:21:40 Fractal kernel: ? taskq_thread+0x2d4/0x3c1 [spl]
    Jul  4 21:21:40 Fractal kernel: ? __pfx_default_wake_function+0x10/0x10
    Jul  4 21:21:40 Fractal kernel: ? __pfx_taskq_thread+0x10/0x10 [spl]
    Jul  4 21:21:40 Fractal kernel: ? kthread+0xf4/0xff
    Jul  4 21:21:40 Fractal kernel: ? __pfx_kthread+0x10/0x10
    Jul  4 21:21:40 Fractal kernel: ? ret_from_fork+0x21/0x36
    Jul  4 21:21:40 Fractal kernel: ? __pfx_kthread+0x10/0x10
    Jul  4 21:21:40 Fractal kernel: ? ret_from_fork_asm+0x1b/0x30
    Jul  4 21:21:40 Fractal kernel: </TASK>
    Jul  4 21:21:40 Fractal kernel: Modules linked in: nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat ip_set nf_tables xt_nat xt_tcpudp xt_conntrack xt_MASQUERADE nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 xt_addrtype br_netfilter bridge stp llc nfsd auth_rpcgss oid_registry lockd grace sunrpc bluetooth ecdh_generic ecc md_mod tcp_diag inet_diag ip6table_filter ip6_tables iptable_filter ip_tables x_tables efivarfs macvtap macvlan tap intel_rapl_common x86_pkg_temp_thermal i915 intel_powerclamp coretemp zfs(PO) kvm_intel kvm crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel sha512_ssse3 iosf_mbi sha256_ssse3 drm_buddy sha1_ssse3 ttm aesni_intel crypto_simd i2c_algo_bit cryptd drm_display_helper drm_kms_helper input_leds rapl spl(O) mei_hdcp mei_pxp intel_cstate wmi_bmof drm nvme intel_uncore e1000e hid_apple led_class nvme_core mei_me intel_gtt i2c_i801 agpgart i2c_smbus mei ahci
    Jul  4 21:21:40 Fractal kernel: i2c_core libahci thermal fan tpm_crb video tpm_tis tpm_tis_core tpm wmi backlight button acpi_tad acpi_pad
    Jul  4 21:21:40 Fractal kernel: CR2: 0000000200000002
    Jul  4 21:21:40 Fractal kernel: ---[ end trace 0000000000000000 ]---
    Jul  4 21:21:40 Fractal kernel: pstore: backend (efi_pstore) writing error (-5)
    Jul  4 21:21:40 Fractal kernel: RIP: 0010:0x200000002
    Jul  4 21:21:40 Fractal kernel: Code: Unable to access opcode bytes at 0x1ffffffd8.
    Jul  4 21:21:40 Fractal kernel: RSP: 0018:ffffc9000098fd30 EFLAGS: 00010246
    Jul  4 21:21:40 Fractal kernel: RAX: 0000000200000002 RBX: ffff8884f4070000 RCX: 0000000000000011
    Jul  4 21:21:40 Fractal kernel: RDX: ffffffffa0cc54b8 RSI: ffffc9000098fd68 RDI: ffff8881c13ac580
    Jul  4 21:21:40 Fractal kernel: RBP: ffffc9000098fdcc R08: 0000000000000000 R09: 00000000001d001c
    Jul  4 21:21:40 Fractal kernel: R10: 0000000000000000 R11: ffffc9002186fee8 R12: 000000000000bbda
    Jul  4 21:21:40 Fractal kernel: R13: ffff8881c13ac580 R14: ffff8881c84bfc00 R15: ffff88811176a100
    Jul  4 21:21:40 Fractal kernel: FS:  0000000000000000(0000) GS:ffff88883e740000(0000) knlGS:0000000000000000
    Jul  4 21:21:40 Fractal kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    Jul  4 21:21:40 Fractal kernel: CR2: 0000000200000002 CR3: 0000000251da4006 CR4: 00000000003706f0
    Jul  4 21:21:40 Fractal kernel: note: arc_prune[1324] exited with irqs disabled
    Jul  4 21:21:50 Fractal kernel: veth95a1b70: renamed from eth0
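To see at a glance which kernel modules appear in a call trace like the one above, the frames can be grouped by the module name in square brackets. This is only a rough sketch: the regular expression is my own assumption about the line layout shown in this log, the names `parse_frames` and `count_modules` are made up for illustration, and frames without a bracketed module are simply labelled "kernel".

```python
import re
from collections import Counter

# Matches call-trace entries such as "? zfs_prune+0xec/0x2ec [zfs]".
# The bracketed module suffix is absent for functions built into the kernel.
FRAME_RE = re.compile(
    r"kernel: (?:\? )?(?P<func>[\w.]+)\+0x[0-9a-f]+/0x[0-9a-f]+"
    r"(?: \[(?P<module>\w+)\])?\s*$"
)


def parse_frames(log_text):
    """Return a list of (function, module) pairs found in the log text."""
    frames = []
    for line in log_text.splitlines():
        match = FRAME_RE.search(line)
        if match:
            frames.append((match.group("func"), match.group("module") or "kernel"))
    return frames


def count_modules(frames):
    """Count how many frames belong to each module."""
    return Counter(module for _, module in frames)


# A few lines copied from the trace in this report.
SAMPLE = """\
Jul  4 21:21:40 Fractal kernel: ? asm_exc_page_fault+0x22/0x30
Jul  4 21:21:40 Fractal kernel: ? zfs_prune+0xec/0x2ec [zfs]
Jul  4 21:21:40 Fractal kernel: ? zpl_prune_sb+0x32/0x50 [zfs]
Jul  4 21:21:40 Fractal kernel: ? arc_prune_task+0x1b/0x2e [zfs]
Jul  4 21:21:40 Fractal kernel: ? taskq_thread+0x2d4/0x3c1 [spl]
Jul  4 21:21:40 Fractal kernel: ? kthread+0xf4/0xff
"""

if __name__ == "__main__":
    frames = parse_frames(SAMPLE)
    for func, module in frames:
        print(f"{module:8} {func}")
    print(count_modules(frames))
```

On the lines sampled here, most frames sit in the zfs and spl modules, which fits the faulting task being arc_prune. It does not prove where the bad pointer came from, it only narrows down where to look.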

     

    What have I done to remedy?

    1. Checked my Asus BIOS version; it is the latest.
    2. Ran MemTest. No errors appeared.
    3. Reverting to 6.12.10 solves the problem.
    4. If a second USB drive (with an Ubuntu ISO for troubleshooting the boot drive) is inserted, the first line becomes
      Jul  4 21:21:40 Fractal kernel: BUG: unable to handle page fault for address: 0000000000000000

       

    Looking around the internet for this issue makes me think it is a faulty kernel.

     

    Diagnostics:

    fractal-diagnostics-20240704-2212.zip

     





    Try booting in safe mode to rule out any plugins issues, there are previous reports of similar crashes caused by a plugin, possibly a zfs related plugin.

    Link to comment
    8 minutes ago, ydddj said:

    Have same problem.

    Are you also using a docker folder? If yes 

     

    22 minutes ago, JorgeB said:

    try using a docker image instead of a folder.

     

    Link to comment
    13 minutes ago, JorgeB said:

    Are you also using a docker folder? If yes 

     

     

    Yes, a docker folder.

    OK, I will change to an image and give it a try.

    Link to comment

    I'm now almost certain that this is not a plugin issue but a zfs issue with docker. This was just reported today and looks like the exact same issue:

     

    https://github.com/openzfs/zfs/issues/16324

     

    I think that the best bet for now is to change to a docker image, or if you really want a docker folder, switch to xfs or btrfs.
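Before switching, it can help to confirm which filesystem the docker directory actually lives on. Below is a minimal sketch that looks a path up in /proc/mounts-style text using the longest matching mount point. The helper names and the sample mount lines (including /mnt/cache and /var/lib/docker) are hypothetical and only for illustration; check the real paths on your own server.

```python
import posixpath


def parse_mounts(mounts_text):
    """Return (mount_point, fs_type) pairs from /proc/mounts-style text.

    Note: /proc/mounts escapes spaces in paths as \\040; this sketch does
    not decode such escapes.
    """
    entries = []
    for line in mounts_text.splitlines():
        fields = line.split()
        if len(fields) >= 3:
            entries.append((fields[1], fields[2]))
    return entries


def fs_type_for(path, entries):
    """Return the fs type of the longest mount point containing path."""
    path = posixpath.normpath(path)
    best = None
    for mount_point, fs_type in entries:
        mp = mount_point.rstrip("/") or "/"
        inside = path == mp or path.startswith(mp.rstrip("/") + "/")
        # ">=" lets a later mount on the same mount point win.
        if inside and (best is None or len(mp) >= len(best[0])):
            best = (mp, fs_type)
    return best[1] if best else None


# Hypothetical mount table, not taken from the diagnostics in this thread.
SAMPLE_MOUNTS = """\
rootfs / rootfs rw 0 0
cache /mnt/cache zfs rw,xattr,posixacl 0 0
/dev/loop2 /var/lib/docker btrfs rw,noatime 0 0
"""

if __name__ == "__main__":
    entries = parse_mounts(SAMPLE_MOUNTS)
    for candidate in ("/mnt/cache/system/docker", "/var/lib/docker/overlay2"):
        print(candidate, "->", fs_type_for(candidate, entries))
```

To run it against a live system, read the text from /proc/mounts instead of the sample string. A result of zfs for the docker directory would mean the setup matches the one discussed in this thread.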

    Link to comment
    On 7/5/2024 at 12:21 PM, ydddj said:

    Have same problem.


     

    Thank you for chiming in!

    Link to comment
    42 minutes ago, JorgeB said:

    I'm now almost certain that this is not a plugin issue, but a zfs issue with docker, this was just reported today, looks like the exact same issue:

     

    https://github.com/openzfs/zfs/issues/16324

     

    I think that the best bet for now, is to change to a docker image, or if you really want a docker folder, switch to xfs or btrfs.

    I agree it is most likely not plugin related. Earlier, to make sure a plugin was not the culprit, I removed the Intel ones and Sanoid especially, but it did not make a difference.

     

    I will try switching to a Docker image soon.

    Link to comment
    6 hours ago, JorgeB said:

    I'm now almost certain that this is not a plugin issue, but a zfs issue with docker, this was just reported today, looks like the exact same issue:

     

    https://github.com/openzfs/zfs/issues/16324

     

    I think that the best bet for now, is to change to a docker image, or if you really want a docker folder, switch to xfs or btrfs.

    Yes, that's the problem. I have changed to a btrfs image and it's running well now. What's the difference between xfs and btrfs? Which one is more recommended? Thanks very much.

    Link to comment

    There are also 2 other docker bugs.

    a. With 2 or more network interfaces, docker only shows br0 and does not show br1, but VMs show both.

    b. I cannot switch to macvlan. After changing to macvlan and saving, it is still ipvlan.

    Thanks.

     

    Link to comment
    10 hours ago, ydddj said:

    There are also 2 other docker bugs.

    Please start a new thread for this.

     

    10 hours ago, ydddj said:

    What's the difference between xfs and btrfs? Which one is more recommended?

    For single-device pools, xfs is probably better for the typical user; if you want a mirror, then use btrfs.

    Link to comment



