Jul 26 21:22:50 Tower kernel: nvme nvme0: controller is down; will reset: CSTS=0xffffffff, PCI_STATUS=0x10
Jul 26 21:22:50 Tower kernel: blk_update_request: I/O error, dev nvme0n1, sector 627533784 op 0x0:(READ) flags 0x80700 phys_seg 1 prio class 0
Jul 26 21:22:50 Tower kernel: nvme 0000:02:00.0: enabling device (0000 -> 0002)
Jul 26 21:22:50 Tower kernel: nvme nvme0: Removing after probe failure status: -19
The NVMe device dropped offline. This can sometimes help:
Some NVMe devices have issues with power states on Linux. To try this, on the main GUI page click on flash, scroll down to "Syslinux Configuration", make sure it's set to "menu view" (on the top right), and add the following to your default boot option, after "append initrd=/bzroot":
nvme_core.default_ps_max_latency_us=0
e.g.:
append initrd=/bzroot nvme_core.default_ps_max_latency_us=0
Reboot and see if it makes a difference.
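After the reboot you can confirm the option actually took effect. This is only a rough sketch: check_cmdline is a made-up helper name, and it assumes the standard Linux /proc/cmdline and /sys/module/nvme_core/parameters paths are available on your system.

```shell
#!/bin/sh
# Sketch: confirm the kernel picked up the NVMe power-state option after reboot.

# check_cmdline is a hypothetical helper: it succeeds only if the given
# kernel command line contains the option set to 0.
check_cmdline() {
    case " $1 " in
        *" nvme_core.default_ps_max_latency_us=0 "*) return 0 ;;
        *) return 1 ;;
    esac
}

# /proc/cmdline holds the command line the running kernel was booted with.
if check_cmdline "$(cat /proc/cmdline 2>/dev/null)"; then
    echo "boot option is active"
else
    echo "boot option not found in /proc/cmdline"
fi

# Value the nvme_core module is actually using (should read 0).
param=/sys/module/nvme_core/parameters/default_ps_max_latency_us
if [ -r "$param" ]; then
    cat "$param"
fi
```

If the option is not reported as active, check that the line was added to the default boot entry and not to one of the other menu entries.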
P.S. The server is running out of RAM; you should limit resource usage.