Help Diagnosing Random Crash/Reboot While Transferring Files


I am new to Unraid and to server building in general. I am running unRAID Server Trial, version 6.7.2. I built a box around a Supermicro X8SIL-F board with a Xeon X3470 and 32 GB of ECC RAM. I have a shucked 10 TB Western Digital drive for parity and two more shucked drives in the array (8 TB and 10 TB). A 240 GB Sunbow mSATA SSD in an mSATA-to-SATA enclosure serves as cache, and a 1 TB Silicon Power SSD is mounted as an unassigned device, which is where I moved appdata. Everything seemed to be set up well until I started moving media to the new server. The server crashes and reboots itself at random while transferring data. The crashes do not seem to correlate with a particular file size or with the amount of data being moved: sometimes I can transfer two or three files in a row, and other times the system reboots on back-to-back file copies. I thought the janky Sunbow setup might be to blame, so I set the share to skip the cache and write directly to the array; it still crashed. I have been running a memtest for the past 48 hours, and it just completed its fifth pass with 0 errors. Any ideas what I should try next?

 

Here is the System Overview:

Quote

System Overview

unRAID system:unRAID server Trial, version 6.7.2

Model:Custom

Motherboard:Supermicro - X8SIL

Processor:Intel® Xeon® CPU X3470 @ 2.93GHz

HVM:Enabled

IOMMU:Enabled

Cache:L1-Cache = 256 kB (max. capacity 256 kB)

L2-Cache = 1024 kB (max. capacity 1024 kB)

L3-Cache = 8192 kB (max. capacity 8192 kB)

Memory:32 GB (max. installable capacity 32 GB)*

DIMM1A = 8192 MB, 800 MT/s

DIMM1B = 8192 MB, 800 MT/s

DIMM2A = 8192 MB, 800 MT/s

DIMM2B = 8192 MB, 800 MT/s

Network:bond0: fault-tolerance (active-backup), mtu 1500

eth0: not connected

eth1: 1000Mb/s, full duplex, mtu 1500

Kernel:Linux 4.19.56-Unraid x86_64

OpenSSL:1.1.1c

P + Q algorithm:7425 MB/s + 10226 MB/s

Uptime:0 days, 0 hours, 6 minutes, 29 seconds

 

Plugins:

  • CA Auto Update Applications
  • CA Backup / Restore Appdata
  • CA Cleanup Appdata
  • Community Applications
  • Dynamix Active Streams
  • Dynamix Cache Directories
  • Dynamix Date Time
  • Dynamix S3 Sleep
  • Dynamix SSD TRIM
  • Dynamix System Information
  • Dynamix System Statistics
  • Dynamix System Temperature
  • Fix Common Problems
  • Nerd Tools
  • Preclear Disks
  • Unassigned Devices
  • unBALANCE

 

 

 

 

Here are the syslogs from before and after the reboots, which I captured by tailing the log in a PuTTY terminal. Unfortunately, the reboot appears to wipe everything that might point to the problem.

 

Nov 29 14:52:28 Tower kernel: mdcmd (39): stop
Nov 29 14:52:28 Tower kernel: md1: stopping
Nov 29 14:52:29 Tower kernel: md2: stopping
Nov 29 14:52:29 Tower avahi-daemon[6826]: Server startup complete. Host name is Tower.local. Local service cookie is 1327877000.
Nov 29 14:52:30 Tower avahi-daemon[6826]: Service "Tower" (/services/ssh.service) successfully established.
Nov 29 14:52:30 Tower avahi-daemon[6826]: Service "Tower" (/services/smb.service) successfully established.
Nov 29 14:52:30 Tower avahi-daemon[6826]: Service "Tower" (/services/sftp-ssh.service) successfully established.
Nov 29 14:52:32 Tower root: error: /plugins/preclear.disk/Preclear.php: wrong csrf_token
Nov 29 14:57:18 Tower ntpd[1731]: kernel reports TIME_ERROR: 0x41: Clock Unsynchronized
Nov 29 16:06:35 Tower sshd[16271]: Accepted password for root from 192.168.1.22 port 54997 ssh2
Nov 29 16:07:30 Tower emhttpd: req (1): startState=STOPPED&file=&optionCorrect=correct&csrf_token=***************&cmdStart=Start
Nov 29 16:07:30 Tower emhttpd: shcmd (4618): /usr/local/sbin/set_ncq sdc 1
Nov 29 16:07:30 Tower kernel: mdcmd (40): set md_num_stripes 1280
Nov 29 16:07:30 Tower kernel: mdcmd (41): set md_sync_window 384
Nov 29 16:07:30 Tower kernel: mdcmd (42): set md_sync_thresh 192
Nov 29 16:07:30 Tower kernel: mdcmd (43): set md_write_method
Nov 29 16:07:30 Tower kernel: mdcmd (44): set spinup_group 0 0
Nov 29 16:07:30 Tower kernel: mdcmd (45): set spinup_group 1 0
Nov 29 16:07:30 Tower kernel: mdcmd (46): set spinup_group 2 0
Nov 29 16:07:30 Tower emhttpd: shcmd (4619): echo 128 > /sys/block/sdc/queue/nr_requests
Nov 29 16:07:30 Tower emhttpd: shcmd (4620): /usr/local/sbin/set_ncq sdb 1
Nov 29 16:07:30 Tower emhttpd: shcmd (4621): echo 128 > /sys/block/sdb/queue/nr_requests
Nov 29 16:07:30 Tower emhttpd: shcmd (4622): /usr/local/sbin/set_ncq sdf 1
Nov 29 16:07:30 Tower emhttpd: shcmd (4623): echo 128 > /sys/block/sdf/queue/nr_requests
Nov 29 16:07:30 Tower kernel: mdcmd (47): start STOPPED
Nov 29 16:07:30 Tower kernel: unraid: allocating 20860K for 1280 stripes (4 disks)
Nov 29 16:07:30 Tower kernel: md1: running, size: 9766436812 blocks
Nov 29 16:07:30 Tower kernel: md2: running, size: 7814026532 blocks
Nov 29 16:07:30 Tower emhttpd: shcmd (4624): udevadm settle
Nov 29 16:07:30 Tower root: Starting diskload
Nov 29 16:07:30 Tower emhttpd: Mounting disks...
Nov 29 16:07:30 Tower emhttpd: shcmd (4628): /sbin/btrfs device scan
Nov 29 16:07:30 Tower root: Scanning for Btrfs filesystems
Nov 29 16:07:30 Tower emhttpd: shcmd (4629): mkdir -p /mnt/disk1
Nov 29 16:07:30 Tower emhttpd: shcmd (4630): mount -t xfs -o noatime,nodiratime /dev/md1 /mnt/disk1
Nov 29 16:07:30 Tower kernel: SGI XFS with ACLs, security attributes, no debug enabled
Nov 29 16:07:30 Tower kernel: XFS (md1): Mounting V5 Filesystem
Nov 29 16:07:30 Tower kernel: XFS (md1): Starting recovery (logdev: internal)
Nov 29 16:07:31 Tower kernel: XFS (md1): Ending recovery (logdev: internal)
Nov 29 16:07:31 Tower emhttpd: shcmd (4631): xfs_growfs /mnt/disk1
Nov 29 16:07:31 Tower root: meta-data=/dev/md1               isize=512    agcount=10, agsize=268435455 blks
Nov 29 16:07:31 Tower root:          =                       sectsz=512   attr=2, projid32bit=1
Nov 29 16:07:31 Tower root:          =                       crc=1        finobt=1, sparse=1, rmapbt=0
Nov 29 16:07:31 Tower root:          =                       reflink=0
Nov 29 16:07:31 Tower root: data     =                       bsize=4096   blocks=2441609203, imaxpct=5
Nov 29 16:07:31 Tower root:          =                       sunit=0      swidth=0 blks
Nov 29 16:07:31 Tower root: naming   =version 2              bsize=4096   ascii-ci=0, ftype=1
Nov 29 16:07:31 Tower root: log      =internal log           bsize=4096   blocks=521728, version=2
Nov 29 16:07:31 Tower root:          =                       sectsz=512   sunit=0 blks, lazy-count=1
Nov 29 16:07:31 Tower root: realtime =none                   extsz=4096   blocks=0, rtextents=0
Nov 29 16:07:31 Tower emhttpd: shcmd (4632): mkdir -p /mnt/disk2
Nov 29 16:07:31 Tower emhttpd: shcmd (4633): mount -t xfs -o noatime,nodiratime /dev/md2 /mnt/disk2
Nov 29 16:07:31 Tower kernel: XFS (md2): Mounting V5 Filesystem
Nov 29 16:07:31 Tower kernel: XFS (md2): Starting recovery (logdev: internal)
Nov 29 16:07:31 Tower kernel: XFS (md2): Ending recovery (logdev: internal)
Nov 29 16:07:31 Tower emhttpd: shcmd (4634): xfs_growfs /mnt/disk2
Nov 29 16:07:31 Tower root: meta-data=/dev/md2               isize=512    agcount=8, agsize=268435455 blks
Nov 29 16:07:31 Tower root:          =                       sectsz=512   attr=2, projid32bit=1
Nov 29 16:07:31 Tower root:          =                       crc=1        finobt=1, sparse=1, rmapbt=0
Nov 29 16:07:31 Tower root:          =                       reflink=0
Nov 29 16:07:31 Tower root: data     =                       bsize=4096   blocks=1953506633, imaxpct=5
Nov 29 16:07:31 Tower root:          =                       sunit=0      swidth=0 blks
Nov 29 16:07:31 Tower root: naming   =version 2              bsize=4096   ascii-ci=0, ftype=1
Nov 29 16:07:31 Tower root: log      =internal log           bsize=4096   blocks=521728, version=2
Nov 29 16:07:31 Tower root:          =                       sectsz=512   sunit=0 blks, lazy-count=1
Nov 29 16:07:31 Tower root: realtime =none                   extsz=4096   blocks=0, rtextents=0
Nov 29 16:07:31 Tower emhttpd: shcmd (4635): mkdir -p /mnt/cache
Nov 29 16:07:31 Tower emhttpd: shcmd (4636): mount -t btrfs -o noatime,nodiratime /dev/sde1 /mnt/cache
Nov 29 16:07:31 Tower kernel: BTRFS info (device sde1): disk space caching is enabled
Nov 29 16:07:31 Tower kernel: BTRFS info (device sde1): has skinny extents
Nov 29 16:07:31 Tower kernel: BTRFS info (device sde1): enabling ssd optimizations
Nov 29 16:07:31 Tower emhttpd: shcmd (4637): sync
Nov 29 16:07:31 Tower emhttpd: shcmd (4638): mkdir /mnt/user0
Nov 29 16:07:31 Tower emhttpd: shcmd (4639): /usr/local/sbin/shfs /mnt/user0 -disks 6 -o noatime,big_writes,allow_other   |& logger
Nov 29 16:07:31 Tower shfs: stderr redirected to syslog
Nov 29 16:07:31 Tower emhttpd: shcmd (4640): mkdir /mnt/user
Nov 29 16:07:31 Tower emhttpd: shcmd (4641): /usr/local/sbin/shfs /mnt/user -disks 7 2048000000 -o noatime,big_writes,allow_other  -o remember=0  |& logger
Nov 29 16:07:31 Tower shfs: stderr redirected to syslog
Nov 29 16:07:31 Tower emhttpd: shcmd (4643): /usr/local/sbin/update_cron
Nov 29 16:07:31 Tower cache_dirs: Arguments=-i appdata -i backups -i domains -i isos -i system -l off
Nov 29 16:07:31 Tower cache_dirs: Max Scan Secs=10, Min Scan Secs=1
Nov 29 16:07:31 Tower cache_dirs: Scan Type=adaptive
Nov 29 16:07:31 Tower cache_dirs: Min Scan Depth=4
Nov 29 16:07:31 Tower cache_dirs: Max Scan Depth=none
Nov 29 16:07:31 Tower cache_dirs: Use Command='find -noleaf'
Nov 29 16:07:31 Tower cache_dirs: ---------- Caching Directories ---------------
Nov 29 16:07:31 Tower cache_dirs: backups
Nov 29 16:07:31 Tower cache_dirs: domains
Nov 29 16:07:31 Tower cache_dirs: isos
Nov 29 16:07:31 Tower cache_dirs: system
Nov 29 16:07:31 Tower cache_dirs: ----------------------------------------------
Nov 29 16:07:31 Tower cache_dirs: Setting Included dirs: appdata,backups,domains,isos,system
Nov 29 16:07:31 Tower cache_dirs: Setting Excluded dirs:
Nov 29 16:07:31 Tower cache_dirs: min_disk_idle_before_restarting_scan_sec=60
Nov 29 16:07:31 Tower cache_dirs: scan_timeout_sec_idle=150
Nov 29 16:07:31 Tower cache_dirs: scan_timeout_sec_busy=30
Nov 29 16:07:31 Tower cache_dirs: scan_timeout_sec_stable=30
Nov 29 16:07:31 Tower cache_dirs: frequency_of_full_depth_scan_sec=604800
Nov 29 16:07:31 Tower cache_dirs: ERROR: included directory 'appdata' does not exist.
Nov 29 16:07:31 Tower cache_dirs: cache_dirs service rc.cachedirs: Started: '/usr/local/emhttp/plugins/dynamix.cache.dirs/scripts/cache_dirs -i "appdata" -i "backups" -i "domains" -i "isos" -i "system" -l off 2>/dev/null'
Nov 29 16:07:31 Tower root: Delaying execution of fix common problems scan for 10 minutes
Nov 29 16:07:31 Tower unassigned.devices: Mounting 'Auto Mount' Devices...
Nov 29 16:07:31 Tower unassigned.devices: Adding disk '/dev/sdd1'...
Nov 29 16:07:31 Tower unassigned.devices: Mount drive command: /sbin/mount -t xfs -o rw,noatime,nodiratime,discard '/dev/sdd1' '/mnt/disks/SPCC_Solid_State_Disk_BB04079A025300047489'
Nov 29 16:07:31 Tower kernel: XFS (sdd1): Mounting V5 Filesystem
Nov 29 16:07:31 Tower kernel: XFS (sdd1): Starting recovery (logdev: internal)
Nov 29 16:07:31 Tower kernel: XFS (sdd1): Ending recovery (logdev: internal)
Nov 29 16:07:31 Tower unassigned.devices: Successfully mounted '/dev/sdd1' on '/mnt/disks/SPCC_Solid_State_Disk_BB04079A025300047489'.
Nov 29 16:07:31 Tower unassigned.devices: Disk with serial 'SPCC_Solid_State_Disk_BB04079A025300047489', mountpoint 'SPCC_Solid_State_Disk_BB04079A025300047489' is not set as sharable and will not be shared...
Nov 29 16:07:31 Tower emhttpd: Starting services...
Nov 29 16:07:32 Tower emhttpd: shcmd (4645): /etc/rc.d/rc.samba restart
Nov 29 16:07:34 Tower root: Starting Samba:  /usr/sbin/nmbd -D
Nov 29 16:07:34 Tower root:                  /usr/sbin/smbd -D
Nov 29 16:07:34 Tower root:                  /usr/sbin/winbindd -D
Nov 29 16:07:34 Tower emhttpd: shcmd (4659): /usr/local/sbin/mount_image '/mnt/disks/SPCC_Solid_State_Disk_BB04079A025300047489/system/docker/docker.img' /var/lib/docker 20
Nov 29 16:07:34 Tower kernel: BTRFS: device fsid 7f651326-0a4a-472c-a807-87d3925acfe7 devid 1 transid 75 /dev/loop2
Nov 29 16:07:34 Tower kernel: BTRFS info (device loop2): disk space caching is enabled
Nov 29 16:07:34 Tower kernel: BTRFS info (device loop2): has skinny extents
Nov 29 16:07:34 Tower root: Resize '/var/lib/docker' of 'max'
Nov 29 16:07:34 Tower kernel: BTRFS info (device loop2): new size for /dev/loop2 is 21474836480
Nov 29 16:07:34 Tower emhttpd: shcmd (4661): /etc/rc.d/rc.docker start
Nov 29 16:07:34 Tower root: starting dockerd ...
Nov 29 16:07:34 Tower avahi-daemon[6826]: Joining mDNS multicast group on interface docker0.IPv4 with address 172.17.0.1.
Nov 29 16:07:34 Tower avahi-daemon[6826]: New relevant interface docker0.IPv4 for mDNS.
Nov 29 16:07:34 Tower avahi-daemon[6826]: Registering new address record for 172.17.0.1 on docker0.IPv4.
Nov 29 16:07:34 Tower kernel: IPv6: ADDRCONF(NETDEV_UP): docker0: link is not ready
Nov 29 16:07:35 Tower emhttpd: shcmd (4675): /usr/local/sbin/mount_image '/mnt/user/system/libvirt/libvirt.img' /etc/libvirt 1
Nov 29 16:07:35 Tower kernel: BTRFS info (device sde1): the free space cache file (34381758464) is invalid, skip it
Nov 29 16:07:35 Tower kernel: BTRFS: device fsid 9a073380-2098-42da-a0c5-7d8ddb4c1873 devid 1 transid 10 /dev/loop3
Nov 29 16:07:35 Tower kernel: BTRFS info (device loop3): disk space caching is enabled
Nov 29 16:07:35 Tower kernel: BTRFS info (device loop3): has skinny extents
Nov 29 16:07:36 Tower root: Resize '/etc/libvirt' of 'max'
Nov 29 16:07:36 Tower kernel: BTRFS info (device loop3): new size for /dev/loop3 is 1073741824
Nov 29 16:07:36 Tower emhttpd: shcmd (4677): /etc/rc.d/rc.libvirt start
Nov 29 16:07:36 Tower root: Starting virtlockd...
Nov 29 16:07:36 Tower root: Starting virtlogd...
Nov 29 16:07:36 Tower root: Starting libvirtd...
Nov 29 16:07:36 Tower kernel: tun: Universal TUN/TAP device driver, 1.6
Nov 29 16:07:36 Tower kernel: mdcmd (48): check correct
Nov 29 16:07:36 Tower kernel: md: recovery thread: check P ...
Nov 29 16:07:36 Tower kernel: md: using 1536k window, over a total of 9766436812 blocks.
Nov 29 16:07:36 Tower rc.docker: plex: started succesfully!
Nov 29 16:07:37 Tower kernel: L1TF CPU bug present and SMT on, data leak possible. See CVE-2018-3646 and https://www.kernel.org/doc/html/latest/admin-guide/hw-vuln/l1tf.html for details.
Nov 29 16:07:37 Tower kernel: virbr0: port 1(virbr0-nic) entered blocking state
Nov 29 16:07:37 Tower kernel: virbr0: port 1(virbr0-nic) entered disabled state
Nov 29 16:07:37 Tower kernel: device virbr0-nic entered promiscuous mode
Nov 29 16:07:37 Tower dhcpcd[1675]: virbr0: new hardware address: 42:ad:40:f1:4b:fe
Nov 29 16:07:37 Tower dhcpcd[1675]: virbr0: new hardware address: 52:54:00:52:c1:3c
Nov 29 16:07:37 Tower avahi-daemon[6826]: Joining mDNS multicast group on interface virbr0.IPv4 with address 192.168.122.1.
Nov 29 16:07:37 Tower avahi-daemon[6826]: New relevant interface virbr0.IPv4 for mDNS.
Nov 29 16:07:37 Tower avahi-daemon[6826]: Registering new address record for 192.168.122.1 on virbr0.IPv4.
Nov 29 16:07:37 Tower kernel: virbr0: port 1(virbr0-nic) entered blocking state
Nov 29 16:07:37 Tower kernel: virbr0: port 1(virbr0-nic) entered listening state
Nov 29 16:07:37 Tower dnsmasq[20241]: started, version 2.80 cachesize 150
Nov 29 16:07:37 Tower dnsmasq[20241]: compile time options: IPv6 GNU-getopt no-DBus i18n IDN2 DHCP DHCPv6 no-Lua TFTP no-conntrack ipset auth no-DNSSEC loop-detect inotify dumpfile
Nov 29 16:07:37 Tower dnsmasq-dhcp[20241]: DHCP, IP range 192.168.122.2 -- 192.168.122.254, lease time 1h
Nov 29 16:07:37 Tower dnsmasq-dhcp[20241]: DHCP, sockets bound exclusively to interface virbr0
Nov 29 16:07:37 Tower dnsmasq[20241]: reading /etc/resolv.conf
Nov 29 16:07:37 Tower dnsmasq[20241]: using nameserver 192.168.1.1#53
Nov 29 16:07:37 Tower dnsmasq[20241]: read /etc/hosts - 2 addresses
Nov 29 16:07:37 Tower dnsmasq[20241]: read /var/lib/libvirt/dnsmasq/default.addnhosts - 0 addresses
Nov 29 16:07:37 Tower dnsmasq-dhcp[20241]: read /var/lib/libvirt/dnsmasq/default.hostsfile
Nov 29 16:07:37 Tower kernel: virbr0: port 1(virbr0-nic) entered disabled state
Nov 29 16:07:38 Tower unassigned.devices: Mounting 'Auto Mount' Remote Shares...
Nov 29 16:07:38 Tower sudo:     root : TTY=unknown ; PWD=/ ; USER=root ; COMMAND=/bin/bash -c /usr/local/emhttp/plugins/unbalance/unbalance -port 6237
Nov 29 16:07:44 Tower emhttpd: req (2): startState=STARTED&file=&csrf_token=***************&cmdNoCheck=Cancel
Nov 29 16:07:44 Tower kernel: mdcmd (49): nocheck Cancel
Nov 29 16:07:44 Tower kernel: md: recovery thread: exit status: -4
Nov 29 16:18:37 Tower kernel: md2: running, size: 7814026532 blocks
Nov 29 16:18:37 Tower emhttpd: shcmd (31): udevadm settle
Nov 29 16:18:37 Tower emhttpd: Autostart disabled
Nov 29 16:18:37 Tower kernel: mdcmd (39): stop
Nov 29 16:18:37 Tower kernel: md1: stopping
Nov 29 16:18:37 Tower kernel: md2: stopping
Nov 29 16:18:38 Tower avahi-daemon[6837]: Server startup complete. Host name is Tower.local. Local service cookie is 4207205978.
Nov 29 16:18:39 Tower avahi-daemon[6837]: Service "Tower" (/services/ssh.service) successfully established.
Nov 29 16:18:39 Tower avahi-daemon[6837]: Service "Tower" (/services/smb.service) successfully established.
Nov 29 16:18:39 Tower avahi-daemon[6837]: Service "Tower" (/services/sftp-ssh.service) successfully established.
Nov 29 16:23:19 Tower emhttpd: req (1): startState=STOPPED&file=&optionCorrect=correct&csrf_token=****************&cmdStart=Start
Nov 29 16:23:19 Tower emhttpd: shcmd (322): /usr/local/sbin/set_ncq sdc 1
Nov 29 16:23:19 Tower kernel: mdcmd (40): set md_num_stripes 1280
Nov 29 16:23:19 Tower kernel: mdcmd (41): set md_sync_window 384
Nov 29 16:23:19 Tower kernel: mdcmd (42): set md_sync_thresh 192
Nov 29 16:23:19 Tower kernel: mdcmd (43): set md_write_method
Nov 29 16:23:19 Tower kernel: mdcmd (44): set spinup_group 0 0
Nov 29 16:23:19 Tower kernel: mdcmd (45): set spinup_group 1 0
Nov 29 16:23:19 Tower kernel: mdcmd (46): set spinup_group 2 0
Nov 29 16:23:19 Tower emhttpd: shcmd (323): echo 128 > /sys/block/sdc/queue/nr_requests
Nov 29 16:23:19 Tower emhttpd: shcmd (324): /usr/local/sbin/set_ncq sdb 1
Nov 29 16:23:19 Tower emhttpd: shcmd (325): echo 128 > /sys/block/sdb/queue/nr_requests
Nov 29 16:23:19 Tower emhttpd: shcmd (326): /usr/local/sbin/set_ncq sdf 1
Nov 29 16:23:19 Tower emhttpd: shcmd (327): echo 128 > /sys/block/sdf/queue/nr_requests
Nov 29 16:23:19 Tower kernel: mdcmd (47): start STOPPED
Nov 29 16:23:19 Tower kernel: unraid: allocating 20860K for 1280 stripes (4 disks)
Nov 29 16:23:19 Tower kernel: md1: running, size: 9766436812 blocks
Nov 29 16:23:19 Tower kernel: md2: running, size: 7814026532 blocks
Nov 29 16:23:19 Tower emhttpd: shcmd (328): udevadm settle
Nov 29 16:23:19 Tower root: Starting diskload
Nov 29 16:23:19 Tower emhttpd: Mounting disks...
Nov 29 16:23:19 Tower emhttpd: shcmd (332): /sbin/btrfs device scan
Nov 29 16:23:19 Tower root: Scanning for Btrfs filesystems
Nov 29 16:23:19 Tower emhttpd: shcmd (333): mkdir -p /mnt/disk1
Nov 29 16:23:19 Tower emhttpd: shcmd (334): mount -t xfs -o noatime,nodiratime /dev/md1 /mnt/disk1
Nov 29 16:23:19 Tower kernel: SGI XFS with ACLs, security attributes, no debug enabled
Nov 29 16:23:19 Tower kernel: XFS (md1): Mounting V5 Filesystem
Nov 29 16:23:19 Tower kernel: XFS (md1): Starting recovery (logdev: internal)
Nov 29 16:23:20 Tower kernel: XFS (md1): Ending recovery (logdev: internal)
Nov 29 16:23:20 Tower emhttpd: shcmd (335): xfs_growfs /mnt/disk1
Nov 29 16:23:20 Tower root: meta-data=/dev/md1               isize=512    agcount=10, agsize=268435455 blks
Nov 29 16:23:20 Tower root:          =                       sectsz=512   attr=2, projid32bit=1
Nov 29 16:23:20 Tower root:          =                       crc=1        finobt=1, sparse=1, rmapbt=0
Nov 29 16:23:20 Tower root:          =                       reflink=0
Nov 29 16:23:20 Tower root: data     =                       bsize=4096   blocks=2441609203, imaxpct=5
Nov 29 16:23:20 Tower root:          =                       sunit=0      swidth=0 blks
Nov 29 16:23:20 Tower root: naming   =version 2              bsize=4096   ascii-ci=0, ftype=1
Nov 29 16:23:20 Tower root: log      =internal log           bsize=4096   blocks=521728, version=2
Nov 29 16:23:20 Tower root:          =                       sectsz=512   sunit=0 blks, lazy-count=1
Nov 29 16:23:20 Tower root: realtime =none                   extsz=4096   blocks=0, rtextents=0
Nov 29 16:23:20 Tower emhttpd: shcmd (336): mkdir -p /mnt/disk2
Nov 29 16:23:20 Tower emhttpd: shcmd (337): mount -t xfs -o noatime,nodiratime /dev/md2 /mnt/disk2
Nov 29 16:23:20 Tower kernel: XFS (md2): Mounting V5 Filesystem
Nov 29 16:23:20 Tower kernel: XFS (md2): Starting recovery (logdev: internal)
Nov 29 16:23:20 Tower kernel: XFS (md2): Ending recovery (logdev: internal)
Nov 29 16:23:20 Tower emhttpd: shcmd (338): xfs_growfs /mnt/disk2
Nov 29 16:23:20 Tower root: meta-data=/dev/md2               isize=512    agcount=8, agsize=268435455 blks
Nov 29 16:23:20 Tower root:          =                       sectsz=512   attr=2, projid32bit=1
Nov 29 16:23:20 Tower root:          =                       crc=1        finobt=1, sparse=1, rmapbt=0
Nov 29 16:23:20 Tower root:          =                       reflink=0
Nov 29 16:23:20 Tower root: data     =                       bsize=4096   blocks=1953506633, imaxpct=5
Nov 29 16:23:20 Tower root:          =                       sunit=0      swidth=0 blks
Nov 29 16:23:20 Tower root: naming   =version 2              bsize=4096   ascii-ci=0, ftype=1
Nov 29 16:23:20 Tower root: log      =internal log           bsize=4096   blocks=521728, version=2
Nov 29 16:23:20 Tower root:          =                       sectsz=512   sunit=0 blks, lazy-count=1
Nov 29 16:23:20 Tower root: realtime =none                   extsz=4096   blocks=0, rtextents=0
Nov 29 16:23:20 Tower emhttpd: shcmd (339): mkdir -p /mnt/cache
Nov 29 16:23:20 Tower emhttpd: shcmd (340): mount -t btrfs -o noatime,nodiratime /dev/sde1 /mnt/cache
Nov 29 16:23:20 Tower kernel: BTRFS info (device sde1): disk space caching is enabled
Nov 29 16:23:20 Tower kernel: BTRFS info (device sde1): has skinny extents
Nov 29 16:23:20 Tower kernel: BTRFS info (device sde1): enabling ssd optimizations
Nov 29 16:23:20 Tower emhttpd: shcmd (341): sync
Nov 29 16:23:20 Tower emhttpd: shcmd (342): mkdir /mnt/user0
Nov 29 16:23:20 Tower emhttpd: shcmd (343): /usr/local/sbin/shfs /mnt/user0 -disks 6 -o noatime,big_writes,allow_other   |& logger
Nov 29 16:23:20 Tower shfs: stderr redirected to syslog
Nov 29 16:23:20 Tower emhttpd: shcmd (344): mkdir /mnt/user
Nov 29 16:23:20 Tower emhttpd: shcmd (345): /usr/local/sbin/shfs /mnt/user -disks 7 2048000000 -o noatime,big_writes,allow_other  -o remember=0  |& logger
Nov 29 16:23:20 Tower shfs: stderr redirected to syslog
Nov 29 16:23:20 Tower emhttpd: shcmd (347): /usr/local/sbin/update_cron
Nov 29 16:23:20 Tower cache_dirs: Arguments=-i appdata -i backups -i domains -i isos -i system -l off
Nov 29 16:23:20 Tower cache_dirs: Max Scan Secs=10, Min Scan Secs=1
Nov 29 16:23:20 Tower cache_dirs: Scan Type=adaptive
Nov 29 16:23:20 Tower cache_dirs: Min Scan Depth=4
Nov 29 16:23:20 Tower cache_dirs: Max Scan Depth=none
Nov 29 16:23:20 Tower cache_dirs: Use Command='find -noleaf'
Nov 29 16:23:20 Tower cache_dirs: ---------- Caching Directories ---------------
Nov 29 16:23:20 Tower cache_dirs: backups
Nov 29 16:23:20 Tower cache_dirs: domains
Nov 29 16:23:20 Tower cache_dirs: isos
Nov 29 16:23:20 Tower cache_dirs: system
Nov 29 16:23:20 Tower cache_dirs: ----------------------------------------------
Nov 29 16:23:20 Tower cache_dirs: Setting Included dirs: appdata,backups,domains,isos,system
Nov 29 16:23:20 Tower cache_dirs: Setting Excluded dirs:
Nov 29 16:23:20 Tower cache_dirs: min_disk_idle_before_restarting_scan_sec=60
Nov 29 16:23:20 Tower cache_dirs: scan_timeout_sec_idle=150
Nov 29 16:23:20 Tower cache_dirs: scan_timeout_sec_busy=30
Nov 29 16:23:20 Tower cache_dirs: scan_timeout_sec_stable=30
Nov 29 16:23:20 Tower cache_dirs: frequency_of_full_depth_scan_sec=604800
Nov 29 16:23:20 Tower cache_dirs: ERROR: included directory 'appdata' does not exist.
Nov 29 16:23:20 Tower cache_dirs: cache_dirs service rc.cachedirs: Started: '/usr/local/emhttp/plugins/dynamix.cache.dirs/scripts/cache_dirs -i "appdata" -i "backups" -i "domains" -i "isos" -i "system" -l off 2>/dev/null'
Nov 29 16:23:20 Tower root: Delaying execution of fix common problems scan for 10 minutes
Nov 29 16:23:20 Tower unassigned.devices: Mounting 'Auto Mount' Devices...
Nov 29 16:23:20 Tower unassigned.devices: Adding disk '/dev/sdd1'...
Nov 29 16:23:20 Tower unassigned.devices: Mount drive command: /sbin/mount -t xfs -o rw,noatime,nodiratime,discard '/dev/sdd1' '/mnt/disks/SPCC_Solid_State_Disk_BB04079A025300047489'
Nov 29 16:23:20 Tower kernel: XFS (sdd1): Mounting V5 Filesystem
Nov 29 16:23:21 Tower kernel: XFS (sdd1): Starting recovery (logdev: internal)
Nov 29 16:23:21 Tower kernel: XFS (sdd1): Ending recovery (logdev: internal)
Nov 29 16:23:21 Tower unassigned.devices: Successfully mounted '/dev/sdd1' on '/mnt/disks/SPCC_Solid_State_Disk_BB04079A025300047489'.
Nov 29 16:23:21 Tower unassigned.devices: Disk with serial 'SPCC_Solid_State_Disk_BB04079A025300047489', mountpoint 'SPCC_Solid_State_Disk_BB04079A025300047489' is not set as sharable and will not be shared...
Nov 29 16:23:21 Tower emhttpd: Starting services...
Nov 29 16:23:21 Tower emhttpd: shcmd (349): /etc/rc.d/rc.samba restart
Nov 29 16:23:23 Tower root: Starting Samba:  /usr/sbin/nmbd -D
Nov 29 16:23:23 Tower root:                  /usr/sbin/smbd -D
Nov 29 16:23:23 Tower root:                  /usr/sbin/winbindd -D
Nov 29 16:23:23 Tower emhttpd: shcmd (363): /usr/local/sbin/mount_image '/mnt/disks/SPCC_Solid_State_Disk_BB04079A025300047489/system/docker/docker.img' /var/lib/docker 20
Nov 29 16:23:23 Tower kernel: BTRFS: device fsid 7f651326-0a4a-472c-a807-87d3925acfe7 devid 1 transid 77 /dev/loop2
Nov 29 16:23:23 Tower kernel: BTRFS info (device loop2): disk space caching is enabled
Nov 29 16:23:23 Tower kernel: BTRFS info (device loop2): has skinny extents
Nov 29 16:23:23 Tower root: Resize '/var/lib/docker' of 'max'
Nov 29 16:23:23 Tower kernel: BTRFS info (device loop2): new size for /dev/loop2 is 21474836480
Nov 29 16:23:23 Tower emhttpd: shcmd (365): /etc/rc.d/rc.docker start
Nov 29 16:23:23 Tower root: starting dockerd ...
Nov 29 16:23:23 Tower avahi-daemon[6837]: Joining mDNS multicast group on interface docker0.IPv4 with address 172.17.0.1.
Nov 29 16:23:23 Tower avahi-daemon[6837]: New relevant interface docker0.IPv4 for mDNS.
Nov 29 16:23:23 Tower avahi-daemon[6837]: Registering new address record for 172.17.0.1 on docker0.IPv4.
Nov 29 16:23:23 Tower kernel: IPv6: ADDRCONF(NETDEV_UP): docker0: link is not ready
Nov 29 16:23:25 Tower emhttpd: shcmd (379): /usr/local/sbin/mount_image '/mnt/user/system/libvirt/libvirt.img' /etc/libvirt 1
Nov 29 16:23:25 Tower kernel: BTRFS: device fsid 9a073380-2098-42da-a0c5-7d8ddb4c1873 devid 1 transid 11 /dev/loop3
Nov 29 16:23:25 Tower kernel: BTRFS info (device loop3): disk space caching is enabled
Nov 29 16:23:25 Tower kernel: BTRFS info (device loop3): has skinny extents
Nov 29 16:23:25 Tower root: Resize '/etc/libvirt' of 'max'
Nov 29 16:23:25 Tower kernel: BTRFS info (device loop3): new size for /dev/loop3 is 1073741824
Nov 29 16:23:25 Tower emhttpd: shcmd (381): /etc/rc.d/rc.libvirt start
Nov 29 16:23:25 Tower root: Starting virtlockd...
Nov 29 16:23:25 Tower root: Starting virtlogd...
Nov 29 16:23:25 Tower root: Starting libvirtd...
Nov 29 16:23:25 Tower kernel: tun: Universal TUN/TAP device driver, 1.6
Nov 29 16:23:25 Tower kernel: mdcmd (48): check correct
Nov 29 16:23:25 Tower kernel: md: recovery thread: check P ...
Nov 29 16:23:25 Tower kernel: md: using 1536k window, over a total of 9766436812 blocks.
Nov 29 16:23:25 Tower rc.docker: plex: started succesfully!
Nov 29 16:23:26 Tower kernel: L1TF CPU bug present and SMT on, data leak possible. See CVE-2018-3646 and https://www.kernel.org/doc/html/latest/admin-guide/hw-vuln/l1tf.html for details.
Nov 29 16:23:26 Tower kernel: virbr0: port 1(virbr0-nic) entered blocking state
Nov 29 16:23:26 Tower kernel: virbr0: port 1(virbr0-nic) entered disabled state
Nov 29 16:23:26 Tower kernel: device virbr0-nic entered promiscuous mode
Nov 29 16:23:26 Tower dhcpcd[1678]: virbr0: new hardware address: e6:03:a6:fe:f6:bf
Nov 29 16:23:26 Tower dhcpcd[1678]: virbr0: new hardware address: 52:54:00:52:c1:3c
Nov 29 16:23:26 Tower avahi-daemon[6837]: Joining mDNS multicast group on interface virbr0.IPv4 with address 192.168.122.1.
Nov 29 16:23:26 Tower avahi-daemon[6837]: New relevant interface virbr0.IPv4 for mDNS.
Nov 29 16:23:26 Tower avahi-daemon[6837]: Registering new address record for 192.168.122.1 on virbr0.IPv4.
Nov 29 16:23:26 Tower kernel: virbr0: port 1(virbr0-nic) entered blocking state
Nov 29 16:23:26 Tower kernel: virbr0: port 1(virbr0-nic) entered listening state
Nov 29 16:23:26 Tower dnsmasq[16289]: started, version 2.80 cachesize 150
Nov 29 16:23:26 Tower dnsmasq[16289]: compile time options: IPv6 GNU-getopt no-DBus i18n IDN2 DHCP DHCPv6 no-Lua TFTP no-conntrack ipset auth no-DNSSEC loop-detect inotify dumpfile
Nov 29 16:23:26 Tower dnsmasq-dhcp[16289]: DHCP, IP range 192.168.122.2 -- 192.168.122.254, lease time 1h
Nov 29 16:23:26 Tower dnsmasq-dhcp[16289]: DHCP, sockets bound exclusively to interface virbr0
Nov 29 16:23:26 Tower dnsmasq[16289]: reading /etc/resolv.conf
Nov 29 16:23:26 Tower dnsmasq[16289]: using nameserver 192.168.1.1#53
Nov 29 16:23:26 Tower dnsmasq[16289]: read /etc/hosts - 2 addresses
Nov 29 16:23:26 Tower dnsmasq[16289]: read /var/lib/libvirt/dnsmasq/default.addnhosts - 0 addresses
Nov 29 16:23:26 Tower dnsmasq-dhcp[16289]: read /var/lib/libvirt/dnsmasq/default.hostsfile
Nov 29 16:23:26 Tower kernel: virbr0: port 1(virbr0-nic) entered disabled state
Nov 29 16:23:27 Tower unassigned.devices: Mounting 'Auto Mount' Remote Shares...
Nov 29 16:23:27 Tower sudo:     root : TTY=unknown ; PWD=/ ; USER=root ; COMMAND=/bin/bash -c /usr/local/emhttp/plugins/unbalance/unbalance -port 6237
Nov 29 16:23:32 Tower emhttpd: req (2): startState=STARTED&file=&csrf_token=****************&cmdNoCheck=Cancel
Nov 29 16:23:32 Tower kernel: mdcmd (49): nocheck Cancel
Nov 29 16:23:32 Tower kernel: md: recovery thread: exit status: -4

I canceled the parity check each time after restarting the array because it takes ~20 hrs to run. 
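
As a rough sanity check on that ~20 hr figure, the arithmetic can be sketched in Python. Two assumptions go into it: that the md driver's "blocks" are 1 KiB each (not stated in the log, though it agrees with the XFS geometry lines for /dev/md1 above), and that the check runs at a steady rate for the whole 20 hours.

```python
# Rough estimate of the average parity-check speed implied by a ~20 hour run.
# Assumption: the md driver reports sizes in 1 KiB blocks.
KIB = 1024

md1_blocks = 9766436812                    # "md1: running, size: 9766436812 blocks"
xfs_blocks, xfs_bsize = 2441609203, 4096   # "bsize=4096 blocks=2441609203" for /dev/md1

size_bytes = md1_blocks * KIB
# Cross-check the 1 KiB assumption against the XFS block count and block size.
same_size = size_bytes == xfs_blocks * xfs_bsize

check_hours = 20                           # approximate duration quoted in the post
avg_mb_per_s = size_bytes / (check_hours * 3600) / 1e6

print(f"parity size: {size_bytes / 1e12:.2f} TB (matches XFS geometry: {same_size})")
print(f"average speed: {avg_mb_per_s:.0f} MB/s")
```

Under those assumptions the long runtime is mostly a consequence of the 10 TB parity size rather than a sign of a problem in itself.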

tower-diagnostics-20191129-2015.zip

23 minutes ago, Gumdomike said:

Here are the syslogs from before and after the reboots that I got by tailing through a putty terminal. Unfortunately, the reboot appears to wipe everything that might point to the problem

On mobile now so can't look at Diagnostics. Go to Settings-Syslog Server and you can configure where to save syslog. 


My apologies, I did not know there was a difference. I have attached new diagnostics with the array running. I also had the syslog mirrored to the flash and ran another transfer that failed around the Nov 30 00:30:00 mark; however all I see in the log is my canceling the parity check at Nov 30 00:23:14 and then logging in again after the crash at Nov 30 00:32:06.
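
For anyone trying to pin down the crash window the same way, here is a minimal Python sketch that reports large time gaps between consecutive syslog lines. The function name `find_gaps`, the 5-minute threshold, and the year are all assumptions for illustration (classic syslog timestamps carry no year; 2019 is taken from the diagnostics file names). A gap only narrows down when the reboot happened; an idle period with nothing logged produces a gap too.

```python
from datetime import datetime, timedelta


def find_gaps(lines, min_gap=timedelta(minutes=5), year=2019):
    """Return (previous, next) timestamp pairs that are at least min_gap apart.

    Hypothetical helper: expects the classic syslog prefix, e.g.
    "Nov 29 16:07:44", in the first 15 characters of each line.
    """
    gaps = []
    prev = None
    for line in lines:
        try:
            stamp = datetime.strptime(f"{year} {line[:15]}", "%Y %b %d %H:%M:%S")
        except ValueError:
            continue  # skip lines that do not start with a timestamp
        if prev is not None and stamp - prev >= min_gap:
            gaps.append((prev, stamp))
        prev = stamp
    return gaps


# Three lines copied from the log earlier in this thread, around the first reboot.
sample = [
    "Nov 29 16:07:44 Tower kernel: md: recovery thread: exit status: -4",
    "Nov 29 16:18:37 Tower kernel: md2: running, size: 7814026532 blocks",
    "Nov 29 16:18:37 Tower emhttpd: shcmd (31): udevadm settle",
]

for before, after in find_gaps(sample):
    print(f"gap of {after - before}: {before:%b %d %H:%M:%S} -> {after:%b %d %H:%M:%S}")
```

To run it against a saved log, pass an open file object instead of `sample`; where that file lives depends on how the syslog mirroring is configured.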

tower-diagnostics-20191130-0436.zip syslog


That is my Movies share. I had originally set it up to use the cache and then I read somewhere to skip the cache for the original ingest of data. I also thought that the msata cache drive in a sata enclosure might have been the issue; so by skipping the cache write, I could maybe bypass the issues. That was not the case. 
