armbrust

Members
  • Posts

    56
  • Joined

  • Last visited

Converted

  • Gender
    Undisclosed


  1. Thanks Warrentheo for the ideas. Removing xvga='yes' prevents it from crashing, but also prevents the VM from booting in either 6.7.2 or 6.6.1. I did dump the ROM from this card, but I'll try to do it again. The problem is that, according to the Space Invaders video, you need to run it in a VM first. I do have another GPU in the system; perhaps I can somehow make it the primary. Creating a new VM using the same image has the same results as the existing VM.
  2. A couple of other things I tried without success:
     - Bind the GPU to the vfio driver: append vfio-pci.ids=10de:1d01,10de:0fb8
     - Append pcie_aspm=off video=vesafb:off,efifb:off in syslinux config
  3. Thanks for the comment, I appreciate the time. Agreed that this is old hardware, but passthrough had been working well for me prior to 6.7.x. Something has changed from 6.x.x -> 6.7.x. Unfortunately I already have the latest BIOS. I'm hoping there is a workaround for 6.7.x, as I don't want to get left behind in versions and don't want to buy new hardware.
  4. Moving from 6.6.7 to 6.7.0, 6.7.1, or 6.7.2 gives the same issue in every case. Everything works correctly except starting a VM that has a GPU passed through. When starting this VM, the system crashes.

     I've attached diagnostics from both versions (6.6.7 and 6.7.2), captured just before starting the VM. Nothing was changed in the configuration between runs. There is another VM running fine in both cases; it has nothing passed through. Also attached is the XML config of the problem VM.

     I tailed the syslog in both versions when starting the VM, and they look the same. In both there is some sort of DMA fault, but in 6.6.7 it works fine.

     This is a tail of the syslog when starting the problem VM in 6.7.2:

     Jul 5 09:33:23 Tower kernel: vfio-pci 0000:0a:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=io+mem:owns=io+mem
     Jul 5 09:33:23 Tower kernel: br0: port 3(vnet2) entered blocking state
     Jul 5 09:33:23 Tower kernel: br0: port 3(vnet2) entered disabled state
     Jul 5 09:33:23 Tower kernel: device vnet2 entered promiscuous mode
     Jul 5 09:33:23 Tower kernel: br0: port 3(vnet2) entered blocking state
     Jul 5 09:33:23 Tower kernel: br0: port 3(vnet2) entered forwarding state
     Jul 5 09:33:24 Tower avahi-daemon[7313]: Joining mDNS multicast group on interface vnet2.IPv6 with address fe80::fc54:ff:fe13:8859.
     Jul 5 09:33:24 Tower avahi-daemon[7313]: New relevant interface vnet2.IPv6 for mDNS.
     Jul 5 09:33:24 Tower avahi-daemon[7313]: Registering new address record for fe80::fc54:ff:fe13:8859 on vnet2.*.
     Jul 5 09:33:24 Tower kernel: vfio_ecap_init: 0000:0a:00.0 hiding ecap 0x19@0x900
     Jul 5 09:33:25 Tower kernel: vfio-pci 0000:00:1a.7: enabling device (0000 -> 0002)
     Jul 5 09:33:25 Tower kernel: vfio_cap_init: 0000:00:1a.7 hiding cap 0xa
     Jul 5 09:33:28 Tower kernel: vfio-pci 0000:0a:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=io+mem:owns=io+mem
     Jul 5 09:33:30 Tower kernel: DMAR: DRHD: handling fault status reg 2
     Jul 5 09:33:30 Tower kernel: DMAR: [DMA Read] Request device [00:1a.7] fault addr eb000 [fault reason 06] PTE Read access is not set
     Jul 5 09:33:30 Tower nginx: 2019/07/05 09:33:30 [crit] 7479#7479: *2093 connect() to unix:/var/tmp/Letsencrypt.sock failed (2: No such file or directory) while connecting to upstream, client: 192.168.1.101, server: , request: "GET /dockerterminal/Letsencrypt/ws HTTP/1.1", upstream: "http://unix:/var/tmp/Letsencrypt.sock:/ws", host: "tower"

     Here is the tail of the syslog on startup of the same VM in 6.6.7 for comparison:

     Jul 5 09:51:28 Tower kernel: vfio-pci 0000:0a:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=io+mem:owns=io+mem
     Jul 5 09:51:28 Tower kernel: br0: port 3(vnet2) entered blocking state
     Jul 5 09:51:28 Tower kernel: br0: port 3(vnet2) entered disabled state
     Jul 5 09:51:28 Tower kernel: device vnet2 entered promiscuous mode
     Jul 5 09:51:28 Tower kernel: br0: port 3(vnet2) entered blocking state
     Jul 5 09:51:28 Tower kernel: br0: port 3(vnet2) entered forwarding state
     Jul 5 09:51:29 Tower avahi-daemon[6629]: Joining mDNS multicast group on interface vnet2.IPv6 with address fe80::fc54:ff:fe13:8859.
     Jul 5 09:51:29 Tower avahi-daemon[6629]: New relevant interface vnet2.IPv6 for mDNS.
     Jul 5 09:51:29 Tower avahi-daemon[6629]: Registering new address record for fe80::fc54:ff:fe13:8859 on vnet2.*.
     Jul 5 09:51:29 Tower kernel: vfio_ecap_init: 0000:0a:00.0 hiding ecap 0x19@0x900
     Jul 5 09:51:30 Tower kernel: vfio-pci 0000:00:1a.7: enabling device (0000 -> 0002)
     Jul 5 09:51:30 Tower kernel: vfio_cap_init: 0000:00:1a.7 hiding cap 0xa
     Jul 5 09:51:32 Tower kernel: vfio-pci 0000:0a:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=io+mem:owns=io+mem
     Jul 5 09:51:34 Tower kernel: DMAR: DRHD: handling fault status reg 2
     Jul 5 09:51:34 Tower kernel: DMAR: [DMA Read] Request device [00:1a.7] fault addr eb000 [fault reason 06] PTE Read access is not set

     I've tried with and without "iommu=pt" in the syslinux config. Anybody have any ideas? Thanks

     tower-diagnostics-6.7.2-20190705-1324.zip
     tower-diagnostics-6.6.7-20190705-0918.zip
     Problem VM Config.xml
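The comparison described in that post (same DMA fault in both versions) can be checked mechanically rather than by eye. Below is a minimal sketch, assuming the two syslog tails are available as plain strings; the function name `dmar_faults` and the variable names are hypothetical, and the regular expression only covers the fault line format that appears in the logs above.

```python
import re

# Matches kernel lines of the form seen in the posts, e.g.:
# "DMAR: [DMA Read] Request device [00:1a.7] fault addr eb000 [fault reason 06] ..."
DMAR_FAULT = re.compile(
    r"DMAR: \[(?P<op>DMA (?:Read|Write))\] Request device \[(?P<device>[0-9a-f:.]+)\] "
    r"fault addr (?P<addr>[0-9a-f]+) \[fault reason (?P<reason>\d+)\]"
)


def dmar_faults(log_text):
    """Return the set of (op, device, addr, reason) tuples found in log_text."""
    return {
        (m["op"], m["device"], m["addr"], m["reason"])
        for m in DMAR_FAULT.finditer(log_text)
    }


# Fault lines copied from the two syslog tails in the post.
log_672 = (
    "Jul 5 09:33:30 Tower kernel: DMAR: [DMA Read] Request device [00:1a.7] "
    "fault addr eb000 [fault reason 06] PTE Read access is not set"
)
log_667 = (
    "Jul 5 09:51:34 Tower kernel: DMAR: [DMA Read] Request device [00:1a.7] "
    "fault addr eb000 [fault reason 06] PTE Read access is not set"
)

faults_672 = dmar_faults(log_672)
faults_667 = dmar_faults(log_667)

# Faults that appear only after the upgrade (empty if both logs match).
only_in_672 = faults_672 - faults_667

print(sorted(faults_672))
print("only in 6.7.2:", sorted(only_in_672))
```

Because the pattern ignores timestamps, it works on the flattened one-line logs as well as on one-entry-per-line syslog files. An empty difference here only says the logged faults match; it does not explain why one version crashes and the other does not.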
  5. Thanks for the reply, unfortunately no luck. This is the syslog at the time of VM start.

     Jun 25 21:44:17 Tower kernel: vfio-pci 0000:0a:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=io+mem:owns=io+mem
     Jun 25 21:44:17 Tower kernel: br0: port 3(vnet2) entered blocking state
     Jun 25 21:44:17 Tower kernel: br0: port 3(vnet2) entered disabled state
     Jun 25 21:44:17 Tower kernel: device vnet2 entered promiscuous mode
     Jun 25 21:44:17 Tower kernel: br0: port 3(vnet2) entered blocking state
     Jun 25 21:44:17 Tower kernel: br0: port 3(vnet2) entered forwarding state
     Jun 25 21:44:18 Tower kernel: vfio_ecap_init: 0000:0a:00.0 hiding ecap 0x19@0x900
     Jun 25 21:44:18 Tower avahi-daemon[7180]: Joining mDNS multicast group on interface vnet2.IPv6 with address fe80::fc54:ff:fe13:8859.
     Jun 25 21:44:18 Tower avahi-daemon[7180]: New relevant interface vnet2.IPv6 for mDNS.
     Jun 25 21:44:18 Tower avahi-daemon[7180]: Registering new address record for fe80::fc54:ff:fe13:8859 on vnet2.*.
     Jun 25 21:44:19 Tower kernel: vfio-pci 0000:00:1a.7: enabling device (0000 -> 0002)
     Jun 25 21:44:19 Tower kernel: vfio_cap_init: 0000:00:1a.7 hiding cap 0xa
     Jun 25 21:44:22 Tower kernel: vfio-pci 0000:0a:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=io+mem:owns=io+mem
     Jun 25 21:44:24 Tower kernel: DMAR: DRHD: handling fault status reg 2
     Jun 25 21:44:24 Tower kernel: DMAR: [DMA Read] Request device [00:1a.7] fault addr eb000 [fault reason 06] PTE Read access is not set
  6. Hi, I tried upgrading from 6.6.7 and experienced a hard crash when the VMs were started. For a few seconds the GUI was available, but then the system rebooted and came back with the VM manager disabled. When I enabled the VM manager, the system crashed (no web GUI, and it didn't respond to pings) and didn't come back. I rebooted into safe mode, captured a diagnostic file, and reverted to 6.6.7. Back on 6.6.7, all is well.

     The same thing happened when trying to upgrade to 6.7.0 from 6.6.7. The attached diagnostic file was captured in 6.7.1 safe mode. Thanks for any advice. I'm hoping it's not that my hardware is too out of date!

     Edit: more info: It seems to be a problem with one particular VM, which has passthrough of a GPU and a USB controller. When starting this VM, the system dies. Here are the lines recorded in the syslog when I started the VM and the system froze.

     Jun 25 14:21:40 Tower kernel: vfio-pci 0000:0a:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=io+mem:owns=io+mem
     Jun 25 14:21:40 Tower kernel: br0: port 2(vnet0) entered blocking state
     Jun 25 14:21:40 Tower kernel: br0: port 2(vnet0) entered disabled state
     Jun 25 14:21:40 Tower kernel: device vnet0 entered promiscuous mode
     Jun 25 14:21:40 Tower kernel: br0: port 2(vnet0) entered blocking state
     Jun 25 14:21:40 Tower kernel: br0: port 2(vnet0) entered forwarding state
     Jun 25 14:21:42 Tower kernel: vfio_ecap_init: 0000:0a:00.0 hiding ecap 0x19@0x900
     Jun 25 14:21:42 Tower avahi-daemon[7190]: Joining mDNS multicast group on interface vnet0.IPv6 with address fe80::fc54:ff:feb3:33ee.
     Jun 25 14:21:42 Tower avahi-daemon[7190]: New relevant interface vnet0.IPv6 for mDNS.
     Jun 25 14:21:42 Tower avahi-daemon[7190]: Registering new address record for fe80::fc54:ff:feb3:33ee on vnet0.*.
     Jun 25 14:21:42 Tower kernel: vfio-pci 0000:00:1a.7: enabling device (0000 -> 0002)
     Jun 25 14:21:42 Tower kernel: vfio_cap_init: 0000:00:1a.7 hiding cap 0xa
     Jun 25 14:21:45 Tower kernel: vfio-pci 0000:0a:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=io+mem:owns=io+mem
     Jun 25 14:21:47 Tower kernel: DMAR: DRHD: handling fault status reg 2
     Jun 25 14:21:47 Tower kernel: DMAR: [DMA Read] Request device [00:1a.7] fault addr eb000 [fault reason 06] PTE Read access is not set

     tower-diagnostics-20190625-1740.zip
  7. Thanks, figured it out. I had to manually create the "docker" subfolder on the cache drive. The tab shows up now, so I assume I'm good to go. Thanks again
  8. Thanks for the reply. Yes, I do have a cache drive. Should I see a docker tab in the ui?
  9. Has something changed since the guide was written? I don't have a main Docker tab, only a Docker option under Settings. I changed Enable to Yes and hit Apply, but nothing seems to happen.
  10. Update: After the last rebuild, the drive *seems* to be stable - hasn't dropped out yet. These were the changes I made:
      - Put the tunables back to default
      - Added acpi=off libata.force=noncq to syslinux.cfg
  11. unRAID 4.7. I have a Seagate BARRACUDA GREEN (Model Number: ST2000DL003) drive that has started acting up. It is my parity drive. What happens is that it just disappears from the system and parity is invalidated. I can restart the system (hard power off) and it returns. In the SMART test everything appears normal. I've also tried different power and SATA cables and SATA ports.

      The last time this happened I restarted and did a parity rebuild. It finished fine, but shortly afterwards the drive dropped off. Here is a section of the system log after the parity rebuild finished. Any ideas?

      Dec 15 15:08:19 Tower kernel: md: sync done. time=46700sec rate=41831K/sec
      Dec 15 15:08:19 Tower kernel: md: recovery thread sync completion status: 0
      Dec 15 15:23:34 Tower in.telnetd[14529]: connect from 192.168.1.2 (192.168.1.2)
      Dec 15 15:23:40 Tower login[14530]: ROOT LOGIN on `pts/1' from `DD-WRT'
      Dec 15 15:26:29 Tower kernel: ata6.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen
      Dec 15 15:26:29 Tower kernel: ata6.00: failed command: SMART
      Dec 15 15:26:29 Tower kernel: ata6.00: cmd b0/da:00:00:4f:c2/00:00:00:00:00/00 tag 0
      Dec 15 15:26:29 Tower kernel: res 40/00:00:46:47:00/00:00:00:00:00/e0 Emask 0x4 (timeout)
      Dec 15 15:26:29 Tower kernel: ata6.00: status: { DRDY }
      Dec 15 15:26:29 Tower kernel: ata6: hard resetting link
      Dec 15 15:26:35 Tower kernel: ata6: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
      Dec 15 15:26:35 Tower kernel: ata6.00: link online but device misclassifed
      Dec 15 15:26:40 Tower kernel: ata6.00: qc timeout (cmd 0xec)
      Dec 15 15:26:40 Tower kernel: ata6.00: failed to IDENTIFY (I/O error, err_mask=0x4)
      Dec 15 15:26:40 Tower kernel: ata6.00: revalidation failed (errno=-5)
      Dec 15 15:26:40 Tower kernel: ata6: hard resetting link
      Dec 15 15:26:45 Tower kernel: ata6: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
      Dec 15 15:26:45 Tower kernel: ata6.00: link online but device misclassifed
      Dec 15 15:26:47 Tower cache_dirs: ==============================================
      Dec 15 15:26:47 Tower cache_dirs: command-args=
      Dec 15 15:26:47 Tower cache_dirs: vfs_cache_pressure=10
      Dec 15 15:26:47 Tower cache_dirs: max_seconds=10, min_seconds=1
      Dec 15 15:26:47 Tower cache_dirs: max_depth=9999
      Dec 15 15:26:47 Tower cache_dirs: command=find -noleaf
      Dec 15 15:26:47 Tower cache_dirs: version=1.6.4
      Dec 15 15:26:47 Tower cache_dirs: ---------- caching directories ---------------
      Dec 15 15:26:47 Tower cache_dirs: Backup-ReadOnly
      Dec 15 15:26:47 Tower cache_dirs: Backups
      Dec 15 15:26:47 Tower cache_dirs: Downloads
      Dec 15 15:26:47 Tower cache_dirs: Media
      Dec 15 15:26:47 Tower cache_dirs: Others
      Dec 15 15:26:47 Tower cache_dirs: Portable
      Dec 15 15:26:47 Tower cache_dirs: Sage
      Dec 15 15:26:47 Tower cache_dirs: ftp
      Dec 15 15:26:47 Tower cache_dirs: mysql
      Dec 15 15:26:47 Tower cache_dirs: sdf1
      Dec 15 15:26:47 Tower cache_dirs: torrents
      Dec 15 15:26:47 Tower cache_dirs: ----------------------------------------------
      Dec 15 15:26:47 Tower cache_dirs: cache_dirs process ID 14961 started, To terminate it, type: cache_dirs -q
      Dec 15 15:26:55 Tower kernel: ata6.00: qc timeout (cmd 0xec)
      Dec 15 15:26:55 Tower kernel: ata6.00: failed to IDENTIFY (I/O error, err_mask=0x4)
      Dec 15 15:26:55 Tower kernel: ata6.00: revalidation failed (errno=-5)
      Dec 15 15:26:55 Tower kernel: ata6: limiting SATA link speed to 1.5 Gbps
      Dec 15 15:26:55 Tower kernel: ata6: hard resetting link
      Dec 15 15:27:01 Tower kernel: ata6: SATA link up 1.5 Gbps (SStatus 113 SControl 310)
      Dec 15 15:27:01 Tower kernel: ata6.00: link online but device misclassifed
      Dec 15 15:27:31 Tower kernel: ata6.00: qc timeout (cmd 0xec)
      Dec 15 15:27:31 Tower kernel: ata6.00: failed to IDENTIFY (I/O error, err_mask=0x4)
      Dec 15 15:27:31 Tower kernel: ata6.00: revalidation failed (errno=-5)
      Dec 15 15:27:31 Tower kernel: ata6.00: disabled
      Dec 15 15:27:31 Tower kernel: ata6: exception Emask 0x40 SAct 0x0 SErr 0x800 action 0x6 frozen t4
      Dec 15 15:27:31 Tower kernel: ata6: SError: { HostInt }
      Dec 15 15:27:31 Tower kernel: ata6: hard resetting link
      Dec 15 15:27:36 Tower kernel: ata6: SATA link up 1.5 Gbps (SStatus 113 SControl 310)
      Dec 15 15:27:36 Tower kernel: ata6.00: link online but device misclassifed
      Dec 15 15:27:36 Tower kernel: ata6: EH complete
      Dec 15 15:27:36 Tower kernel: sd 6:0:0:0: [sde] Unhandled error code
      Dec 15 15:27:36 Tower kernel: sd 6:0:0:0: [sde] Result: hostbyte=0x04 driverbyte=0x00
      Dec 15 15:27:36 Tower kernel: sd 6:0:0:0: [sde] CDB: cdb[0]=0x2a: 2a 00 59 e2 33 6f 00 00 08 00
      Dec 15 15:27:36 Tower kernel: end_request: I/O error, dev sde, sector 1507996527
      Dec 15 15:27:36 Tower kernel: md: disk0 write error
      Dec 15 15:27:36 Tower kernel: handle_stripe write error: 1507996464/0, count: 1
      Dec 15 15:27:36 Tower kernel: md: recovery thread woken up ...
      Dec 15 15:27:36 Tower kernel: md: recovery thread has nothing to resync
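The sequence in that log (link resets, a speed drop from 3.0 to 1.5 Gbps, then the device being disabled) is easy to lose among the cache_dirs lines. Here is a minimal sketch that pulls out just those events, assuming the log is available as a plain string; `summarize_ata` and the two patterns are hypothetical helpers written against the message formats shown above, not an existing tool.

```python
import re

# "ata6: SATA link up 3.0 Gbps (SStatus 123 SControl 300)"
LINK_UP = re.compile(r"(ata\d+): SATA link up ([\d.]+) Gbps")
# "ata6.00: disabled"
DISABLED = re.compile(r"(ata\d+\.\d+): disabled")


def summarize_ata(log_text):
    """Return (negotiated speeds per port in log order, list of disabled devices)."""
    speeds = {}
    for port, speed in LINK_UP.findall(log_text):
        speeds.setdefault(port, []).append(float(speed))
    disabled = DISABLED.findall(log_text)
    return speeds, disabled


# Three lines copied from the log in the post.
sample = (
    "Dec 15 15:26:35 Tower kernel: ata6: SATA link up 3.0 Gbps (SStatus 123 SControl 300) "
    "Dec 15 15:27:01 Tower kernel: ata6: SATA link up 1.5 Gbps (SStatus 113 SControl 310) "
    "Dec 15 15:27:31 Tower kernel: ata6.00: disabled"
)

speeds, disabled = summarize_ata(sample)
print(speeds)
print(disabled)
```

A falling speed list followed by a disabled device matches what the post describes: the kernel retried at a lower link speed before giving up on the drive.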
  12. Thanks for taking the time to comment. Will do. Now, buy a replacement drive, or move stuff off temporarily. Pondering...
  13. I have the same situation/question, and I'm guessing the same answer: RMA. I did a short test on the drive. Here is the smartctl output after the short test. I'm running a long test now. Any comments appreciated. Thanks

      SMART Attributes Data Structure revision number: 16
      Vendor Specific SMART Attributes with Thresholds:
      ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
        1 Raw_Read_Error_Rate     0x002f   194   194   051    Pre-fail  Always       -       12786
        3 Spin_Up_Time            0x0027   186   163   021    Pre-fail  Always       -       5675
        4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       836
        5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
        7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
        9 Power_On_Hours          0x0032   086   086   000    Old_age   Always       -       10705
       10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
       11 Calibration_Retry_Count 0x0032   100   253   000    Old_age   Always       -       0
       12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       21
      192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       12
      193 Load_Cycle_Count        0x0032   171   171   000    Old_age   Always       -       88272
      194 Temperature_Celsius     0x0022   115   107   000    Old_age   Always       -       35
      196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
      197 Current_Pending_Sector  0x0032   200   197   000    Old_age   Always       -       180
      198 Offline_Uncorrectable   0x0030   200   197   000    Old_age   Offline      -       14
      199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
      200 Multi_Zone_Error_Rate   0x0008   200   195   000    Old_age   Offline      -       36

      SMART Error Log Version: 1
      No Errors Logged

      SMART Self-test log structure revision number 1
      Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
      # 1  Short offline       Completed: read failure       70%     10704         2928410663

      SMART Selective self-test log data structure revision number 1
       SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
          1        0        0  Not_testing
          2        0        0  Not_testing
          3        0        0  Not_testing
          4        0        0  Not_testing
          5        0        0  Not_testing
      Selective self-test flags (0x0):
        After scanning selected spans, do NOT read-scan remainder of disk.
      If Selective self-test is pending on power-up, resume after 0 minute delay.
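For readers scanning that smartctl dump, the rows that usually matter for an RMA decision are the sector-health counters. The following is a minimal sketch that extracts them from pasted attribute text; the watch list (`WATCH`), the helper name `flagged`, and the "raw value above zero" rule are assumptions chosen for illustration, not a vendor threshold.

```python
import re

# One attribute row: ID, name, flag, value, worst, thresh, type, updated,
# when_failed, raw value. \s+ tolerates both aligned and flattened output.
ROW = re.compile(
    r"(?P<id>\d+)\s+(?P<name>[A-Za-z][\w-]*)\s+0x[0-9a-f]{4}\s+"
    r"(?P<value>\d+)\s+(?P<worst>\d+)\s+(?P<thresh>\d+)\s+"
    r"(?:Pre-fail|Old_age)\s+(?:Always|Offline)\s+\S+\s+(?P<raw>\d+)"
)

# Assumed watch list: counters where any nonzero raw value deserves a look.
WATCH = {"Reallocated_Sector_Ct", "Current_Pending_Sector", "Offline_Uncorrectable"}


def flagged(smart_text):
    """Return {attribute_name: raw_value} for watched attributes with raw > 0."""
    return {
        m["name"]: int(m["raw"])
        for m in ROW.finditer(smart_text)
        if m["name"] in WATCH and int(m["raw"]) > 0
    }


# Rows copied from the smartctl output in the post.
sample = (
    "5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0 "
    "197 Current_Pending_Sector 0x0032 200 197 000 Old_age Always - 180 "
    "198 Offline_Uncorrectable 0x0030 200 197 000 Old_age Offline - 14"
)

result = flagged(sample)
print(result)
```

On the values from the post this reports the 180 pending sectors and 14 offline-uncorrectable sectors, which lines up with the read failure recorded by the short self-test.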
  14. armbrust

    Tablet...

    2nd the Transformer. It has a micro SD slot and plays almost everything. Use Plex and a Plex server and you can have access to all your online media. Much better deal than the iPad, IMO.