RysXr200

Community Answers

  1. It's been over a week and I haven't had a recurrence of the issue. Hopefully that remains the case. In addition to applying a better thermal paste to the heat sink, I bent the bracket on the card so that it holds in the slot better.
  2. Actually, I am usually not stressing the board when this happens. Is it possible it's just a bad board?
  3. Well, it happened again. This time I removed the heat sink, tried to remove as much of that cement-like thermal paste as possible, and added some Arctic Silver 5 to the chip before reinstalling the heat sink. Hopefully that is the issue, so we will see if that does the trick. If not, I might have to add a fan to it, or it's just EOL and I need to return it. Is there any way to monitor the temperature of the card from Unraid?
  4. Funny you should mention that. I am not quite ready to say definitively that this was the issue, but the screw holes in my case and on the card don't line up very well. When I forced the bracket over to line up with the hole in the case, I believe it pulled part of the card out of the PCIe slot, and after a short time the card stopped making contact with the slot. So right now I just have it sitting in there unsecured, but I should probably look at modifying the bracket so I can secure it, in case the tension on the slot isn't enough to keep it from coming out.
  5. It's almost like the HBA card drops out or something, because no disk that is running on that card shows up in the diagnostics, and when I stop the array they are nowhere to be found.
  6. I have been plagued by repeated disk errors for quite a while, but lately it has gotten quite bad. Unraid has disabled disks because of this, forcing me to do a rebuild to re-enable them. I have replaced my case so I can connect directly to the drives, bought an LSI 9207-8i HBA with the latest IT firmware, and replaced the SAS cables, but nothing has seemed to work. This last time Unraid reported over 1000 errors on each of my array disks, with the exception of the one that it disabled; for that one it reported only 3. Can anyone look at the diagnostic data and give me some insight on where to go from here? I am at a loss. Thanks, server-diagnostics-20230122-1102.zip
  7. One other question. If I disable IOMMU, will I still be able to pass my Z-Wave USB dongle through to my Home Assistant VM?
  8. Yeah, I didn't originally include the diagnostics because the log file wouldn't have gone beyond my last reboot. I have been debating whether I should look into getting an LSI card in IT mode. I'll try disabling IOMMU to see if that helps.
  9. server-diagnostics-20230111-1129.zip
  10. This might be a whole thing, but I have been dealing with quite a few CRC errors on my disks. I have changed the cables, the power supply, and even the case to do away with the backplane, and connected my SAS-to-SATA card directly to the drives. Recently I have been rearranging my shares, which means I have been moving a lot of data within the array. Several times during this process the WebGUI has become unresponsive, and slowly, over the course of several hours, I start losing Docker containers and the ability to connect using SSH. Before I lose the ability to connect via SSH, powerdown doesn't work and downloading the diagnostics just hangs. Eventually I must force an ungraceful shutdown. Fortunately, or unfortunately, this hasn't happened since I started keeping persistent logs, so I don't have any from when it happened, and I am not sure whether it is related to the disk issues. I have noticed several disk errors on one particular drive located at ata14.00. Last night that drive disconnected from the array, and I was only able to get Unraid to recognize it again once I unplugged it and plugged it back in, so it could have been a loose connection. During this time I rearranged several of the SATA connectors, and I am still getting the same errors, this time on ata15.00, so I assume it is drive related. My array consists mostly of WD Red 4 TB and 8 TB drives, but the vast majority of errors are isolated to three 8 TB white-label EDAZ drives that I shucked from Easystore or Elements enclosures. It's hard for me to imagine that all 3 of these drives are bad, but that seems to be the common denominator. I have attached the server logs that I have as well as the SMART data from the 3 drives. Any input would be appreciated. Thanks, server-smart-20230111-1014.zip server-smart-20230111-1013.zip server-smart-20230111-1015.zip syslog-10.20.1.2.log
  11. Disregard. I didn't realize you still have to set it up as a remote syslog server when you enable the local server. Thanks, I will check the logs next time this happens.
  12. Yeah I thought that might be the case. I am no expert on reading log files but they appeared to not go back too far. I do have a question though on setting up a syslog server. I enabled local syslog server and pointed it to a new share that I created but it doesn't appear to be collecting any logs. The share is completely empty. My understanding was that the syslogs were supposed to continuously write to the share.
  13. I have had repeated issues with the WebUI failing. It typically happens when I am moving a bunch of data between disks. After it happens I can SSH into the server, but it won't power down with the command and even the diagnostics command hangs. Eventually I lose access via SSH, and if I leave it long enough I start losing access to my Docker containers and VMs. The only way to get everything back up and running is to physically force a shutdown or restart. I am hoping someone can review the diagnostics and shed some light on what the issue is. Thanks, server-diagnostics-20230107-2327.zip
  14. Turns out the CD drive in the template got changed from SATA to IDE, and now it seems to work fine. I think it wouldn't connect to VNC because my server was being taxed pretty heavily with a parity check and the mover running.
  15. So I disabled the VM settings in order to delete the libvirt.img, and once I restarted it, libvirt was recreated. I have done that a few times. VNC now connects, but the attachment is what it shows.
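
For the errors on ata14.00 and ata15.00 described in post 10, one rough way to see which ports show up most often in a saved syslog is to tally the `ataN.NN` tokens per line. This is only a sketch in Python: the function name `count_ata_mentions` and the sample lines are made up for illustration, and real kernel messages will look different. It counts mentions of a port, not confirmed drive faults.

```python
import re
from collections import Counter

# Matches a device token such as "ata14.00" anywhere in a line.
ATA_DEVICE = re.compile(r"\b(ata\d+\.\d+)\b")


def count_ata_mentions(lines):
    """Return a Counter mapping each ataN.NN token to the number of lines that mention it."""
    counts = Counter()
    for line in lines:
        # set() so a line that repeats the same token is counted once.
        for device in set(ATA_DEVICE.findall(line)):
            counts[device] += 1
    return counts


# Hypothetical sample lines, invented for illustration only.
sample = [
    "kernel: ata14.00: placeholder error text",
    "kernel: ata14.00: placeholder error text",
    "kernel: ata15.00: placeholder error text",
    "kernel: unrelated message",
]

counts = count_ata_mentions(sample)
for device, n in counts.most_common():
    print(device, n)
```

To run it against a real log, pass an open file instead of the sample list, for example `count_ata_mentions(open("syslog-10.20.1.2.log", errors="replace"))`. If the counts follow the drive when cables are swapped, that points at the drive; if they stay with the port, that points at the cable or controller.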