kjarri

Members
  • Posts

    20
  • Joined

  • Last visited

Posts posted by kjarri

  1. 1 hour ago, JorgeB said:

    Docker image is corrupt, apparently because it ran out of space, delete and re-create.

    Hey JorgeB, 

    Yes, that seems to be because the Log partition is full. 

    image.png.be840364e3575d29ea435f8a049a1b8f.png

     

    And it seems to be that the docker logs are filling up the partition:

    Quote

    root@Heimanas:~# du -sm /var/log/*
    0       /var/log/btmp
    0       /var/log/cron
    0       /var/log/debug
    1       /var/log/dmesg
    128     /var/log/docker.log
    0       /var/log/faillog
    0       /var/log/lastlog
    0       /var/log/libvirt
    0       /var/log/maillog
    0       /var/log/messages
    0       /var/log/nfsd
    0       /var/log/nginx
    0       /var/log/packages
    0       /var/log/pkgtools
    0       /var/log/plugins
    0       /var/log/pwfail
    0       /var/log/removed_packages
    0       /var/log/removed_scripts
    1       /var/log/samba
    0       /var/log/scripts
    0       /var/log/secure
    0       /var/log/setup
    0       /var/log/spooler
    0       /var/log/swtpm
    1       /var/log/syslog
    0       /var/log/vfio-pci
    1       /var/log/wtmp

     

    Any way I can find out why?

  2. Hello, 

     

    I seem to be having an issue with my system. 

    I've been having some problems and weird behaviour with some of my docker container but now I need some help. 

     

    It seems like at the moment the log is compleatly filled up, only a little after 24 hours after restarting the system. 

    I don't want to restart the system right away because I want to find out what is causing the fill up, if I have some misconfigured container or something like that. 

     

    Also, I have an error on my docker page: 

    Quote

    failed to open stream: Read-only file system in /usr/local/emhttp/plugins/dynamix.docker.manager/include/DockerClient.php on line 90

    I tried looking for answers about this and one thread talked about ata errors causing the docker file to be corrupted. 

    I had that issue two weeks ago so I was wondering if I had to rebuild my docker file or if the issue is related to the log issue. 

     

    I have attached the diagnostic file here so hopefully that helps finding the issue.

     

    Here is the post with the ata errors and the diagnostic files 

     

    Thanks.

     

    heimanas-diagnostics-20220812-1126.zip

  3. 1 hour ago, JorgeB said:

    I would run memtest for a couple of hours at least.

    Okay, I have run memtest for around 1 hour now and the first pass completed without any error. 

     

    I'll keep on going maybe one more pass but is seems like the memory is not an issue at this point. 

     

    If no errors are detected by memtest, should I try a correcting parity check and then another parity check to see if the errors are gone?

  4. 3 minutes ago, trurl said:
    Aug  1 14:18:05 Heimanas kernel: ata1.00: failed command: WRITE FPDMA QUEUED
    Aug  1 14:18:05 Heimanas kernel: ata1.00: cmd 61/c0:f0:e0:9e:1d/02:00:be:00:00/40 tag 30 ncq dma 360448 out
    Aug  1 14:18:05 Heimanas kernel:         res 40/00:00:a8:df:4a/00:00:be:01:00/40 Emask 0x50 (ATA bus error)
    Aug  1 14:18:05 Heimanas kernel: ata1.00: status: { DRDY }
    Aug  1 14:18:05 Heimanas kernel: ata1: hard resetting link
    
    Aug  1 15:14:42 Heimanas kernel: md: recovery thread: P incorrect, sector=17722576
    Aug  1 15:14:42 Heimanas kernel: md: recovery thread: P incorrect, sector=17722888
    Aug  1 15:14:42 Heimanas kernel: md: recovery thread: P incorrect, sector=17722896
    Aug  1 15:14:42 Heimanas kernel: md: recovery thread: P incorrect, sector=17722904
    Aug  1 15:14:42 Heimanas kernel: md: recovery thread: P incorrect, sector=17722912
    

    All I see in these latest are the parity errors, no reason to expect them to go away until corrected.

    Ahh okay, 

    I'll let the non correcting parity check run for a little bit to see if the ATA error comes up again. 

     

    However, is it not concerning that the incorrect sectors now are not the same sectors that where faulting and corrected before the reboot?

     

    Or fx. sector 0 was incorrect before the reboot and after the reboot, even though that sector should have been corrected?

     

    5 minutes ago, trurl said:

    USB NOT recommended for array and pool disks for many reasons

     

    Yes, I got that solution before knowing more of the downfalls of this approach. Am working on increasing the capacity of the system to decommission the drives and the USB enclosure completely. 

  5. Hello, 

     

    I need some help with my server. I got some parity errors two weeks ago after an unclean shutdown. 

    After reading many posts here I assumed everything should be fine and I just corrected the parity and did the parity check again and got no errors so I assumed everything was okay. 

     

    Now during my monthly parity check I am getting more errors, I am up to 11.119 sync errors corrected so far. 

    Now I am suspecting that something is wrong that needs investigating and was wondering if you could help me. 

     

    What I suspect is that one of my 4 2TB drives in an external USB hard drive enclosure is failing. 

    I am unable to run smart tests on them but I suspect they are failing since they are very old and I have used them for a long time. 

    I have a hard drive ready to replace any of them if they are failing but for that I would need to know witch drive is the faulting one. 

     

    However I have also found some errors in the logs that could be indicating the error but I can't understand what they are telling me:

    Aug 1 14:13:35 Heimanas kernel: ACPI BIOS Error (bug): Could not resolve symbol [\_SB.PCI0.SAT0.PRT0._GTF.DSSP], AE_NOT_FOUND (20200925/psargs-330)

    Aug 1 14:13:35 Heimanas kernel: ACPI Error: Aborting method \_SB.PCI0.SAT0.PRT0._GTF due to previous error (AE_NOT_FOUND) (20200925/psparse-529)

    Aug 1 14:13:35 Heimanas kernel: ACPI BIOS Error (bug): Could not resolve symbol [\_SB.PCI0.SAT0.PRT0._GTF.DSSP], AE_NOT_FOUND (20200925/psargs-330)

    Aug 1 14:13:35 Heimanas kernel: ACPI Error: Aborting method \_SB.PCI0.SAT0.PRT0._GTF due to previous error (AE_NOT_FOUND) (20200925/psparse-529)

    Aug 1 14:18:05 Heimanas kernel: ata1.00: limiting speed to UDMA/33:PIO4

    Aug 1 14:18:05 Heimanas kernel: ata1.00: exception Emask 0x50 SAct 0x641000c2 SErr 0x4090800 action 0xe frozen

    Aug 1 14:18:05 Heimanas kernel: ata1.00: failed command: READ FPDMA QUEUED

    Aug 1 14:18:05 Heimanas kernel: ata1.00: failed command: READ FPDMA QUEUED

    Aug 1 14:18:05 Heimanas kernel: ata1.00: failed command: READ FPDMA QUEUED

    Aug 1 14:18:05 Heimanas kernel: ata1.00: failed command: READ FPDMA QUEUED

    Aug 1 14:18:05 Heimanas kernel: ata1.00: failed command: READ FPDMA QUEUED

    Aug 1 14:18:05 Heimanas kernel: ata1.00: failed command: WRITE FPDMA QUEUED

    Aug 1 14:18:05 Heimanas kernel: ata1.00: failed command: WRITE FPDMA QUEUED

    Aug 1 14:18:05 Heimanas kernel: ata1: hard resetting link

    Aug 1 14:18:15 Heimanas kernel: ata1: COMRESET failed (errno=-16)

    Aug 1 14:18:15 Heimanas kernel: ata1: hard resetting link

     

    I have attached the diagnosis file below, hopefully that helps finding out the issue. 

     

    Thanks for the help in advance. 

    heimanas-diagnostics-20220801-1418.zip

  6. Okay sorry for the misunderstanding. I stopped the Read check, switched out the old parity with the new parity and started again in maintenance mode. And started the parity sync. The hard drive is still active and operating normally for now. Lets see if the parity sync is able to finish. 

  7. Okay, I tried to start it with the old parity drive but it warned me that it would wipe all data from it so I decided not to add it to the array for now, just started it in maintenance mode without parity. It seems that the drive started up so I started a Read check instead. That will take about half a day so hopefully that works. If that goes well I'm going to add the old parity drive, do the parity check and if that works do the parity drive switch again. Hopefully that works. Thanks for the help. 

  8. So I was upgrading my parity drive to a bigger one, from 4TB to 8TB. Everything was working well on the array before the upgrade.

    I shut down the server, took out the parity drive and replaced it with the new one and started the server. 

    I chose the new parity drive for the server and started the rebuild. As soon as the rebuild started another HDD failed. 

     

    I tried switching the parity drive back but the server would not take it because it thinks it is a new drive. (see screenshot)

    Is there anyway to make the server forcefully accept the old parity drive so I can switch the failed drive with a new one to rebuild it?

     

     

    Too many wrong or missing disks.png

  9. Hi all, 

     

    So my usb stick with the boot drive is dead. I have tried everything and it wont boot up. I don't have a Recent enough backup so I am not going to try that. Thats where a new drive comes in. I have flashed it and gotten the key file but I have read that you have to be careful selecting the correct Parity drive. I however am not sure witch one is my parity drive and was hoping for some help.

     

    Luckily I have a recent enough Diagnostics report for an unrelated issue and was hoping that somewhere there I could tie the parity drive to the serial number but I don't see where that is. Could someone help me identify my parity drive through this Diagnostics report?

     

    Thanks in advance

    heimanas-diagnostics-20191217-2329.zip

  10. Hey, I'm wondering if anyone else is having an issue with requested movies and tv shows are not shown as available under the requested tab. 

    I found many people having the same issue like this one: https://github.com/tidusjar/Ombi/issues/2626 

    In the end they all end up with updating the locale with this command sudo localectl set-locale LANG=en_US.UTF-8 exept my locale is already registered in the docker terminal: 

    ~# locale
    LANG=en_US.UTF-8
    LC_CTYPE="en_US.UTF-8"
    LC_NUMERIC="en_US.UTF-8"
    LC_TIME="en_US.UTF-8"
    LC_COLLATE=C
    LC_MONETARY="en_US.UTF-8"
    LC_MESSAGES="en_US.UTF-8"
    LC_PAPER="en_US.UTF-8"
    LC_NAME="en_US.UTF-8"
    LC_ADDRESS="en_US.UTF-8"
    LC_TELEPHONE="en_US.UTF-8"
    LC_MEASUREMENT="en_US.UTF-8"
    LC_IDENTIFICATION="en_US.UTF-8"
    LC_ALL=

     

    So I tried changing it to anything else with this command: LANG=en_IN.UTF-8 

    Then restarted the docker container. 

    That did not work. 

    So I tried updating the locale of my UnRAID mashine. 

    That also did not work. 

    So now I am at a loss because all articles and problems either lead to this fix. 

    Has anyone experienced this or has an Idea about a fix? 

    Running Unraid 6.7.1 and Ombi Latest

     

     

  11. Hello

     

    I am pretty new to unraid but I like to fool around with it, see what I can do. I have had no big issues, only a couple of minor bugs here and there but I was able to solve them one by one.

     

    The story starts today when I added a hard drive to my system. About an hour after I get a notification about a overheating problem on my hard drives. I thought it was because of the clearing process so I did not panic. Then when I got back I decidied to check the logs. There where millions of failed login attempts from ip adresses I did not know like:

     

    Jan  6 19:50:38 heimanas sshd[4431]: Failed password for root from 116.31.116.41 port 52195 ssh2

    Jan  6 19:50:38 heimanas in.telnetd[4501]: connect from 39.36.242.27 (39.36.242.27)

    Jan  6 19:50:38 heimanas sshd[4431]: Received disconnect from 116.31.116.41 port 52195:11:  [preauth]

    Jan  6 19:50:38 heimanas sshd[4431]: Disconnected from 116.31.116.41 port 52195 [preauth]

     

    And also from a website that seemed to be comcast:

     

    Jan  6 19:52:23 heimanas login[5940]: invalid password for 'UNKNOWN'  on '/dev/pts/3' from '173-10-58-34-Michigan.hfc.comcastbusiness.net

     

    So I blocked every port forward or anything simmular and I have had no incidents so far. I was wondering if there is anything I should do, report the IP adresses to some blacklist database or something or what to do to prevent this. I'll attach a part of the syslog, it was too large to upload.

    Thank you

    syslog.txt

  12. Hey

     

    My unraid server all of a sudden wont work through my web browser. Neither can I connect to it via Plex.

     

    Everything else looks  fine, I can putty in to it, I can browse all the files on all of the devices on the network and with no issues. I really need plex so if anyone has a suggestion I am up for it. I'll attach a link to the syslog from when I started it and when I wrote this.

     

    Thank you

    https://drive.google.com/file/d/0B7-BriBrM70zbWhNU3lGTkZRdDg/view?usp=sharing