Jump to content

RobJ

Members
  • Posts

    7,135
  • Joined

  • Last visited

  • Days Won

    4

Posts posted by RobJ

  1. 3 hours ago, mtruffa said:

    I started my conversion of 16 drives to XFS and when I got to disk4 I got the following error

     

    REISERFS warning (device md4): jdm-20001 reiserfs_xattr_get: Invalid magic for xattr (user.org.netatalk.supports-eas.axBWPi) associated with [2 4 0x0 SD]

     

    This looks like it might be an old issue returned, involving extended attributes and AFP.  I'm guessing you have accessed this drive over AFP at some point?  Read the discussions and fixes found in these threads:

     

       A "mover" issue?  (Mover uses rsync too)

       Running mover does not successfully move items

       Extended Attributes Fix

  2. The reason I asked for diagnostics was there were so many things wrong with the file system structures, that I was afraid that either he had tried to repair the drive instead of the partition (sde instead of sde1 or md4), or it wasn't an XFS drive, but a drive with a similar file system, enough to find file system pieces and try to construct an XFS file system.

     

    I'm afraid the damage is so severe that it's too much for the current xfs_repair, and it's crashing.  I really hope you have a backup for the data.

     

    I'm a bit intrigued by your statement of a 19 day old parity copy, but didn't understand what you mean by that.  Is that a *separate* disk that you can put back in?  And was Disk 4 fine at that time?  What exactly do you have from 19 days ago?

  3. unRAID is an all RAM based OS, built from scratch each time it boots, so the unRAID boot flash drive provides both the copies of the distro to be expanded into RAM, plus the 'persistent storage' for all configuration and logging, in /boot/config and /boot/logs.  Other user scripts and programs are generally also stored on the flash, although could be on a Cache or array drive if not needed until after array start.

     

    Probably best to move it back to the flash drive:

       mv /dskt /boot

     

    Then to run it, you would either run:

       /boot/dskt ...

    Or:

       cd /boot

       dskt ...

  4. Sorry, I didn't see that 68 or the "in the past".  Good catch, Brian.

     

    194 can be taken literally, 190 is "one hundred minus", so the 076 means it's currently 24C, the 045 means a max of 55C, and the 032 means it once reached 68C, well over 55C.

     

    I would Preclear it 3 times at least, and see if it holds up.  If it does, then it survived the over-temp.  I'd still monitor it for awhile.

  5. 12 hours ago, CarloC said:

    I, too, thank everyone who took the time to read and respond to my questions and requests for help because, it turns out that, no more than a few hours after you asked a question about doing a disk conversion from RFS to XFS, I asked a very similar one. I also used Unbalance to empty a disk and then re-Format the empty disk. I then used Unbalance to move the next set of files and so on. I am doing my third disk now.

    For me, it also appears to have worked seamlessly and with my array serving files the whole time except for just a few minutes. I am using Unraid 6.3.2

     

    But this was after I did follow the file conversion wiki to do my first conversion.

     

    To those who will know, I would ask why the wiki outlines, what seems to be, such a convoluted system of swapping drive assignments and new configurations while over-riding un-needed parity re-builds. At the same time, still very strongly cautioning about background processes that might be changing the files. It's fairly nerve-wracking for a newbie. 19 Steps is a daunting total.

     

    Can I assume it is because my array is very simple? It is just a home media file server. One user share and no disk shares. No dockers or any other programs running. No cache drives and just one parity disk. In essence, I don't care where the actual files are located, I just need Unraid to keep track of the 1 user share.

     

    If I had a more complex system, could I have brought the whole thing down by doing what I've done? Is that why keeping the files in the same disk location so important?

     

    I'll start by giving some background on the current wiki method, and then touch on a few things you may not have thought about.  I had noticed already several users using unBALANCE, and had been thinking about adding a method based on it, but I won't be able to completely flesh out the steps as I've never used unBALANCE, and can't because I don't use User Shares (which of course partly explains my own methodology).  I've always preferred knowing and controlling exactly where everything is, and I've a personal catalog file I keep open in an editor tab with all of my series and categories listed, with the drives they're on.  For example, I know that my Alias shows are on Disk 1, and all my PBS stuff is on Disk 4.  User Shares are convenient for many, but for me it just adds another layer to the access.  Off-topic though ...  But converting drives is a drive issue, only indirectly a User Share issue.  It's drives that are formatted, and that dominates the problem.


    I don't know for sure, but I suspect if you wrote out the steps for doing the entire array with unBALANCE, it would be close to 19, or maybe even more.  A general method for users at large has to handle every contingency and every situational variation, as safely as possible for the least technical of users.  That always takes longer, and often more steps are involved, 'just in case'.  It inevitably means lots more words!

     

    I do think my method is the most straightforward, safest, and fastest though, but then disk shares is what I work with and understand.  It does have an extra step or two to accommodate User Shares.  For User Share users, which is most or almost all unRAID users, unBALANCE sounds attractive, and I agree that it's a great plugin!  It uses basically the same rsync command, so should have the same speed and file verification checksumming for the individual transfers.  But in operation over multiple drives, it's likely to take longer, because it's moving the files off one drive to the others, then moving the files back when you do the next drive, and possibly moving some files multiple times, pushing them around the various drives to keep them out of the way of the next format operation.  With my method, files only move once.  The only way to make sure no files get moved more than once with unBALANCE is to carefully manage exactly where each file starts and where it ends up, and that sounds like something you wanted to avoid, if you don't want to care where the files are.

     

    To me, my method seems just as seamless, and allows full operation just as much.  Both ways, you have to stop the array, make changes, then restart the array, just as many times.  When I did it with 6.1, I didn't have to use New Config, just adjust the assignments and restart the array.  Unfortunately in 6.2, something changed and it won't allow that, so we had to add the New Config step, which I agree does add a little extra hassle.  6.0 and 6.1 users don't have to do that.  We have requested that to be corrected, so we can just restart the array, but as far as I know, it hasn't happened yet (haven't checked it recently though).

     

    The strong cautions though are just as important, for either method!  How do you know for sure that there were not files lost when you did it?  Because if there were any other processes (internal plugins, Dockers, scheduled scripts, or VM's, or externally scheduled backups) writing files to the array, they could very well have added files to the very drive you were emptying with unBALANCE.  In fact, the emptying of a shared drive makes it the most attractive target for writing new files to.  Either you make absolutely sure that nothing else could be writing to the array, or you should run unBALANCE a final time, just to make sure that nothing is left on the drive (written to it after unBALANCE had swept across it), before you format it.  Does unBALANCE after moving folders come back and check if they have reappeared on that drive, with more files in them?

     

    So I'll add a general unBALANCE-based method, and I think many users may prefer it, as it does hide some of the detail, but it will take longer to perform.  And they will have to manually configure unBALANCE for each drive emptying.  The steps will have to have just as many warnings.  Users who aren't running anything else at all will obviously have a simpler time, can ignore the warnings, but that's a special case.

     

    I really don't want any of this to be daunting or nerve-wracking, but each time we hear of a user doing something we hadn't thought of and getting into trouble, the word count increases.  The more words, the more daunting it appears ...  (just look at this post for instance!)

    • Upvote 2
  6. 49 minutes ago, Zippi said:
    Practically  yellow warning of "hard resetting link" on all the ports but no red errors and the speed is OK not degraded.
    Is it normal or is an issue?

     

    In this case, the yellow is a false positive, nothing wrong, although it's a little odd.  Some devices need a kick to initialize them, a hard reset to start them up.

     

    You won't see any speed degradation with only one drive attached.

  7. No, you can run without parity fine.  You've obviously learned the most important lesson, the importance of backups!

     

    What parity gives you is additional fault tolerance for this particular copy of your data.  If a drive fails, you replace it and using parity rebuild it exactly as it was.  That adds convenience when the next drive fails, and of course they all do sooner or later.  But so long as you have good and up to date backups, you can live without the convenience of parity assisted recovery.

     

    And of course, without a parity drive you don't need a cache drive.  There are other uses for it, but that's of much less importance.

  8. 3 hours ago, Pauven said:
    13 hours ago, ufopinball said:

     

    Sooooo, it seems like you are primarily seeing errors in MemTest and unRAID.

     

    Not to throw more fuel on the fire, but I noticed your screen shot showed "MemTest86 5.01"?  My screen is showing "PassMark MemTest86 V7.3".

     

    I don't remember all the details on the various different MemTest versions.  Might be worth checking both in Google to see which is latest, and if the PassMark version is better suited to your configuration?  Especially if Windows 10 and open SUSE are both happy on this RAM?

     

    5.01 is the Memtest86 that comes with the unRAID distro.  I don't think it would make much difference running the Windows flavor.

     

    Just to clarify, both Memtests are based on the same code, and neither are Linux or Windows programs, probably pure assembly based (maybe something added for the PassMark GUI).  If either one detects errors, you can trust there are problems.  The PassMark one is more advanced, supports the latest memory tech, and is probably the one you should use.  For more detail on both, including their history and differences, see my (unfinished) FAQ entry:  How do I test my RAM?

     

    The general rule is that if you can see any errors at all, there's a bad RAM stick, don't use it.  But I think it's possible to see RAM errors if it's configured wrong, wrong timings or wrong voltages.  Not an expert opinion though ...

     

    Edit:  Hah!  I see ufopinball just beat me!

  9. There's nothing important that changed, so yes you can just live with it.

     

    You clearly changed something in the BIOS, because its memory layout changed, and several other small things changed too, like the CPU being slightly faster, but not enough to be detectable!  Two changes did NOT happen that we wanted, but may not be important.  The 2 phantom ports are still there, and in IDE emulation mode, but who cares as you can't use them anyway.  The Parity2 drive still appears to have a loose connection on boot, with exactly the same errors as last time, so nothing you did changed that.  The only other possibility is a power draw issue to the drive - you can check that by spinning all the drives down, waiting a few seconds, then spinning them all up at the same time (the Spin Up button), and checking the syslog for the same error messages.  If none, guess you'll have to live with it.

     

    As to the BIOS betas, all new software and all upgrades of any software are beta until they have been tested on your own machine.  If there aren't adverse reports about them, and you have used your motherboard tools to backup your current BIOS, then it should be safe to try one of them.

  10. 28 minutes ago, dukiethecorgi said:

    Is there any progress to finding the root cause of this problem?  The forum has quite a few posts by people that are seeing call traces and unresponsive GUI, surely I'm not the only one seeing this problem.  Could I roll back to an earlier version that doesn't have this issue?

     

    You can roll back to 6.2.4, from the LimeTech download page, just replace the bz* files and reboot.  However, users are reporting the the Docker support in 6.3 is not backward compatible with 6.2, so you will have to redo the Docker setup.  I imagine you won't have to redo the Dockers themselves, just the main docker config and docker.img.  It will be interesting to see if an earlier kernel works better for you.

  11. 4 minutes ago, BradJ said:

    I'll look into the BIOS, I think I am one short of the latest.  I'll install it and see if anything changes.

     

    Post a syslog after you install it, and I'll compare the 2 for changes.

     

    On the 6th SATA port I have a cable set up but no hard drive attached.  That's where that error is coming from.

     

    I'm a little hesitant to agree.  A dead unconnected cable should not be visible.

  12. 10 hours ago, CarloC said:

    I was already a little sensitized to this issue because of an issue I am/was having with preclearing some drives.

    In this post https://forums.lime-technology.com/topic/12391-re-preclear_disksh-a-new-utility-to-burn-in-and-pre-clear-disks-for-quick-add/?do=findComment&comment=506407 some corrections to the preclear_disk.sh file are presented.

    I really struggled with getting the second more complex correction done and had to give up.

    I will write a post to this topic talking about all the issues I have recently had with a preclear that would not go correctly.

     

    Commands like that aren't made to be typed by hand!   ;)

     

    I recommend using PuTTY, which does support copy and paste, although in an awkward way.  If nothing else, you can copy the command into a file, copy the file to the boot drive, then run the file from a terminal session.

     

    10 hours ago, CarloC said:

    The other information you shared has made me think of something. I'm not sure if this is the right place to bring this up.

    It would seem to me that it is possible to have a second set of drives which are not a part of the user shares and could then act as backups to the drives in the array. The drives in the array could be public for use by anyone on the network, and the others could be secure and controlled by an administrator. Although not a true, offsite independent backup, they would be a set of redundantly protected backup disks holding copies of files.

    Is this actually acheivable?

    Does anyone do this?

     

    Well yes they do, some using the Unassigned Devices plugin to add and control extra drives.  But a true backup should always be on a different machine, so most use other means for backups, like a second server, or cloud backups, etc.

  13. OK, that could explain it.  But that indicates poor device management in the BIOS, which should KNOW those ports aren't there, aren't available.  That's more impetus for finding a newer BIOS.

     

    By the way, you said you saw 6 SATA ports.  The syslog however indicates the 6th one may not be working.  The kernel designated it as a DUMMY, which generally means there's a port there, but the BIOS says it's not operational (which is what it *should* have said about those other 2 ports!).  More reason for an updated BIOS!

  14. Recently, there have been a few questions posted and topics started, application related, but put into the old Applications board of the Legacy section.  Topics there are very old, from long ago versions of unRAID.  Yet the questions do seem appropriate for a general Applications board, and are generally cross platform, not specific to Dockers or plugins or VM's, and may involve combinations of those app platforms.  Here's an example:  DLNA Upnp with Squeezebox LMS not working.

     

    I could not think where best to move this.  I'm wondering if we should re-activate this old board, or better, create a new Applications board in the Applications Support section?  A general Apps board would be more consistent with our move to the idea that apps are apps, whatever the platform.  But we don't really want to create more boards...

     

    If nothing else, we should lock the old Applications board, move it to the Deprecated section. 

  15. As to the ACPI issues, I don't see anything very unusual, but since your motherboard BIOS is from 2013, I'd recommend checking for a newer one, *might* help.

     

    The second parity drive appears to have a loose connection, might check all of its connectors, both ends of any connected cables, power and data.

     

    The kernel thinks it sees something on the last 2 SATA ports, but gives up trying to identify them.  You have IDE emulation turned on for your onboard SATA drives.  When you next boot, go into the BIOS settings and look for the SATA mode, and change it to a native SATA mode, preferably AHCI if available, anything but IDE emulation mode.  It should be slightly faster, and a little safer, if you do connect anything to them.

     

    Once those last 2 ports are using the native SATA AHCI option, then we should be able to see what's there, if anything.

  16. In this case, it's not really a matter of how long it has, but that it's *already* useless as a parity drive, so you might as well remove it now.  I don't see any indicators of mechanical issues, so if this drive had had files and folders, you would probably have had plenty of time to copy them off.  But it doesn't!  Parity drives don't have a single bit of data on them so there's nothing to save!  What it does have is lots of bad sectors, obvious media surface damage, at such a young age, and therefore the drive cannot be trusted.

     

    I'd pull it now, and run without parity until you get a replacement for it.  I can't think of any advantage to keep using it, and it's slowing down all writes, for nothing.  What danioj said is exactly correct.

×
×
  • Create New...