Jackal

Members
  • Posts

    11
  • Joined

  • Last visited

Everything posted by Jackal

  1. Hi @browned, I have the same HBA card, HPE H240, on a Supermicro motherboard. Can you please tell me how did you find the relevant disk index number? I guess it is the "smPort1" variable in the smart-one.cfg script, correct?
  2. Hi @Daniel Ayers, Didi you intalled some how "ssacli" tools in Unraid? Could you please share how did you do that? Thanks in advance!
  3. I would like to store my cache drives (SSDs) into a case like the following ones, where you can store up to 6x drives in 5.25'' space. Any recommendations? 1. Icy Box Raidsonic: IB-2260SSK-12G [https://icybox.de/en/product.php?id=355] 2. Icy Dock ExpressCage: MB326SP-B [https://www.icydock.com/goods.php?id=231] I have seen a few users using the MB326SP, however I was leaning towards the Raidsonic with the metal enclosure. Please let me know if you have any ideas/prior experience e.t.c.
  4. Well, the following settings worked for me. I also discovered, this is what it is used by the 'preclear' plugin in order to extract attributes and information. I hope I am not doing something wrong here New Information Added, that cleared things up for me more. This is taken from the smartctl (8) - Linux Man Pages
  5. Hi all, I am reviving this thread because since a little while ago I got a 2nd hand H240 because I saw it was a plain HBA and should work out of the box. Today upon testing, I realized that there might be a problem with the HDD temperatures. They are not show up, as well as the SMART attributes. I saw that @ezhik made a post a couple of years back as well. Playing around I just stumbled upon the following parameters that one can setup optionally in Unraid and now the temperatures are shown up properly in the GUI. Is this all that one has to do? What happens if a disk changes a letter after a reboot (or after being failed from the array)? However I realized that SMART attributes do not appear. Is there a solution about that by any chance?
  6. Hi @Xaero . I really appreciate following up on my problem. Had it not been for this problem I would not have gathered all this information. So far all the DIMMs in pairs have been tested successfully, as well as 4 of them at the same time. Now, I am waiting for a test with all of 6 of them installed in slots except the "problematic" one. It may not be in the motherboards recommended mode of operation but what shall I do. Next test is to test once again the "problematic" slot and see if the error is reproduced. And then I am definitely following your advice and pop the CPU out for inspection/cleaning etc during the weekend. I have done that many times in the past before, but last time I did such a thing, I discovered 2 bent pins in the cpu socket and it took me a while to bring back the system in a working state. Not a very pleasant memory ;-). But will definitely do if necessary. Thankfully all this will end up pretty soon, as I want to move on, choose a case, and start putting the parts together ...!!!
  7. Now I have started testing the modules in pairs to see which one is the faulty one. So far 2/6 DIMMs passed the stress tests without producing any ECC errors. Now I am quite sceptical if there is something wrong with a specific RAM slot according to @Xaero. Let us see ... 4 more hours remain to see what is going on with these two DIMMs.
  8. Thanks a lot @Xaero. It seems that I have some testing in front of me. The thing is that I do not have a known good memory module that I can swap. I am testing these that the system came with to see which is bad and which is not. Now you are making me scary !!! I hope this is not going to be anything than a bad memory module...
  9. Hi all, I recently got my hands into a SuperMicro X10DAI with (2x) E5-2630v4 CPUs and (6x) 8GB RDIMMs of RAM which I will become a multipurpose server. So far so good What I found out today after testing the RAM, for a second time in a weeks-period, is that I get some ECC errors. Well, it is just "2" but shall I be worried / no-worried / semi-worried / forget and re-check it again after an X-amount of days/months? It seems that the system responds and corrects the errors but...? Furthermore, since I do not find any answer in the manual, which one is Channel 4? In other manuals I have seen detailed information regarding RAM population. In this one I can not seem to understand the "Fill First" Method. My Guess is that ------------------------- DIMM_00 ------- DIMM_02 Channel_00 --> P1-DIMMA1 & P1-DIMMA2 Channel_01 --> P1-DIMMB1 & P1-DIMMB2 Channel_02 --> P1-DIMMC1 & P1-DIMMC2 Channel_03 --> P1-DIMMD1 & P1-DIMMD2 Channel_00 --> P2-DIMME1 & P2-DIMME2 Channel_01 --> P2-DIMMF1 & P2-DIMMF2 Channel_02 --> P2-DIMMG1 & P2-DIMMG2 Channel_03 --> P2-DIMMH1 & P2-DIMMH2
  10. And now I am afraid that 1 of the 2 parity disks I have went offline Why This part was in Yellow !!! Jun 24 19:45:48 MonkeyIsland kernel: sd 7:0:3:0: [sde] Synchronize Cache(10) failed: Result: hostbyte=0x01 driverbyte=0x00 And this part was in RED !!! Jun 24 20:30:37 MonkeyIsland kernel: md: disk29 read error, sector=3907077192 Jun 24 20:30:37 MonkeyIsland kernel: md: disk29 write error, sector=3907077192 Jun 24 20:31:07 MonkeyIsland kernel: md: disk29 read error, sector=24 Jun 24 20:31:07 MonkeyIsland kernel: md: disk29 read error, sector=1501708064 Jun 24 20:31:07 MonkeyIsland kernel: md: disk29 read error, sector=1501708072 Jun 24 20:31:07 MonkeyIsland kernel: md: disk29 read error, sector=1501708080 Jun 24 20:31:07 MonkeyIsland kernel: md: disk29 read error, sector=1501708088 Jun 24 20:31:07 MonkeyIsland kernel: md: disk29 write error, sector=24 Jun 24 20:31:07 MonkeyIsland kernel: md: disk29 write error, sector=1501708064 Jun 24 20:31:07 MonkeyIsland kernel: md: disk29 write error, sector=1501708072 Jun 24 20:31:07 MonkeyIsland kernel: md: disk29 write error, sector=1501708080 Jun 24 20:31:07 MonkeyIsland kernel: md: disk29 write error, sector=1501708088 What is going on? I just created a folder ;-) !!! Nothing More !!! What shall I do now ? ? ?
  11. Hi all. As this is my 1st message in this forum, I am short of introducing myself directly with a question at hand !!! I am a new user of UNRAID and have built a small NAS for storage purposes. I converted my old PC, an i7-4770, and added 4 new HDDs (2 x Seagate IronWolf & 2x Toshiba N300) along with a couple of old HDDs and an SSD that I had laying around. The HDDs are attached to an LSI card 9211-8i which I converted to HBA with the newest FirmWare (this forum provided excellent help and I appreciate that !!! ) mpt2sas_cm0: LSISAS2008: FWVersion(20.00.07.00), ChipRevision(0x03), BiosVersion(07.39.02.00) However, yesterday I got informed that 2 of the brand new drives, 1 IronWolf & 1 Toshiba, have "UDMA CRC ERRORs". One has 5 while the other one has 72. I checked the logs and I did not see anything else suspicious. UNRAID does not reveal any Read/Write Errors. I do not like this at all. Especially for these 2 HDDs that are brand new. I am kindly asking for your ideas what things to check in order to minimize these CRC errors in the future. I understand it is not critical but it is not a good thing either. Below, are some more information. The LSI card was bought 2nd hand from e-bay. The seller had very good reputation and insured me that the card was bought locally from a store. The cables that were included were brand new in a sealed box During the firmware update with the LSI official FirmWare, all went smoothly I even put a fan on top of the card as seen here (https://www.thingiverse.com/thing:4171229) My PSU is 550W (which I do not suspect that this could be the problem) I would be more suspicious that all the HDDs are in a common cable and not in two different lines like the PSUs that can be found in Workstation PCs I pre-cleared all the HDDs without any problems before adding them in the array. Your ideas are more than welcome!!! Thanks. monkeyisland-diagnostics-20210624-1907.zip