Jump to content

wallacearizona

Members
  • Posts

    3
  • Joined

  • Last visited

Posts posted by wallacearizona

  1. 20 minutes ago, DeathStar Darth said:

    Having just installed my first LSI 9201-16e and seen this in my logs, I went digging.

     

    It looks like this is coming directly from the driver

    /**
    
    * _scsih_temp_threshold_events - display temperature threshold exceeded events
    
    * @ioc: per adapter object
    
    * @event_data: the temp threshold event data
    
    * Context: interrupt time.
    
    *
    
    * Return nothing.
    
    */
    
    static void
    
    _scsih_temp_threshold_events(struct MPT2SAS_ADAPTER *ioc,
    
    Mpi2EventDataTemperature_t *event_data)
    
    {
    
    if (ioc->temp_sensors_count >= event_data->SensorNum) {
    
    printk(MPT2SAS_ERR_FMT "Temperature Threshold flags %s%s%s%s"
    
    "exceeded for Sensor: %d !!!\n", ioc->name,
    
    ((event_data->Status & 0x1) == 1) ? "0 " : " ",
    
    ((event_data->Status & 0x2) == 2) ? "1 " : " ",
    
    ((event_data->Status & 0x4) == 4) ? "2 " : " ",
    
    ((event_data->Status & 0x8) == 8) ? "3 " : " ",
    
    event_data->SensorNum);
    
    printk(MPT2SAS_ERR_FMT "Current Temp In Celsius: %d\n",
    
    ioc->name, event_data->CurrentTemperature);
    
    }
    
    }

     

    I don't know about you, but I don't want to be compiling my own driver :)

    I will be ordering a Noctura (or two) to somehow strap to what appeared to be a really rather good heatsink - putting the case sides on sees the reported temps rapidly rise...suspect I am going to need to add another extractor to the case as well to cope with the extra heat the Noctura will (hopefully) suck away from the LSI.

    Yes, the only post I have seen about these temp messages is to remove the bios from the card but on my machine it will not initialize with Unraid without the bios present.  
     

    Yes this card runs extremely hot.  I created a custom vent shroud with 2 60mm fans removing heat away and still runs in the 60’s .

     

    If anyone with the log problem has a bit of coding experience could we parse the logs of these messages?

  2. On 8/12/2022 at 4:38 AM, Strider27 said:

    @SimonF Yes, I tried several, "Active Streams", "File Activity" and "Open Files". The first two showed nothing, "Open Files" showed a large list including docker etc. which live on the cache drive anyway, disabled Docker and VM manager. Was only getting "find" popping up every 10 secs or so, disabled "Dynamix Cache Directories", "find" stopped popping up, nothing else in the list, still same issue.

     

    @doron Removed both plugins as well as Dynamix System Stats and rebooted, same issue.

     

    Every time a drive tries to spin down, there is a tiny read from that drive that wakes it up, it is like something is monitoring the drives and spins them up if they try to spin down.

    The SATA drive is now fully provisioned and is in the same array, seems to spin down no problem (no files on it yet though)

     

    Selected New Config and removed the array, drives would not spin down when in unassigned devices list either.

     

    EDIT:

    installed iotop + python 2.7  (needed for iotop)

    Stopped docker, when spinning down drives, no IO at all in iotop, in GUI shows either 7.9 kb/s or 4.8 kb/s every time I try to spin down manually.

     

    EDIT 2:

    Started randomly working but not all drives can be spun down. 3 left that can't spin down... only thing that changed is I installed Dynamix SCSI Devices at some point in the last few hours (tested  spin down immediately after installing and configuring it, didn't work at the time) . And the server has been up for a few hours without me rebooting for testing.

    I have the same issue from what I noticed its when the drive are commanded to shutdown then  I get this in the log that seems to wake up the drive:

     

    emhttpd: read SMART /dev/sdm

     

    Not sure why it triggers to read the SMART on the drive after its spundown.

  3. Hello I am seeking some help on a new setup using the LSI 9206-16e.  I was able to get the card flashed to IT mode with the BIOS but for some reason my MPT2SAS keeps filling up with temp flag warnings.

     

    Apr 15 09:20:04 kernel: mpt2sas_cm0: Temperature Threshold flags 0 exceeded for Sensor: 0 !!!

    Apr 15 09:20:04 kernel: mpt2sas_cm0: Current Temp In Celsius: 80

    Apr 15 09:20:24 kernel: mpt2sas_cm1: Temperature Threshold flags 0 exceeded for Sensor: 0 !!!

    Apr 15 09:20:24 kernel: mpt2sas_cm1: Current Temp In Celsius: 60

     

    I have read the temperature and air flow requirements for the card and I am within specs for the card.

     

    Can someone tell me how to change the Temperature Threshold Flags or how to silence the messages in the logs?

     

    Attached is my Diagnostics.

     

    Thanks Everyone! 

    wallaceserver-diagnostics-20210415-1143.zip

×
×
  • Create New...