• SMART temperature misreported (I think)


    Frayedknot
    • Minor

    I'm running Unraid 6.10.0-rc2. A day ago I installed a Samsung 980 NVMe drive, and last night I got a message that it was at 85 Celsius. This is pretty alarming, since the system is mostly idle.

     

    On the MAIN page for Unraid it did indeed say 85C, but the SMART values for the drive showed a Temperature of 35C. Temperature Sensor 1 was also about 35C and Temperature Sensor 2 was about 39C.

     

    The funny thing is that the Thermal Temp. 2 Transition Count was 85. I tried to look up what that is, and I gather it is the number of times the drive went into thermal throttling (according to Kingston's documentation for their drives). So is Unraid reporting that value as the temperature?
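    For anyone wanting to compare the two fields side by side, here is a minimal sketch that pulls the composite Temperature and the transition count out of `smartctl -A` style text. The sample text is illustrative only: it uses the numbers quoted in this post, not a real capture, and `parse_nvme_fields` is a hypothetical helper name.

```python
import re

# Illustrative text in the style of `smartctl -A` output for an NVMe drive.
# The numbers are the ones quoted in this post, not a real capture.
SAMPLE = """\
Temperature:                        35 Celsius
Thermal Temp. 2 Transition Count:   85
Temperature Sensor 1:               35 Celsius
Temperature Sensor 2:               39 Celsius
"""


def parse_nvme_fields(text):
    """Return {field name: integer value} for each 'Name: value' line."""
    fields = {}
    for line in text.splitlines():
        name, sep, value = line.partition(":")
        match = re.match(r"\s*([\d,]+)", value)
        if sep and match:
            fields[name.strip()] = int(match.group(1).replace(",", ""))
    return fields


fields = parse_nvme_fields(SAMPLE)
composite = fields["Temperature"]
transitions = fields["Thermal Temp. 2 Transition Count"]
print(f"composite={composite} C, transition count={transitions}")
# prints "composite=35 C, transition count=85"
```

    If the value on the MAIN page tracks the transition count rather than the composite Temperature, the two numbers would move together over time, which is easy to check with a parser like this.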

     

    I also took temperature readings with a thermometer, which showed 35C on the drive. BTW, the drive is fitted with the motherboard's heatspreader (Gigabyte Z590 Aorus Elite).

     

    The overheat message saying the drive was at 85C came at 10:01 PM (22:01 in the log), and I got another one at 10:32 PM saying it had returned to normal.

     

    Here's the system log during that time:


    Jan  6 17:37:18 Tower emhttpd: read SMART /dev/sdc
    Jan  6 17:38:03 Tower emhttpd: read SMART /dev/sdb
    Jan  6 18:10:34 Tower emhttpd: spinning down /dev/sdb
    Jan  6 18:20:19 Tower emhttpd: read SMART /dev/sdb
    Jan  6 18:51:52 Tower emhttpd: spinning down /dev/sdb
    Jan  6 18:51:52 Tower emhttpd: spinning down /dev/sdc
    Jan  6 22:01:02 Tower sSMTP[2073]: Creating SSL connection to host
    Jan  6 22:01:02 Tower sSMTP[2073]: SSL connection using ECDHE-RSA-AES256-GCM-SHA384
    Jan  6 22:01:03 Tower sSMTP[2073]: Sent mail for ********** (221 2.0.0 csp01.eastlink.ca Eastlink closing connection) uid=0 username=root outbytes=836
    Jan  6 22:01:51 Tower flash_backup: adding task: /usr/local/emhttp/plugins/dynamix.my.servers/scripts/UpdateFlashBackup.php update
    Jan  6 22:12:24 Tower webGUI: Successful login user root from 192.168.22.30
    Jan  6 22:12:53 Tower emhttpd: read SMART /dev/sde
    Jan  6 22:13:03 Tower emhttpd: read SMART /dev/sdd
    Jan  6 22:13:27 Tower emhttpd: read SMART /dev/sdc
    Jan  6 22:25:27 Tower emhttpd: cmd: /usr/local/emhttp/plugins/dynamix/scripts/disk_log nvme0n1
    Jan  6 22:32:56 Tower flash_backup: adding task: /usr/local/emhttp/plugins/dynamix.my.servers/scripts/UpdateFlashBackup.php update

     

    One last thing I noticed is that the drive now shows a value of 132 for Thermal Temp. 2 Transition Count. I know SMART is an interesting beast and different manufacturers use different values, so maybe that isn't actually what this attribute means.

    A test I did this morning was to run a benchmark on that SSD with the DiskSpeed docker, and Temperature Sensor 2 never went above 50C.

     

    My next plan is to update the firmware (apparently there is a new version for this drive).

     

    Quick Edit: I was going to include the SMART report, and I swear it just showed "Temperature of 85C" in the SMART values, while the MAIN page was fine. Unfortunately I clicked away, and when I went back it had returned to 34C, and the Thermal Temp. 2 Transition Count is now 133. I don't believe Temperature Sensor 1 or 2 was off, just the generic "Temperature" one.
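    Since the spike is gone by the time I look again, one way to catch it would be to log readings and flag any where the generic Temperature disagrees with both sensors. A rough sketch follows; `is_suspect` and the 20-degree tolerance are my own assumptions, and the sensor values shown for the spike are assumed to have stayed normal, as I believe they did.

```python
def is_suspect(composite, sensors, tolerance=20):
    """True when the composite value is more than `tolerance` C from every sensor.

    The tolerance is an arbitrary assumption, not a value from any specification.
    """
    return bool(sensors) and all(abs(composite - s) > tolerance for s in sensors)


# (composite, [sensor 1, sensor 2]) pairs using the numbers from this post.
readings = [
    (35, [35, 39]),  # normal reading
    (85, [35, 39]),  # the spike, assuming the sensors stayed normal
]

flags = [is_suspect(composite, sensors) for composite, sensors in readings]
print(flags)  # prints [False, True]
```

    A flagged reading would point to the composite value being misreported rather than the drive actually running hot.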

     

    Here are my current SMART values:

    [Image: screenshot of the drive's current SMART values]




    User Feedback

    Recommended Comments

    I have a feeling it is an incompatibility between the NVMe drive and the OS. There are many other reports of pretty much exactly the same thing, and it does not look like an Unraid-specific issue.

     

    See this Samsung community thread, which has no resolution. (I'll still try the firmware update and report back if there are any positive results.)

    https://us.community.samsung.com/t5/Monitors-and-Memory/SSD-980-heat-spikes-to-84-C-183-F/td-p/2002779

    Edited by Frayedknot

