[Plugin] IPMI for unRAID 6.1+


Recommended Posts

Everything checks out, that did it! Sorry it took me to long to figure out it was the order that was important, not the name, I just figured since it properly red the rpms and had them in the right place that the command to tune them would also be in the right place.

 

In any case great plugin! finer control over the min and max temps would be cool, maybe a hysteresis option would be good. My fans just sometimes step up and down 700rpms because my min and max is 30-50c and they keep bouncing between like 44 and 45. But that's not a problem now that everything is staying cool :) thanks again for the help man. I love your plugin!

Link to comment

 

 

There's no way to search threads on this forum, so can someone tell me if this plugin works for SuperMicro motherboards which have IPMI? I want to be able to monitor stuff and auto control fan speeds etc.

 

You can monitor SuperMicro motherboards and receive alerts through the unRAID notification system but I just haven't had time to implement a solution for fan control. The fan control for SM boards would also be limited to low, medium and high. They don't have the ipmi fan control that Asrock has. However you could possibly use the dynamix plugin to control your fans.

Link to comment

:-[

 

I rebooted the server (to put in a new cooling fan that was failing) and I've got a new problem now. The log says the HDD temps are 43C

 

2016-09-25 04:38:30 CPU_FAN1(100%):CPU Temperature(42°C), FRNT_FAN1(72%):HDD Temperature(43°C), FRNT_FAN2(72%):HDD Temperature(43°C)

 

and the fans are spinning at the right speed, but that HDD temp doesn't make any sense my harddrives show:

 

38C

38C

36C

34C

33C

33C

Array of six devices 35.3C

 

Even the ipmi plugin on the dashboard shows the HDD temp as 38C, but even after stopping and restarting the fan service the log still shows that it thinks the hdd temps are 43C. any idea where it's getting this info from? and how I could correct this?

 

Edit: In side the sensor page for ipmi it reads 38C so I'm not sure where the fan controller is getting this 43C from?

 

Link to comment

 

 

:-[

 

I rebooted the server (to put in a new cooling fan that was failing) and I've got a new problem now. The log says the HDD temps are 43C

 

2016-09-25 04:38:30 CPU_FAN1(100%):CPU Temperature(42°C), FRNT_FAN1(72%):HDD Temperature(43°C), FRNT_FAN2(72%):HDD Temperature(43°C)

 

and the fans are spinning at the right speed, but that HDD temp doesn't make any sense my harddrives show:

 

38C

38C

36C

34C

33C

33C

Array of six devices 35.3C

 

Even the ipmi plugin on the dashboard shows the HDD temp as 38C, but even after stopping and restarting the fan service the log still shows that it thinks the hdd temps are 43C. any idea where it's getting this info from? and how I could correct this?

 

Edit: In side the sensor page for ipmi it reads 38C so I'm not sure where the fan controller is getting this 43C from?

 

Any other drives in your system besides start drives? The ipmifan script and the sensor page are two separate things. I reuse the dynamix readings for the hard drive temps and take the greatest value.  The fan script checks the hard drive temps based on the frequency selected in the settings,  using smartctl to get the temps, then uses the highest value also.

Link to comment

Tried a bunch of things including checking out the smart data on all the other drives, seems that it was somehow stuck? kind of an unsettling error, rebooting it cleared it up. I just wish I could figure out how it got stuck in the first place, at least if the problem only happens at boot I can keep an eye for it, as it never seem to happen after it's running proper.  ???

Link to comment

 

 

IS it possible to show the CPU temp as low/medium/high on Supermicro boards that only give that kind of info?

 

I can do that and included it in the footer too. Can you run ipmi-sensors from the command line and post the output? Do you know what the exact text is? low/medium/high?

Link to comment

 

 

IS it possible to show the CPU temp as low/medium/high on Supermicro boards that only give that kind of info?

 

I can do that and included it in the footer too. Can you run ipmi-sensors from the command line and post the output? Do you know what the exact text is? low/medium/high?

 

root@Tower:~#  ipmi-sensors
ID | Name         | Type                     | Reading    | Units | Event
4  | CPU Temp     | OEM Reserved             | N/A        | N/A   | 'OEM Event = 0000h'
5  | Sys Temp     | Temperature              | 34.00      | C     | 'OK'
6  | CPU Vcore    | Voltage                  | 1.16       | V     | 'OK'
7  | DIMM Volt    | Voltage                  | 1.82       | V     | 'OK'
8  | 3.3V         | Voltage                  | 3.30       | V     | 'OK'
9  | 5V           | Voltage                  | 4.82       | V     | 'OK'
10 | 12V          | Voltage                  | 11.81      | V     | 'OK'
11 | -12V         | Voltage                  | -12.10     | V     | 'OK'
12 | 5VSB         | Voltage                  | 4.85       | V     | 'OK'
13 | VBAT         | Voltage                  | 3.22       | V     | 'OK'
14 | Fan1         | Fan                      | 3000.00    | RPM   | 'OK'
15 | Fan2         | Fan                      | 1000.00    | RPM   | 'OK'
16 | Fan3         | Fan                      | 1000.00    | RPM   | 'OK'
17 | Fan4         | Fan                      | 900.00     | RPM   | 'OK'
18 | Fan5         | Fan                      | 3000.00    | RPM   | 'OK'
19 | Fan6/CPU     | Fan                      | 1400.00    | RPM   | 'OK'
20 | Intrusion    | Physical Security        | N/A        | N/A   | 'OK'
21 | Power Supply | Power Supply             | N/A        | N/A   | 'OK'
22 | Thermal Trip | Module/Board             | N/A        | N/A   | 'OK'
23 | BIOS         | System Firmware Progress | N/A        | N/A   | N/A

 

And yes, it's Low, Medium and High - It was in Low when I got the output above..

 

cheers!

Link to comment

 

 

IS it possible to show the CPU temp as low/medium/high on Supermicro boards that only give that kind of info?

 

I can do that and included it in the footer too. Can you run ipmi-sensors from the command line and post the output? Do you know what the exact text is? low/medium/high?

 

root@Tower:~#  ipmi-sensors
ID | Name         | Type                     | Reading    | Units | Event
4  | CPU Temp     | OEM Reserved             | N/A        | N/A   | 'OEM Event = 0000h'
5  | Sys Temp     | Temperature              | 34.00      | C     | 'OK'
6  | CPU Vcore    | Voltage                  | 1.16       | V     | 'OK'
7  | DIMM Volt    | Voltage                  | 1.82       | V     | 'OK'
8  | 3.3V         | Voltage                  | 3.30       | V     | 'OK'
9  | 5V           | Voltage                  | 4.82       | V     | 'OK'
10 | 12V          | Voltage                  | 11.81      | V     | 'OK'
11 | -12V         | Voltage                  | -12.10     | V     | 'OK'
12 | 5VSB         | Voltage                  | 4.85       | V     | 'OK'
13 | VBAT         | Voltage                  | 3.22       | V     | 'OK'
14 | Fan1         | Fan                      | 3000.00    | RPM   | 'OK'
15 | Fan2         | Fan                      | 1000.00    | RPM   | 'OK'
16 | Fan3         | Fan                      | 1000.00    | RPM   | 'OK'
17 | Fan4         | Fan                      | 900.00     | RPM   | 'OK'
18 | Fan5         | Fan                      | 3000.00    | RPM   | 'OK'
19 | Fan6/CPU     | Fan                      | 1400.00    | RPM   | 'OK'
20 | Intrusion    | Physical Security        | N/A        | N/A   | 'OK'
21 | Power Supply | Power Supply             | N/A        | N/A   | 'OK'
22 | Thermal Trip | Module/Board             | N/A        | N/A   | 'OK'
23 | BIOS         | System Firmware Progress | N/A        | N/A   | N/A

 

And yes, it's Low, Medium and High - It was in Low when I got the output above..

 

cheers!

 

Could you try again with

ipmi-sensors --interpret-oem-data

 

And if that didn't work try

ipmi-sensors --interpret-oem-data --record-ids=4

Also what board?

Link to comment

root@Tower:/mnt/user/system/libvirt# ipmi-sensors --interpret-oem-data
ID | Name         | Type                     | Reading    | Units | Event
4  | CPU Temp     | OEM Reserved             | N/A        | N/A   | 'Low'
5  | Sys Temp     | Temperature              | 35.00      | C     | 'OK'
6  | CPU Vcore    | Voltage                  | 1.16       | V     | 'OK'
7  | DIMM Volt    | Voltage                  | 1.82       | V     | 'OK'
8  | 3.3V         | Voltage                  | 3.30       | V     | 'OK'
9  | 5V           | Voltage                  | 4.82       | V     | 'OK'
10 | 12V          | Voltage                  | 11.81      | V     | 'OK'
11 | -12V         | Voltage                  | -12.10     | V     | 'OK'
12 | 5VSB         | Voltage                  | 4.85       | V     | 'OK'
13 | VBAT         | Voltage                  | 3.22       | V     | 'OK'
14 | Fan1         | Fan                      | 3000.00    | RPM   | 'OK'
15 | Fan2         | Fan                      | 1000.00    | RPM   | 'OK'
16 | Fan3         | Fan                      | 1000.00    | RPM   | 'OK'
17 | Fan4         | Fan                      | 900.00     | RPM   | 'OK'
18 | Fan5         | Fan                      | 3100.00    | RPM   | 'OK'
19 | Fan6/CPU     | Fan                      | 1400.00    | RPM   | 'OK'
20 | Intrusion    | Physical Security        | N/A        | N/A   | 'OK'
21 | Power Supply | Power Supply             | N/A        | N/A   | 'OK'
22 | Thermal Trip | Module/Board             | N/A        | N/A   | 'OK'
23 | BIOS         | System Firmware Progress | N/A        | N/A   | N/A

 

Board is in my signature..

Link to comment

 

 

root@Tower:/mnt/user/system/libvirt# ipmi-sensors --interpret-oem-data
ID | Name         | Type                     | Reading    | Units | Event
4  | CPU Temp     | OEM Reserved             | N/A        | N/A   | 'Low'
5  | Sys Temp     | Temperature              | 35.00      | C     | 'OK'
6  | CPU Vcore    | Voltage                  | 1.16       | V     | 'OK'
7  | DIMM Volt    | Voltage                  | 1.82       | V     | 'OK'
8  | 3.3V         | Voltage                  | 3.30       | V     | 'OK'
9  | 5V           | Voltage                  | 4.82       | V     | 'OK'
10 | 12V          | Voltage                  | 11.81      | V     | 'OK'
11 | -12V         | Voltage                  | -12.10     | V     | 'OK'
12 | 5VSB         | Voltage                  | 4.85       | V     | 'OK'
13 | VBAT         | Voltage                  | 3.22       | V     | 'OK'
14 | Fan1         | Fan                      | 3000.00    | RPM   | 'OK'
15 | Fan2         | Fan                      | 1000.00    | RPM   | 'OK'
16 | Fan3         | Fan                      | 1000.00    | RPM   | 'OK'
17 | Fan4         | Fan                      | 900.00     | RPM   | 'OK'
18 | Fan5         | Fan                      | 3100.00    | RPM   | 'OK'
19 | Fan6/CPU     | Fan                      | 1400.00    | RPM   | 'OK'
20 | Intrusion    | Physical Security        | N/A        | N/A   | 'OK'
21 | Power Supply | Power Supply             | N/A        | N/A   | 'OK'
22 | Thermal Trip | Module/Board             | N/A        | N/A   | 'OK'
23 | BIOS         | System Firmware Progress | N/A        | N/A   | N/A

 

Board is in my signature..

 

Oh ok. So does it show low in the sensor readings page or did you want it for the footer? I already have --interpret-oem-data in the command string so it should show up in readings.

Link to comment

 

 

root@Tower:/mnt/user/system/libvirt# ipmi-sensors --interpret-oem-data
ID | Name         | Type                     | Reading    | Units | Event
4  | CPU Temp     | OEM Reserved             | N/A        | N/A   | 'Low'
5  | Sys Temp     | Temperature              | 35.00      | C     | 'OK'
6  | CPU Vcore    | Voltage                  | 1.16       | V     | 'OK'
7  | DIMM Volt    | Voltage                  | 1.82       | V     | 'OK'
8  | 3.3V         | Voltage                  | 3.30       | V     | 'OK'
9  | 5V           | Voltage                  | 4.82       | V     | 'OK'
10 | 12V          | Voltage                  | 11.81      | V     | 'OK'
11 | -12V         | Voltage                  | -12.10     | V     | 'OK'
12 | 5VSB         | Voltage                  | 4.85       | V     | 'OK'
13 | VBAT         | Voltage                  | 3.22       | V     | 'OK'
14 | Fan1         | Fan                      | 3000.00    | RPM   | 'OK'
15 | Fan2         | Fan                      | 1000.00    | RPM   | 'OK'
16 | Fan3         | Fan                      | 1000.00    | RPM   | 'OK'
17 | Fan4         | Fan                      | 900.00     | RPM   | 'OK'
18 | Fan5         | Fan                      | 3100.00    | RPM   | 'OK'
19 | Fan6/CPU     | Fan                      | 1400.00    | RPM   | 'OK'
20 | Intrusion    | Physical Security        | N/A        | N/A   | 'OK'
21 | Power Supply | Power Supply             | N/A        | N/A   | 'OK'
22 | Thermal Trip | Module/Board             | N/A        | N/A   | 'OK'
23 | BIOS         | System Firmware Progress | N/A        | N/A   | N/A

 

Board is in my signature..

 

Oh ok. So does it show low in the sensor readings page or did you want it for the footer? I already have --interpret-oem-data in the command string so it should show up in readings.

 

It shows up as N/A in the sensor reading.. It would be great to have in the footer as well..

Let me know if you need me to test anything..

Link to comment

Updated. It should show events as readings for OEM Reserved. The footer will take a bit more work.

 

Thanks, it shows up properly in the sensor report now, but still shows N/A in the dashboard..

Updated. Should work for dash and footer.

 

Thanks! It shows up properly in all places now.. Just a small request for your next update, can you remove the ' from the display.. Right now it shows as 'Low' everywhere instead of Low

 

Thanks again!

Link to comment

 

 

Updated. It should show events as readings for OEM Reserved. The footer will take a bit more work.

 

Thanks, it shows up properly in the sensor report now, but still shows N/A in the dashboard..

Updated. Should work for dash and footer.

 

Thanks! It shows up properly in all places now.. Just a small request for your next update, can you remove the ' from the display.. Right now it shows as 'Low' everywhere instead of Low

 

Thanks again!

 

Ok. I'll look into that.

Link to comment

Updated. It should show events as readings for OEM Reserved. The footer will take a bit more work.

 

Thanks, it shows up properly in the sensor report now, but still shows N/A in the dashboard..

Updated. Should work for dash and footer.

 

Thanks! It shows up properly in all places now.. Just a small request for your next update, can you remove the ' from the display.. Right now it shows as 'Low' everywhere instead of Low

 

Thanks again!

Can you run this command for me and post the output?

/usr/sbin/ipmi-sensors --output-sensor-thresholds --comma-separated-output --output-sensor-state --no-header-output --interpret-oem-data

Link to comment

Updated. It should show events as readings for OEM Reserved. The footer will take a bit more work.

 

Thanks, it shows up properly in the sensor report now, but still shows N/A in the dashboard..

Updated. Should work for dash and footer.

 

Thanks! It shows up properly in all places now.. Just a small request for your next update, can you remove the ' from the display.. Right now it shows as 'Low' everywhere instead of Low

 

Thanks again!

Can you run this command for me and post the output?

/usr/sbin/ipmi-sensors --output-sensor-thresholds --comma-separated-output --output-sensor-state --no-header-output --interpret-oem-data

 

Sure, here 'ya go..

 

root@Tower:~# /usr/sbin/ipmi-sensors --output-sensor-thresholds --comma-separated-output --output-sensor-state --no-header-output --interpret-oem-data
4,CPU Temp,OEM Reserved,Nominal,N/A,N/A,N/A,N/A,N/A,N/A,N/A,N/A,'Low'
5,Sys Temp,Temperature,Nominal,34.00,C,N/A,N/A,N/A,76.00,78.00,80.00,'OK'
6,CPU Vcore,Voltage,Nominal,1.16,V,0.86,0.88,0.90,1.62,1.63,1.64,'OK'
7,DIMM Volt,Voltage,Nominal,1.82,V,1.57,1.58,1.60,2.00,2.02,2.03,'OK'
8,3.3V,Voltage,Nominal,3.30,V,2.91,2.93,2.94,3.65,3.66,3.68,'OK'
9,5V,Voltage,Nominal,4.80,V,4.42,4.44,4.46,5.52,5.54,5.57,'OK'
10,12V,Voltage,Nominal,11.81,V,10.46,10.56,10.66,13.34,13.44,13.54,'OK'
11,-12V,Voltage,Nominal,-12.10,V,-10.50,-10.60,-10.70,-13.30,-13.40,-13.50,'OK'
12,5VSB,Voltage,Nominal,4.85,V,4.42,4.44,4.46,5.52,5.54,5.57,'OK'
13,VBAT,Voltage,Nominal,3.22,V,2.91,2.93,2.94,3.65,3.66,3.68,'OK'
14,Fan1,Fan,Nominal,3000.00,RPM,200.00,300.00,400.00,N/A,N/A,N/A,'OK'
15,Fan2,Fan,Nominal,1000.00,RPM,200.00,300.00,400.00,N/A,N/A,N/A,'OK'
16,Fan3,Fan,Nominal,1000.00,RPM,200.00,300.00,400.00,N/A,N/A,N/A,'OK'
17,Fan4,Fan,Nominal,900.00,RPM,200.00,300.00,400.00,N/A,N/A,N/A,'OK'
18,Fan5,Fan,Nominal,3000.00,RPM,200.00,300.00,400.00,N/A,N/A,N/A,'OK'
19,Fan6/CPU,Fan,Nominal,1500.00,RPM,200.00,300.00,400.00,N/A,N/A,N/A,'OK'
20,Intrusion,Physical Security,Nominal,N/A,N/A,N/A,N/A,N/A,N/A,N/A,N/A,'OK'
21,Power Supply,Power Supply,Nominal,N/A,N/A,N/A,N/A,N/A,N/A,N/A,N/A,'OK'
22,Thermal Trip,Module/Board,Nominal,N/A,N/A,N/A,N/A,N/A,N/A,N/A,N/A,'OK'
23,BIOS,System Firmware Progress,N/A,N/A,N/A,N/A,N/A,N/A,N/A,N/A,N/A,N/A

Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.