Disk overheated warnings notifications


Recommended Posts

I've been noticing the same sequence of overheating warnings in the last while. Usually the problem clears itself within 30 minutes.

 

I doubt it's a airflow issues as it's in the middle of Winter here, ambient temperature is around 21 degrees celsius and it happens always with the same disk, no others. It happens mostly when I'm away from the server so I can't inspect the activity leds on the disk trays to verify if in fact that the disk is being accessed.

 

Bad disk temp sensor? It is a seagate afterall.

 

 

Event: unRAID Disk 7 temperature
Subject: Alert [CLARABELL] - Disk 7 overheated (64 C)
Description: ST2000DM001-1CH164_XXX (sdn)
Importance: alert

 

Event: unRAID Disk 7 SMART health [190]
Subject: Warning [CLARABELL] - airflow temperature cel (failing now) is 64
Description: ST2000DM001-1CH164_XXX (sdn)
Importance: warning

 

Event: unRAID Disk 7 message
Subject: Notice [CLARABELL] - Disk 7 returned to normal temperature
Description: ST2000DM001-1CH164_XXX (sdn)
Importance: normal

 

Smart:

 

Last SMART test result:
Completed without error

 

1	Raw read error rate	0x000f	117	099	006	Pre-fail	Always	Never	147903544
3	Spin up time	0x0003	095	095	000	Pre-fail	Always	Never	0
4	Start stop count	0x0032	095	095	020	Old age	Always	Never	5823
5	Reallocated sector count	0x0033	100	100	036	Pre-fail	Always	Never	0
7	Seek error rate	0x000f	073	060	030	Pre-fail	Always	Never	8633930535
9	Power on hours	0x0032	072	072	000	Old age	Always	Never	25312 (2y, 10m, 18d, 16h)
10	Spin retry count	0x0013	100	100	097	Pre-fail	Always	Never	0
12	Power cycle count	0x0032	100	100	020	Old age	Always	Never	64
183	Runtime bad block	0x0032	099	099	000	Old age	Always	Never	1
184	End-to-end error	0x0032	100	100	099	Old age	Always	Never	0
187	Reported uncorrect	0x0032	100	100	000	Old age	Always	Never	0
188	Command timeout	0x0032	100	100	000	Old age	Always	Never	0 0 0
189	High fly writes	0x003a	100	100	000	Old age	Always	Never	0
190	Airflow temperature cel	0x0022	074	036	045	Old age	Always	In the past	26 (0 16 64 17 0)
191	G-sense error rate	0x0032	100	100	000	Old age	Always	Never	0
192	Power-off retract count	0x0032	100	100	000	Old age	Always	Never	59
193	Load cycle count	0x0032	026	026	000	Old age	Always	Never	148444
194	Temperature celsius	0x0022	026	064	000	Old age	Always	Never	26 (0 15 0 0 0)
197	Current pending sector	0x0012	100	100	000	Old age	Always	Never	0
198	Offline uncorrectable	0x0010	100	100	000	Old age	Offline	Never	0
199	UDMA CRC error count	0x003e	200	200	000	Old age	Always	Never	0
240	Head flying hours	0x0000	100	253	000	Old age	Offline	Never	7333h+02m+05.323s
241	Total lbas written	0x0000	100	253	000	Old age	Offline	Never	22294738058
242	Total lbas read	0x0000	100	253	000	Old age	Offline	Never	157154852761

Link to comment

Probably a false reading, you can try to change the poll settings, settings > disk settings > poll_attributes, change from the default 1800 secs (30 minutes) to something like 300, if the disk goes from normal temperature to >60C and back to normal in 5 minutes it’s definitely a false reading.

 

Did as you say, and I noticed the error today, luckily I caught it. While it reported the high temperature warning, I was able to check whether the disk was at all being accessed and no, nothing. Returned to normal after 5 mins. Must be the drive.

 

Although annoying not sure I want to replace the drive just yet, seems to be fine besides the high temps.

Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.