identify CPU socket because higher temps

vic1707

Dabbler
Joined
Jan 28, 2019
Messages
46
Hello, I have some thermal problems with my CPUs.

As you can see in the screenshot, one of them is running a lot hotter at idle. I even reapplied the thermal paste multiple times, the same way for both of them.
My question might seem stupid, but is it cpu0 or cpu1 that is hotter? (I think it's cpu1, but it would be nice if you could confirm.)
BTW the server is a DL380G7 with 2x Xeon X5670, if that helps in any way.


cpu temps.png
 

Patrick M. Hausen

Hall of Famer
Joined
Nov 25, 2013
Messages
7,776
Possibly the output of ipmitool sensor is helpful. I am not familiar with these Dell systems, but it's server-grade hardware, so there should be IPMI. Also ipmitool sel list.
 

vic1707


Thanks for the answer, here is the output of the command:

Code:
  NASvic# ipmitool sel list
   1 | 05/05/2011 | 00:25:05 | Power Supply #0x04 | Failure detected | Asserted
   2 | 07/20/2018 | 20:23:58 | OS Boot | C: boot completed | Asserted
   3 | 07/20/2018 | 20:23:58 | OEM record dc | 000137 | 00db44525b00
   4 | 07/20/2018 | 20:25:38 | OS Critical Stop | OS graceful shutdown | Asserted
   5 | 07/20/2018 | 20:25:38 | OEM record dd | 000137 | 000000000500
   6 | 07/20/2018 | 20:28:50 | OS Boot | C: boot completed | Asserted
   7 | 07/20/2018 | 20:28:50 | OEM record dc | 000137 | 00ff45525b00
   8 | 05/16/2019 | 17:01:23 | OS Boot | C: boot completed | Asserted
   9 | 05/16/2019 | 17:01:23 | OEM record dc | 000137 | 005f97dd5c00
   a | 11/06/2019 | 09:49:12 | OS Boot | C: boot completed | Asserted
   b | 11/06/2019 | 09:49:12 | OEM record dc | 000137 | 001497c25d00
   c |  Pre-Init  |0000000672| Memory #0x2b | Uncorrectable ECC | Asserted
   d | 11/06/2019 | 10:17:42 | OS Boot | C: boot completed | Asserted
   e | 11/06/2019 | 10:17:42 | OEM record dc | 000137 | 00c29dc25d00
   f |  Pre-Init  |0000000333| Memory #0x2b | Uncorrectable ECC | Asserted
  10 | 01/02/2020 | 14:10:47 | OS Boot | C: boot completed | Asserted
  11 | 01/02/2020 | 14:10:47 | OEM record dc | 000137 | 00d1eb0d5e00
  12 | 01/02/2020 | 14:18:54 | OS Critical Stop | OS graceful shutdown | Asserted
  13 | 01/02/2020 | 14:18:54 | OEM record dd | 000137 | 000200018400
  14 |  Pre-Init  |0000000237| Memory #0x2b | Uncorrectable ECC | Asserted
  15 | 01/02/2020 | 13:45:44 | OS Boot | C: boot completed | Asserted
  16 | 01/02/2020 | 13:45:44 | OEM record dc | 000137 | 0005f40d5e00
  17 | 01/02/2020 | 13:58:00 | OS Critical Stop | OS graceful shutdown | Asserted
  18 | 01/02/2020 | 13:58:00 | OEM record dd | 000137 | 000000008500
  19 | 01/10/2020 | 12:31:41 | OS Boot | C: boot completed | Asserted
  1a | 01/10/2020 | 12:31:41 | OEM record dc | 000137 | 00ab6e185e00
  1b | 01/10/2020 | 12:34:52 | OS Critical Stop | OS graceful shutdown | Asserted
  1c | 01/10/2020 | 12:34:52 | OEM record dd | 000137 | 000000000500
  1d | 01/10/2020 | 12:44:39 | OS Boot | C: boot completed | Asserted
  1e | 01/10/2020 | 12:44:39 | OEM record dc | 000137 | 00b571185e00
  1f | 01/10/2020 | 12:51:44 | OS Critical Stop | OS graceful shutdown | Asserted
  20 | 01/10/2020 | 12:51:44 | OEM record dd | 000137 | 000000000500
  21 | 01/11/2020 | 13:34:15 | OS Boot | C: boot completed | Asserted
  22 | 01/11/2020 | 13:34:15 | OEM record dc | 000137 | 00d5ce195e00
  23 | 01/11/2020 | 13:49:38 | OS Critical Stop | OS graceful shutdown | Asserted
  24 | 01/11/2020 | 13:49:38 | OEM record dd | 000137 | 000000000500
  25 | 01/11/2020 | 13:54:50 | OS Boot | C: boot completed | Asserted
  26 | 01/11/2020 | 13:54:50 | OEM record dc | 000137 | 00a8d3195e00
  27 | 01/11/2020 | 14:14:30 | OS Critical Stop | OS graceful shutdown | Asserted
  28 | 01/11/2020 | 14:14:30 | OEM record dd | 000137 | 000000000500
  29 | 03/29/2020 | 13:11:15 | OS Boot | C: boot completed | Asserted
  2a | 03/29/2020 | 13:11:15 | OEM record dc | 000137 | 005f9e805e00
  2b |  Pre-Init  |0000000101| Fan #0x06 | Transition to Off Line | Asserted
  2c |  Pre-Init  |0000000112| Fan #0x06 | Transition to Off Line | Asserted
  2d |  Pre-Init  |0000000090| Fan #0x06 | Transition to Off Line | Asserted
  2e |  Pre-Init  |0000000091| Fan #0x06 | Transition to Off Line | Asserted
  2f |  Pre-Init  |0000000093| Fan #0x08 | Transition to Off Line | Asserted
  30 |  Pre-Init  |0000000109| Fan #0x0a | Transition to Running | Deasserted
  31 |  Pre-Init  |0000000110| Fan #0x0a | Transition to Running | Deasserted
  32 |  Pre-Init  |0000000117| Fan #0x0a | Transition to Running | Deasserted
  33 |  Pre-Init  |0000000118| Fan #0x0a | Transition to Running | Deasserted
  34 |  Pre-Init  |0000000121| Fan #0x0a | Transition to Running | Deasserted
  35 |  Pre-Init  |0000000102| Fan #0x08 | Transition to Off Line | Asserted
  36 |  Pre-Init  |0000000116| Fan #0x0a | Transition to Running | Deasserted
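A log like the one above is easier to scan once it is grouped by sensor and event. The following is only a rough sketch in Python, assuming the pipe-separated layout shown in the listing; the function name `summarize_sel` is made up for illustration, and OEM records are skipped because their payload is not decoded here.

```python
from collections import Counter

def summarize_sel(text):
    """Count SEL entries per (sensor, event), ignoring OEM records."""
    counts = Counter()
    for line in text.splitlines():
        fields = [f.strip() for f in line.split("|")]
        if len(fields) < 5:
            continue  # shell prompt, blank line, or anything unexpected
        sensor, event = fields[3], fields[4]
        if sensor.startswith("OEM record"):
            continue  # vendor-specific payload, not interpreted in this sketch
        counts[(sensor, event)] += 1
    return counts

# A few lines copied from the listing above.
sample = """\
   c |  Pre-Init  |0000000672| Memory #0x2b | Uncorrectable ECC | Asserted
   d | 11/06/2019 | 10:17:42 | OS Boot | C: boot completed | Asserted
   e | 11/06/2019 | 10:17:42 | OEM record dc | 000137 | 00c29dc25d00
   f |  Pre-Init  |0000000333| Memory #0x2b | Uncorrectable ECC | Asserted
  2b |  Pre-Init  |0000000101| Fan #0x06 | Transition to Off Line | Asserted
"""

for (sensor, event), n in summarize_sel(sample).most_common():
    print(f"{n:3d}  {sensor}: {event}")
```

Fed the full output, this would make the repeated Memory #0x2b and Fan entries stand out from the routine boot and shutdown records.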

BTW a DL380G7 is an HP server, not a Dell ;)
 

Patrick M. Hausen

What does ipmitool sensor say?
 

vic1707

Here it comes:
Code:
NASvic# ipmitool sensor
UID Light        | 0x0        | discrete   | 0x0080| na        | na        | na        | na        | na        | na
Sys. Health LED  | 0x0        | discrete   | 0x0080| na        | na        | na        | na        | na        | na
Power Supply 1   | 115        | Watts      | ok    | na        | na        | na        | na        | na        | na
Power Supply 2   | 135        | Watts      | ok    | na        | na        | na        | na        | na        | na
Power Supplies   | 0x0        | discrete   | 0x0180| na        | na        | na        | na        | na        | na
Fan 1            | 78.400     | percent    | ok    | na        | na        | na        | na        | na        | na
Fan 2            | 78.400     | percent    | ok    | na        | na        | na        | na        | na        | na
Fan 3            | 90.160     | percent    | ok    | na        | na        | na        | na        | na        | na
Fan 4            | 78.400     | percent    | ok    | na        | na        | na        | na        | na        | na
Fan 5            | 78.400     | percent    | ok    | na        | na        | na        | na        | na        | na
Fan 6            | 78.400     | percent    | ok    | na        | na        | na        | na        | na        | na
Fans             | 0x0        | discrete   | 0x0180| na        | na        | na        | na        | na        | na
Temp 1           | 23.000     | degrees C  | ok    | na        | na        | na        | na        | 41.000    | 45.000
Temp 2           | 40.000     | degrees C  | ok    | na        | na        | na        | na        | 82.000    | 83.000
Temp 3           | 40.000     | degrees C  | ok    | na        | na        | na        | na        | 82.000    | 83.000
Temp 4           | 47.000     | degrees C  | ok    | na        | na        | na        | na        | 87.000    | 92.000
Temp 5           | 42.000     | degrees C  | ok    | na        | na        | na        | na        | 87.000    | 92.000
Temp 6           | 56.000     | degrees C  | ok    | na        | na        | na        | na        | 87.000    | 92.000
Temp 7           | 52.000     | degrees C  | ok    | na        | na        | na        | na        | 87.000    | 92.000
Temp 8           | 42.000     | degrees C  | ok    | na        | na        | na        | na        | 90.000    | 95.000
Temp 9           | 41.000     | degrees C  | ok    | na        | na        | na        | na        | 65.000    | 70.000
Temp 10          | 55.000     | degrees C  | ok    | na        | na        | na        | na        | 90.000    | 95.000
Temp 11          | 41.000     | degrees C  | ok    | na        | na        | na        | na        | 70.000    | 75.000
Temp 12          | 47.000     | degrees C  | ok    | na        | na        | na        | na        | 90.000    | 95.000
Temp 13          | 39.000     | degrees C  | ok    | na        | na        | na        | na        | 70.000    | 75.000
Temp 14          | 33.000     | degrees C  | ok    | na        | na        | na        | na        | 70.000    | 75.000
Temp 15          | 31.000     | degrees C  | ok    | na        | na        | na        | na        | 70.000    | 75.000
Temp 16          | 31.000     | degrees C  | ok    | na        | na        | na        | na        | 70.000    | 75.000
Temp 17          | 35.000     | degrees C  | ok    | na        | na        | na        | na        | 70.000    | 75.000
Temp 18          | 33.000     | degrees C  | ok    | na        | na        | na        | na        | 70.000    | 75.000
Temp 19          | 30.000     | degrees C  | ok    | na        | na        | na        | na        | 70.000    | 75.000
Temp 20          | 35.000     | degrees C  | ok    | na        | na        | na        | na        | 70.000    | 75.000
Temp 21          | 41.000     | degrees C  | ok    | na        | na        | na        | na        | 80.000    | 85.000
Temp 22          | 44.000     | degrees C  | ok    | na        | na        | na        | na        | 80.000    | 85.000
Temp 23          | 57.000     | degrees C  | ok    | na        | na        | na        | na        | 77.000    | 82.000
Temp 24          | 43.000     | degrees C  | ok    | na        | na        | na        | na        | 70.000    | 75.000
Temp 25          | 41.000     | degrees C  | ok    | na        | na        | na        | na        | 70.000    | 75.000
Temp 26          | 39.000     | degrees C  | ok    | na        | na        | na        | na        | 70.000    | 75.000
Temp 27          | 30.000     | degrees C  | ok    | na        | na        | na        | na        | 70.000    | 75.000
Temp 28          | 32.000     | degrees C  | ok    | na        | na        | na        | na        | 70.000    | 75.000
Temp 29          | 59.000     | degrees C  | ok    | na        | na        | na        | na        | 60.000    | 65.000
Temp 30          | 90.000     | degrees C  | ok    | na        | na        | na        | na        | 110.000   | 115.000
Memory           | 0x0        | discrete   | 0x4080| na        | na        | na        | na        | na        | na
Power Meter      | 278        | Watts      | ok    | na        | na        | na        | na        | na        | na
Cntlr 1 Bay 1    | 0x5        | discrete   | 0x0580| na        | na        | na        | na        | na        | na
Cntlr 1 Bay 2    | 0x5        | discrete   | 0x0580| na        | na        | na        | na        | na        | na
Cntlr 1 Bay 3    | 0x5        | discrete   | 0x0580| na        | na        | na        | na        | na        | na
Cntlr 1 Bay 4    | 0x1        | discrete   | 0x0180| na        | na        | na        | na        | na        | na
Cntlr 2 Bay 5    | 0x5        | discrete   | 0x0580| na        | na        | na        | na        | na        | na
Cntlr 2 Bay 6    | 0x5        | discrete   | 0x0580| na        | na        | na        | na        | na        | na
Cntlr 2 Bay 7    | 0x5        | discrete   | 0x0580| na        | na        | na        | na        | na        | na
Cntlr 2 Bay 8    | 0x5        | discrete   | 0x0580| na        | na        | na        | na        | na        | na
Cntlr 3 Bay 1    | 0x5        | discrete   | 0x0580| na        | na        | na        | na        | na        | na
Cntlr 3 Bay 2    | 0x5        | discrete   | 0x0580| na        | na        | na        | na        | na        | na
Cntlr 3 Bay 3    | 0x5        | discrete   | 0x0580| na        | na        | na        | na        | na        | na
Cntlr 3 Bay 4    | 0x5        | discrete   | 0x0580| na        | na        | na        | na        | na        | na
Cntlr 4 Bay 5    | 0x5        | discrete   | 0x0580| na        | na        | na        | na        | na        | na
Cntlr 4 Bay 6    | 0x5        | discrete   | 0x0580| na        | na        | na        | na        | na        | na
Cntlr 4 Bay 7    | 0x5        | discrete   | 0x0580| na        | na        | na        | na        | na        | na
Cntlr 4 Bay 8    | 0x5        | discrete   | 0x0580| na        | na        | na        | na        | na        | na
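Since the sensors are only named "Temp N", one way to get something out of this table without knowing what each sensor measures is to rank the temperature rows by how close the reading is to its threshold. The sketch below is in Python and rests on an assumption: that the ninth column is the upper critical threshold, as in the usual ipmitool sensor column order. The function name `temp_headroom` is invented for this example.

```python
def temp_headroom(text):
    """Return (name, reading, upper_critical, headroom) tuples, tightest first."""
    rows = []
    for line in text.splitlines():
        fields = [f.strip() for f in line.split("|")]
        if len(fields) != 10 or fields[2] != "degrees C":
            continue  # not a temperature row
        try:
            reading = float(fields[1])
            upper_crit = float(fields[8])  # assumed: upper critical threshold
        except ValueError:
            continue  # reading or threshold reported as "na"
        rows.append((fields[0], reading, upper_crit, upper_crit - reading))
    return sorted(rows, key=lambda r: r[3])

# Four rows copied from the output above.
sample = """\
Temp 1           | 23.000     | degrees C  | ok    | na        | na        | na        | na        | 41.000    | 45.000
Temp 23          | 57.000     | degrees C  | ok    | na        | na        | na        | na        | 77.000    | 82.000
Temp 29          | 59.000     | degrees C  | ok    | na        | na        | na        | na        | 60.000    | 65.000
Temp 30          | 90.000     | degrees C  | ok    | na        | na        | na        | na        | 110.000   | 115.000
"""

for name, reading, limit, headroom in temp_headroom(sample):
    print(f"{name:8s} {reading:6.1f} C  limit {limit:6.1f} C  headroom {headroom:5.1f} C")
```

On these rows Temp 29 comes out first, with 1 degree of headroom. Which component Temp 29 actually sits on is not something this output reveals.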
 

Patrick M. Hausen

OK, not too much detail. Does the iLO web interface tell you anything in terms of CPU #1 temperature vs. CPU #2?
These sensor readings are essentially the raw data that the iLO interface uses, but again, Dell or HP ... I run Fujitsu and Supermicro, sorry. ;)
And once you have found a CPU number, there's hopefully a reference printed on the mainboard or the inside of the chassis cover, or even in the documentation. That's how I would go about it with my servers ...
 

vic1707

No, unfortunately the iLO is stuck at 40°C for both CPUs...
Yeah, in fact the server isn't in its stock case anymore (I needed to silence it because of the family...), so I'm almost sure that the problem comes from the new DIY case... But I want to know which CPU I need to focus the airflow on, or whether the problem is elsewhere.
 

Samuel Tai

Never underestimate your own stupidity
Moderator
Joined
Apr 24, 2020
Messages
5,399
Here's a quick & dirty method to check: take some canned air and spray one of the CPUs' heat sinks. See which set of temps goes down.
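To make "see which set of temps goes down" less of an eyeball exercise, two captures of ipmitool sensor output (one before spraying, one right after) can be compared. This is a hedged Python sketch: `read_temps` and `biggest_drops` are hypothetical helper names, the 2-degree cutoff is arbitrary, and the "after" values in the sample are invented purely to show the mechanics.

```python
def read_temps(text):
    """Map sensor name to reading for every 'degrees C' row."""
    temps = {}
    for line in text.splitlines():
        fields = [f.strip() for f in line.split("|")]
        if len(fields) >= 3 and fields[2] == "degrees C":
            try:
                temps[fields[0]] = float(fields[1])
            except ValueError:
                pass  # reading reported as "na"
    return temps

def biggest_drops(before, after, min_drop=2.0):
    """List (name, drop) for sensors that cooled by at least min_drop degrees."""
    b, a = read_temps(before), read_temps(after)
    drops = [(name, b[name] - a[name]) for name in b if name in a]
    return sorted((d for d in drops if d[1] >= min_drop), key=lambda d: -d[1])

# "before" rows are shortened from the output earlier in the thread;
# the "after" readings are made up for illustration only.
before = """\
Temp 4 | 47.000 | degrees C | ok
Temp 6 | 56.000 | degrees C | ok
Temp 7 | 52.000 | degrees C | ok
"""
after = """\
Temp 4 | 47.000 | degrees C | ok
Temp 6 | 49.000 | degrees C | ok
Temp 7 | 51.000 | degrees C | ok
"""

for name, drop in biggest_drops(before, after):
    print(f"{name}: dropped {drop:.1f} C")
```

The sensors that show up in the list after cooling one heat sink are the ones most likely tied to that socket.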
 

vic1707

That would indeed work XD, but I finally found it: I inverted the front fans and now it's cores 0-11 that are hotter, so it's a fan problem ^^
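For comparing "cores 0-11" against the rest in numbers, per-core readings can be averaged per block of logical CPUs. The Python sketch below carries several assumptions: that the OS exposes lines of the form dev.cpu.N.temperature: 45.0C (as FreeBSD's coretemp driver does), that each X5670 contributes 12 logical CPUs (6 cores with Hyper-Threading), and that logical CPUs 0-11 belong to one socket and 12-23 to the other. The function name and all sample readings are invented.

```python
import re
from statistics import mean

# Assumed line format, e.g. "dev.cpu.3.temperature: 45.0C"
LINE = re.compile(r"dev\.cpu\.(\d+)\.temperature:\s*([\d.]+)C")

def mean_temp_by_group(text, cpus_per_group=12):
    """Average per-core temperatures over consecutive blocks of logical CPUs."""
    groups = {}
    for m in LINE.finditer(text):
        cpu, temp = int(m.group(1)), float(m.group(2))
        groups.setdefault(cpu // cpus_per_group, []).append(temp)
    return {g: mean(v) for g, v in sorted(groups.items())}

# Invented readings, only to show the grouping.
sample = """\
dev.cpu.0.temperature: 52.0C
dev.cpu.11.temperature: 54.0C
dev.cpu.12.temperature: 40.0C
dev.cpu.23.temperature: 42.0C
"""

for group, avg in mean_temp_by_group(sample).items():
    print(f"logical CPUs {group * 12}-{group * 12 + 11}: mean {avg:.1f} C")
```

If the hot block moves when the fans are changed, as described above, that points at airflow rather than at one particular CPU or its paste.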
 