Afficher un message
Vieux 05/09/2007, 22h33   #1
Ignoramus22620
Aucun Avatar
 
Messages: n/a
Hébergeur:
Par défaut CPU overheats, Linux slows down, need some ideas.

Something happened on my server that made several processes do
intensive computations. (a bug in some of my perl scripts) That loaded
all cores of my two dual cored CPUs at 100% CPU. I got these messages
in /var/log/messages and a lot of "wall" messages:

Sep 5 13:17:09 ak74 kernel: CPU1: Running in modulated clock mode
Sep 5 13:17:18 ak74 kernel: CPU1: Temperature above threshold
Sep 5 13:17:18 ak74 kernel: CPU0: Temperature above threshold
Sep 5 13:17:18 ak74 kernel: CPU0: Running in modulated clock mode
Sep 5 13:17:18 ak74 kernel: CPU1: Running in modulated clock mode
Sep 5 13:17:27 ak74 kernel: CPU0: Temperature above threshold
Sep 5 13:17:27 ak74 kernel: CPU1: Temperature above threshold
Sep 5 13:17:27 ak74 kernel: CPU1: Running in modulated clock mode
Sep 5 13:17:27 ak74 kernel: CPU0: Running in modulated clock mode
Sep 5 13:17:38 ak74 kernel: CPU0: Temperature above threshold
Sep 5 13:17:38 ak74 kernel: CPU1: Temperature above threshold
Sep 5 13:17:38 ak74 kernel: CPU1: Running in modulated clock mode
Sep 5 13:17:38 ak74 kernel: CPU0: Running in modulated clock mode
Sep 5 13:17:44 ak74 kernel: CPU1: Temperature above threshold
Sep 5 13:17:44 ak74 kernel: CPU0: Temperature above threshold


So I think that Linux saved my ass and switched to modulated clock
mode instead of shutting down or burning my CPU. It simply slowed
down.

Thanks for that.

My question is, normally a computer can be very busy and should not
overheat. So something physical is wrong. Would you have any idea,
maybe I need to replace case fans or the CPU fan?

This is a 1U rackmount from Rackmounts Etc.

I called my ISP and they confirmed that they did not have any cooling
or heating issues on their racks.

thanks

i
  Réponse avec citation
 
Page generated in 0,05087 seconds with 9 queries