LSI MegaRAID Expected Chip Temperature?

We recently built a replicating SAN array from 2x Dell R720XD's, we are using LSI 9270-8i MegaRAID cards with CacheCade 2.0, BBU and Write Back cache enabled.

Our cards are showing HUGE chip temperatures (97*C+ with NO disk activity!).

Our R720's are in auto temp management mode so the max exhaust temp is 50*C.

The MegaRAID cards are passively cooled and depend on good airflow to cool them - however is 97*C normal? - I have seen reference to 60*C max ambients but nothing for chip temp.


http://www.lsi.com/downloads/Public/Host%20Bus%20Adapters/9206-16e_HBA_TemperatureAirflow_Application_Note.pdf

This seems to give some idea about temperature ranges, although for a different chip. ~100°C is high and dangerously close to the limit but still within spec. I have a similar issue with a 9201-16i card. These chips have a 2000000 hours MTBF but at such high idle temperatures I doubt they can last :(

I considered replacing the heat-sink with one from a retired video card. If anybody had succeeded in removing it safely from the chip would be nice to write a few lines about the procedure. Looks to be glued with some epoxy, not just a thermal compound easily detachable. This results in a high risk of breaking the BGA itself or the soldering to the board...


Well, LSI's response was hardly a solution or even useful:

I do not see anything in the logs that might indicates an overheating of the controller card. The 97 degree Celsius is still withing the range of the temperature threshold of the ROC which will be 115. The main temperature to watch will be the ambient temperature inside the server which requires at least 200 LFPM of airflow from the fans to stay at the required threshold. Please find these required conditions below.

For the MegaRAID SAS 9270-8i RAID controller, the operating (thermal and atmospheric) conditions are as follows:

Relative humidity range is 20 percent to 80 percent noncondensing.

Airflow must be at least 200 linear feet per minute (LFPM) to avoid operating the LSISAS2208 processor above the maximum ambient temperature.

Temperature range: — +10 °C to +45 °C (with BBU) — +10 °C to +55 °C (with LSIiBBU09 mode 1 through 5)

The parameters for the nonoperating (such as storage and transit) environment for these controllers are as follows:

Relative humidity range is 5 percent to 90 percent noncondensing.

Temperature range: – 40 °C to +70 °C (without BBU) — 0 °C to +45 °C (with BBU

Thank you

Our unit's are providing 240LFPM of airflow on low-fan speed setting, ambients are 18*C and chassis temp is much the same - appears they aren't going to admit this is a manufacturing fault - no silicon should ever run at this temperature at idle.


While I can't explain why you are idling so high, I can say this is a problem we have been dealing with for years on our servers. A server we just recently retired with a MegaRAID 9280 controller idled around 87 degrees. What was really a pain was that the battery backup unit would constantly overheat and fry out in about a year because of the heat. Since the replacement units were nearly $300, this added what seemed like an unnecessary amount of risk and expense to our TCO.

We seem to have finally resolved the issue by purchasing a cheap Antec Cyclone blower; basically a single slot fan that sits facing the heat sink of the RAID card and constantly blows hot air out the back of the box. We found that it lowered the idle temp of the card by a whopping 26 degrees Celsius. There are a number of similar products on the market to choose from, and can generally be purchased for under $10.


I know this is a very old post, but since its the top result in a google search I thought I'd chime in for somebody else's benefit. The LSI 9270 series RAID cards are designed to have high concentrated server-style airstreams blown past them at relatively high velocities. For example at idle in my tower-style server with the front, back and top fans going my 9270-8i idles at 89C. When I rigged a Noctua 60mm fan to blow at right angles to the heatsink (by screwing the fan into heatsink of a nearby video card) it dropped sharply to 59C at idle and 70c under load, when I replaced its dried up heatsink grease with IC Diamond the idle temps dropped to 54C and load temps to 59C.


That's really not normal no, what's weirder is that they're both doing it - must be a firmware/driver thing I guess - have you spoken to LSI or Dell about it?