CPU load inhibits interrupts on Linux

Solution 1:

I suspect what's going on is that the CPU load is blocking from the interrupts being serviced by the CPU.

Use itop to see what's really going on. An output of this would help to understand your problem further.

Solution 2:

There may be some BIOS settings that are adjustable, but more information would be necessary to be certain exactly which settings would have the most significant impact.

The interrupts being required to exit the HLT state referenced in this answer seems interesting and possibly related? Based on the nature of the stress workload and if hyper threading is enabled on the system some of the 'virtual' CPUs that are added with hyper threading could conceivably be put into an HLT state until stress levels are lowered and it is possible that the hardware could chose to ignore interrupts to processes running on those CPUs.

Low level hardware configurations in BIOS and the OS kernel can have huge impacts on many different aspects of performance. Verifying that your system is configured to properly handle add-in cards(graphics, communications, etc) of any sort can be a major challenge. More details about the physical hardware and kernel settings, as well as any additional logs like Server/System Event Log(SEL) or Sensor Data Record(SDR) would be beneficial to further diagnose a low level hardware issue like this.

The SEL and SDR are available on most modern servers and can be accessed using ipmitool or a variety of other open source and proprietary/vendor provided tools for in-band and out-of-band server management.