New posts in micro-optimization

What is the fastest way to find if a number is even or odd?

Why NASM on Linux changes registers in x86_64 assembly

When, if ever, is loop unrolling still useful?

Is it possible to tell the branch predictor how likely it is to follow the branch?

Is reading the `length` property of an array really that expensive an operation in JavaScript?

What is faster: in_array or isset? [closed]

Cost of exception handlers in Python

What C/C++ compiler can use push pop instructions for creating local variables, instead of just increasing esp once?

What methods can be used to efficiently extend instruction length on modern x86?

Branch alignment for loops involving micro-coded instructions on Intel SnB-family CPUs

What does `rep ret` mean?

Is there a penalty when base+offset is in a different page than the base?

Why does breaking the "output dependency" of LZCNT matter?

Should I use Java's String.format() if performance is important?

The advantages of using 32-bit registers/instructions in x86-64

Divide by 10 using bit shifts?

Is performance reduced when executing loops whose uop count is not a multiple of processor width?

Floating point division vs floating point multiplication

Which is the better option to use for dividing an integer number by 2?

INC instruction vs ADD 1: Does it matter?