Newbetuts
.
New posts in micro-optimization
How much faster are SSE4.2 string instructions than SSE2 for memcmp?
assembly
x86
sse
micro-optimization
sse4
How to force NASM to encode [1 + rax*2] as disp32 + index*2 instead of disp8 + base + index?
assembly
x86
nasm
micro-optimization
machine-code
Do java finals help the compiler create more efficient bytecode? [duplicate]
java
optimization
micro-optimization
' ... != null' or 'null != ....' best performance?
java
performance
micro-optimization
"enter" vs "push ebp; mov ebp, esp; sub esp, imm" and "leave" vs "mov esp, ebp; pop ebp"
assembly
x86
stack
micro-optimization
stack-frame
Cycles/cost for L1 Cache hit vs. Register on x86?
performance
x86
cpu-architecture
cpu-cache
micro-optimization
Is it useful to use VZEROUPPER if your program+libraries contain no SSE instructions?
performance
assembly
x86
avx
micro-optimization
Is vxorps-zeroing on AMD Jaguar/Bulldozer/Zen faster with xmm registers than ymm?
assembly
x86
avx
micro-optimization
amd-processor
Does using xor reg, reg give advantage over mov reg, 0? [duplicate]
assembly
x86
micro-optimization
Using Intrinsics to Extract And Shift Odd/Even Bits
c++
bit-manipulation
intrinsics
micro-optimization
Weird use of `?:` in `typeid` code
c++
conditional-operator
micro-optimization
typeid
How to force GCC to assume that a floating-point expression is non-negative?
c++
gcc
assembly
floating-point
micro-optimization
Micro Optimization of a 4-bucket histogram of a large array or list
c#
optimization
histogram
simd
micro-optimization
Which of these pieces of code is faster in Java?
java
performance
micro-optimization
Fast method to copy memory with translation - ARGB to BGR
c
x86
rgb
sse
micro-optimization
Weird performance effects from nearby dependent stores in a pointer-chasing loop on IvyBridge. Adding an extra load speeds it up?
assembly
x86
micro-optimization
microbenchmark
micro-architecture
Is it more efficient to perform a range check by casting to uint instead of checking for negative values?
c#
performance
micro-optimization
numeric-conversion
range-checking
Fastest way to strip all non-printable characters from a Java String
java
string
performance
optimization
micro-optimization
Why can't GCC generate an optimal operator== for a struct of two int32s?
c++
gcc
x86-64
compiler-optimization
micro-optimization
Why does my application spend 24% of its life doing a null check?
c#
performance
optimization
il
micro-optimization
Prev
Next