Newbetuts
New posts in sse
How to constexpr initialize intrinsic SSE/AVX register?
c++
sse
constexpr
intrinsics
avx
Is there a way to cast integers to bytes, knowing these ints are in range of bytes. Using SSE?
assembly
x86-64
masm
sse
sse4
Compare 16 byte strings with SSE
c
gcc
x86
sse
simd
AVX scalar operations are much faster
c
memory
x86
sse
avx
Why does adding an xorps instruction make this function using cvtsi2ss and addss ~5x faster?
clang
x86-64
cpu-architecture
sse
microbenchmark
How much faster are SSE4.2 string instructions than SSE2 for memcmp?
assembly
x86
sse
micro-optimization
sse4
Which versions of Windows support/require which CPU multimedia extensions? (How to check if SSE or AVX are fully usable?)
windows
assembly
sse
avx
avx512
Get sum of values stored in __m256d with SSE/AVX
c++
optimization
sse
avx
avx2
SSE register return with SSE disabled
c
gcc
floating-point
sse
Load address calculation when using AVX2 gather instructions
x86
sse
simd
avx2
Getting started with Intel x86 SSE SIMD instructions
c
gcc
x86
sse
simd
sse2 instruction set not enabled
g++
sse
SSE intrinsic functions reference
c++
c
gcc
sse
simd
How are denormalized floats handled in C#?
c#
.net
performance
intel
sse
In GNU C inline asm, what are the size-override modifiers for xmm/ymm/zmm for a single operand?
c
gcc
sse
inline-assembly
avx512
Getting max value in a __m128i vector with SSE?
c
assembly
x86
sse
AVX/SSE version of xorshift128+
c
performance
sse
avx
SSE SSE2 and SSE3 for GNU C++
c++
optimization
simd
sse
sse2
Find largest element in matrix and its column and row indexes using SSE and AVX
c++
matrix
sse
avx
avx2
cpu dispatcher for visual studio for AVX and SSE
c++
visual-studio
sse
avx