python (12.9k questions)
javascript (9.2k questions)
reactjs (4.7k questions)
java (4.2k questions)
java (4.2k questions)
c# (3.5k questions)
c# (3.5k questions)
html (3.3k questions)
AVX512: How to convert first 8 bytes into 8 64-bit integers?
I have an __m512i inputVector where each of the 64 bytes contains some offset. Next I need to add the first 8 byte offsets to 8 64-bit values stored in another __m512i variable (base). (In order to pr...
Linoliumz
Votes: 0
Answers: 2
Does anyone know of a fix for an MSVC compiler bug/annoyance where SIMD Extension settings get "stuck" on AVX?
Does anyone know of a fix for an MSVC compiler bug/annoyance where SIMD Extension settings get "stuck" on AVX?
The context of this question is coding up SIMD CPU dispatchers, closely followi...
dts
Votes: 0
Answers: 2
Efficient code to load AVX vectors for 1D convolution kernel of length 8
An implementation of a 1D convolution operation will often need to load a vectors of data that sequentially step through a buffer of data offset by one element each iteration.
For example, consider a ...
TrentP
Votes: 0
Answers: 2
Generate random numbers in a given range with AVX2, faster than SVML _mm256_rem_epu32 remainder?
I'm currently trying to implement an XOR_SHIFT Random Number Generator using AVX2, it's actually quite easy and very fast. However I need to be able to specify a range. This usually requires modulo.
T...
dave_thenerd
Votes: 0
Answers: 1