python (12.9k questions)
javascript (9.2k questions)
reactjs (4.7k questions)
java (4.2k questions)
java (4.2k questions)
c# (3.5k questions)
c# (3.5k questions)
html (3.3k questions)
How does gpu acceleration work for instructions mot in GPU
I’ve heard that gpus can only compute simple instructions but do them in a parallel matter, which makes them suited for machine learning.
What happens if a pytorch tensor is in gpu but the type of com...
Danny Han
Votes: 0
Answers: 0
Are there bit (un)packing instructions in common x86_64 processor line mmx command for bytes?
dead hachers!
Background:
Maybe you know mmx packing/unpacking instruction set, what howewer works on bytes and larger volumes, like shorts, words, etc.
Quesion:
Are there any instructions for any x8...

Peter Tschischik
Votes: 0
Answers: 0
ARM Link register - non-leaf subroutine
I am wondering about, where the Link register is used in ARM CPU. As I understand it is storing return address of functions. But does every return address go to this register after function call or it...
Bratw
Votes: 0
Answers: 2
Organizing multiple implementations (for SIMD)
This is admittedly an open-ended/subjective question but I am looking for different ideas on how to "organize" multiple alternative implementations of the same functions.
I have a set of sev...

Matthew M.
Votes: 0
Answers: 1