C++ Programming Glossary: xmm1

Compilation fails with OpenMP on Mac OS X Lion (memcpy and SSE intrinsics)

http://stackoverflow.com/questions/12932107/compilation-fails-with-openmp-on-mac-os-x-lion-memcpy-and-sse-intrinsics

48 rbp xmm0 movl 1 eax movaps xmm0 80 rbp movaps 80 rbp xmm1 movl eax edi callq ___builtin_ia32_shufpd movapd xmm0 32 rbp.. one integer in eax and two XMM arguments in xmm0 and xmm1 with the result being returned in xmm0 as per the SysV AMD64..
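For reference, the shuffle named in the excerpt above is normally reached through the SSE2 intrinsic `_mm_shuffle_pd`, which compilers usually lower to a single `shufpd` instruction rather than a call. The following is a minimal sketch, assuming an x86-64 target with SSE2 and `<emmintrin.h>` available; the function names `swap_halves` and `low_after_swap` are hypothetical and do not come from the linked question.

```cpp
#include <emmintrin.h>  // SSE2 intrinsics (assumes an x86/x86-64 target)

// Hypothetical helper: swap the low and high doubles of a 128-bit vector.
// With immediate 1, the low result lane takes the high lane of the first
// operand and the high result lane takes the low lane of the second operand.
inline __m128d swap_halves(__m128d v) {
    return _mm_shuffle_pd(v, v, 1);  // the immediate must be a compile-time constant
}

// Hypothetical helper: build the vector {lo, hi}, swap it, and return the
// value that ends up in the low lane.
double low_after_swap(double lo, double hi) {
    __m128d v = _mm_set_pd(hi, lo);        // _mm_set_pd takes the high element first
    return _mm_cvtsd_f64(swap_halves(v));  // extract the low lane
}
```

Calling `low_after_swap(1.0, 2.0)` yields the value that was originally in the high lane.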

Heisenbug: WinApi program crashes on some computers

http://stackoverflow.com/questions/132116/heisenbug-winapi-program-crashes-on-some-computers

Illegal Instruction. and code breaks on 0044A26A cvtsi2sd xmm1 dword ptr esp 14h So it seems that the problem was in the Code..

SSE2 Compiler Error

http://stackoverflow.com/questions/1874882/sse2-compiler-error

_asm mov eax 0 label movdqa xmm0 xmmword ptr t1 eax movdqa xmm1 xmmword ptr t2 eax pmuludq xmm0 xmm1 movdqa mul1 xmm0 movdqa xmm0 xmmword ptr t1 eax pshufd xmm0 xmm0 05fh pshufd xmm1 xmm1 05fh muludq xmm0 xmm1 movdqa mul2 xmm0 add eax 16 cmp eax..
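The pattern in the excerpt above (one `pmuludq` on the original registers, then a `pshufd` on each operand followed by a second multiply) can also be written with intrinsics instead of inline assembly. This is a rough sketch, assuming an x86-64 target with SSE2; the function name `mul4_u32` and the output layout are hypothetical, and the shuffle constant used here (`_MM_SHUFFLE(3, 3, 1, 1)`) is the one this sketch needs, not necessarily the one in the question.

```cpp
#include <cstdint>
#include <emmintrin.h>  // SSE2 intrinsics (assumes an x86/x86-64 target)

// Hypothetical helper: out[i] = (uint64_t)a[i] * b[i] for i = 0..3.
void mul4_u32(const std::uint32_t a[4], const std::uint32_t b[4],
              std::uint64_t out[4]) {
    __m128i va = _mm_loadu_si128(reinterpret_cast<const __m128i*>(a));
    __m128i vb = _mm_loadu_si128(reinterpret_cast<const __m128i*>(b));

    // PMULUDQ multiplies only 32-bit lanes 0 and 2, giving two 64-bit products.
    __m128i even = _mm_mul_epu32(va, vb);  // {a0*b0, a2*b2}

    // PSHUFD moves lanes 1 and 3 into positions 0 and 2 for the second multiply.
    __m128i va_odd = _mm_shuffle_epi32(va, _MM_SHUFFLE(3, 3, 1, 1));
    __m128i vb_odd = _mm_shuffle_epi32(vb, _MM_SHUFFLE(3, 3, 1, 1));
    __m128i odd = _mm_mul_epu32(va_odd, vb_odd);  // {a1*b1, a3*b3}

    // Store both halves and interleave them back into element order.
    std::uint64_t e[2], o[2];
    _mm_storeu_si128(reinterpret_cast<__m128i*>(e), even);
    _mm_storeu_si128(reinterpret_cast<__m128i*>(o), odd);
    out[0] = e[0];
    out[1] = o[0];
    out[2] = e[1];
    out[3] = o[1];
}
```

Unaligned loads and stores are used so the sketch makes no assumption about how the caller's arrays are aligned; `movdqa`, as in the excerpt, requires 16-byte alignment.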

Fast multiplication/division by 2 for floats and doubles (C/C++)

http://stackoverflow.com/questions/7720668/fast-multiplication-division-by-2-for-floats-and-doubles-c-c

of movupd xmm0 xmmword ptr ecx lea ecx ecx 10h mulpd xmm0 xmm1 movupd xmmword ptr ecx 10h xmm0 Versus 2.4 seconds otherwise.. still worse with 3.0 seconds with an inner loop of movsd xmm1 mmword ptr esp eax 8 38h mulsd xmm1 xmm0 movsd mmword ptr esp eax 8 38h xmm1 inc eax VC10 without..
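The packed inner loop quoted above (`movupd` / `mulpd` / `movupd`) corresponds roughly to the following intrinsics version. It is only a sketch, assuming an x86-64 target with SSE2; the function name `scale_by_two` is hypothetical, and no timing claim is made for it.

```cpp
#include <cstddef>
#include <emmintrin.h>  // SSE2 intrinsics (assumes an x86/x86-64 target)

// Hypothetical helper: multiply every element of data[0..n) by 2.0 in place.
void scale_by_two(double* data, std::size_t n) {
    const __m128d two = _mm_set1_pd(2.0);  // {2.0, 2.0}
    std::size_t i = 0;

    // Packed path: two doubles per iteration (unaligned load, mulpd, unaligned store).
    for (; i + 2 <= n; i += 2) {
        __m128d v = _mm_loadu_pd(data + i);
        _mm_storeu_pd(data + i, _mm_mul_pd(v, two));
    }

    // Scalar tail for an odd element count.
    for (; i < n; ++i) {
        data[i] *= 2.0;
    }
}
```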

how to achieve 4 FLOPs per cycle

http://stackoverflow.com/questions/8389648/how-to-achieve-4-flops-per-cycle

xmm3 mulsd xmm7 xmm3 mulsd xmm6 xmm3 mulsd xmm5 xmm3 mulsd xmm1 xmm3 addsd xmm13 xmm2 addsd xmm12 xmm2 addsd xmm11 xmm2 addsd xmm10 xmm2 addsd xmm9 xmm2 cmp..