¡@

Home 

c++ Programming Glossary: addsd

how to achieve 4 FLOPs per cycle

http://stackoverflow.com/questions/8389648/how-to-achieve-4-flops-per-cycle

Since that is true for packed addpd as well as the scalar addsd versions and sse registers can contain 2 double 's the throughput.. xmm7 xmm3 mulsd xmm6 xmm3 mulsd xmm5 xmm3 mulsd xmm1 xmm3 addsd xmm13 xmm2 addsd xmm12 xmm2 addsd xmm11 xmm2 addsd xmm10 xmm2.. xmm6 xmm3 mulsd xmm5 xmm3 mulsd xmm1 xmm3 addsd xmm13 xmm2 addsd xmm12 xmm2 addsd xmm11 xmm2 addsd xmm10 xmm2 addsd xmm9 xmm2..

Why is one loop so much slower than two loops?

http://stackoverflow.com/questions/8547778/why-is-one-loop-so-much-slower-than-two-loops

times in the full program movsd xmm0 mmword ptr edx 18h addsd xmm0 mmword ptr ecx 20h movsd mmword ptr ecx 20h xmm0 movsd.. mmword ptr ecx 20h xmm0 movsd xmm0 mmword ptr esi 10h addsd xmm0 mmword ptr eax 30h movsd mmword ptr eax 30h xmm0 movsd.. mmword ptr eax 30h xmm0 movsd xmm0 mmword ptr edx 20h addsd xmm0 mmword ptr ecx 28h movsd mmword ptr ecx 28h xmm0 movsd..