I'm translating some code I wrote 3 years ago, scratching my head
because it is my first week in SSE.
RGB to YUV conversion that and now needs to convert it to SSE code
// Y = ( 8414 R + 16519 G + 3208 B)/32768 + 16
// Cb = (-4857 R - 9535 G + 14392 B)/32768 + 128
// Cr = (14392 R - 12052 G - 2341 B)/32768 + 128
// Convert the first three input vectors. Note that
// only the top 16 bits of the 32 bit product are
// stored. This is the same as doing the divide by 32768.
I was wondering if vec_mradds can be translated to the SSE3
_mm_maddubs_epi16
I SSE3 is core 2 duo and XEON 5100 which is fine for me.
or has anyone suggestions how I can translate above code?
regards,
marc
_______________________________________________
Do not post admin requests to the list. They will be ignored.
PerfOptimization-dev mailing list (email@hidden)
Help/Unsubscribe/Update your Subscription:
This email sent to email@hidden