Well, I rechecked the code several times (and took a walk several
times too :))
Originaly, i was using vecLib's "dotpr" function as a reference
point, and had a bit-to-bit match with dotpr's results.
Now, as i de-serialized the Sum, i am getting different results
Here is a test output showing the mismatches then relative error is
larger then 1%
You should probably use a double precision dot product as your
reference for correctness.