Has something changed? I recently profiled some G3 code, and now
Shark 4.3.2 reports that __fres() has a 14-cycle latency and/or is
not pipelined! The code generated by GCC 3.3 is evidently scheduled as
if this instruction had a 5-cycle latency, the same as __frsqrtes(),
__fnmsubs(), etc. Can I get some clarification of how this intrinsic
behaves on ppc, ppc7400, and ppc970? I understand its accuracy
limits, but now certain G3 versions of my vec functions appear to
have huge pipeline bubbles. :\
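
For reference, here is a minimal sketch of the kind of test that could separate "long latency" from "not pipelined". It is not taken from my vec code. The names recip_est(), dependent_chain() and independent_chains() are hypothetical, the use of <ppc_intrinsics.h> for __fres() is an assumption about the Apple GCC setup, and the plain divide is only a stand-in so the sketch builds on other machines.

```c
#include <stddef.h>

/* Hypothetical wrapper around the reciprocal estimate.
   Assumption: on Apple GCC for PowerPC, <ppc_intrinsics.h> declares __fres().
   Elsewhere a plain divide stands in so the sketch still compiles. */
#if defined(__ppc__) && defined(__APPLE__)
#include <ppc_intrinsics.h>
static inline float recip_est(float x) { return __fres(x); }
#else
static inline float recip_est(float x) { return 1.0f / x; }
#endif

/* n estimates where each one consumes the previous result.
   Time per operation here approximates the instruction's latency. */
float dependent_chain(float x, size_t n)
{
    size_t i;
    for (i = 0; i < n; i++)
        x = recip_est(x);
    return x;
}

/* n estimates (n a multiple of 4) split over four chains that do not
   depend on each other. If the unit is pipelined, time per operation
   should drop well below the dependent case; if it is not, the two
   should come out roughly the same. */
float independent_chains(float x, size_t n)
{
    float a = x, b = x + 1.0f, c = x + 2.0f, d = x + 3.0f;
    size_t i;
    for (i = 0; i < n; i += 4) {
        a = recip_est(a);
        b = recip_est(b);
        c = recip_est(c);
        d = recip_est(d);
    }
    return a + b + c + d;
}
```

Timing both functions over the same large n (with clock() or under Shark) and comparing time per operation should show which case applies, assuming the compiler keeps the loops as written.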
--
Shaun Wexler
MacFOH http://www.macfoh.com
My software never has bugs. It just develops random features.