
Re: Strange performance results on G4 and G5 FFT




On Oct 13, 2004, at 1:25 PM, Bevis, Chris wrote:

Ian,

Thanks for the quick response. The data on different sizes supports your hypothesis:

Size         G4           G5
128x128      0.2 msec     0.2 msec
256x256      1.0 msec     0.4 msec
512x512      13.4 msec    16.6 msec
1024x1024    106 msec     64 msec
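
One way to read the table above is to divide each time by the usual FFT work estimate, N^2 * log2(N^2) for an NxN transform, so that sizes can be compared directly. The sketch below does that with the timings copied from the table; the names TIMINGS_MS, ns_per_unit_work and normalized_table are hypothetical helpers introduced for illustration, and the work estimate is a rough scaling rule rather than an exact operation count.

```python
import math

# Timings in milliseconds, copied from the table above.
TIMINGS_MS = {
    128: {"G4": 0.2, "G5": 0.2},
    256: {"G4": 1.0, "G5": 0.4},
    512: {"G4": 13.4, "G5": 16.6},
    1024: {"G4": 106.0, "G5": 64.0},
}


def ns_per_unit_work(n, ms):
    """Nanoseconds per unit of estimated work for an n x n FFT.

    The work estimate is points * log2(points) with points = n * n.
    This is a scaling rule of thumb, not an exact flop count.
    """
    points = n * n
    return ms * 1e6 / (points * math.log2(points))


def normalized_table(timings=TIMINGS_MS):
    """Return {size: {cpu: normalized cost in ns}} for every table entry."""
    return {
        n: {cpu: ns_per_unit_work(n, ms) for cpu, ms in row.items()}
        for n, row in timings.items()
    }


if __name__ == "__main__":
    for n, row in normalized_table().items():
        print(f"{n}x{n}: G4 {row['G4']:.2f} ns   G5 {row['G5']:.2f} ns")
```

If the data stayed in cache at every size, the normalized cost would be roughly flat. With these numbers it rises by roughly 3x on the G4 and roughly 9x on the G5 between 256x256 and 512x512.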

So it looks like there is a threshold at 256x256, beyond which both the G4 and the G5 overflow the cache, and that, except for the 512x512 case, the G5 is at least as fast as the G4. The exception is presumably because the number of cache misses is greater on the G5 than on the G4.
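
As a rough check on the cache-overflow idea, the sketch below estimates how many bytes an NxN complex FFT touches and compares that with a cache capacity. Everything here is an assumption made for illustration: single-precision data (4 bytes per real or imaginary component), an in-place transform, no allowance for twiddle tables or temporary buffers, and a placeholder cache size of 512 KiB that is not a measured figure for the machines in this thread. The function names are hypothetical.

```python
# ASSUMPTION: placeholder cache capacity, not a measured value for
# the G4 or G5 systems discussed in this thread.
ASSUMED_CACHE_BYTES = 512 * 1024


def fft_working_set_bytes(n, bytes_per_component=4, in_place=True):
    """Bytes of sample data touched by an n x n complex FFT.

    Each point has a real and an imaginary component. An out-of-place
    transform needs a second buffer of the same size. Twiddle tables
    and scratch space are ignored.
    """
    data = n * n * 2 * bytes_per_component
    return data if in_place else 2 * data


def fits_in_cache(n, cache_bytes=ASSUMED_CACHE_BYTES):
    """True if the estimated working set is no larger than the cache."""
    return fft_working_set_bytes(n) <= cache_bytes


if __name__ == "__main__":
    for n in (128, 256, 512, 1024):
        kib = fft_working_set_bytes(n) // 1024
        print(f"{n}x{n}: {kib} KiB, fits: {fits_in_cache(n)}")
```

Under these assumptions a 256x256 transform touches 512 KiB and a 512x512 transform touches 2 MiB, which is consistent with the jump in the timings between those two sizes.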

Given that our source images are integers, are there AltiVec libraries for 8- or 16-bit integer FFTs? Any other suggestions for improving performance on larger 2D data sets?

Looking at the FFTW benchmark page (http://www.fftw.org/speed/g5-2GHz/), it seems likely that FFTW would be a good thing to look at next.


I'm not quite sure where vDSP will be next year on the issue of large FFT speed. We have plans to look at it, but it is not clear when or if any performance improvements will be delivered.

Ian