| |||
| [Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] |
I am new to the mac world, just started using a 2 ghz G5 cluster days ago.
I am doing something more than memory copy but I always start it with
understanding memory copy issues on the target architecture. I tried STREAM
benchmark on the G5 which gives 2GB/s copy bandwidth, while on an Intel
Xeon 2.4GHZ I got 4.5GB/S with SSE2 using non-temporal writes. G5 has a
theoretical memory bandwidth of 6.4 GB/s which is much higher than
2GB/s I got and also higher than the theoretical memory bandwidth of
that intel Xeon. I later tried to write my own copy routine using
Altivec, the performace was only improved a little. I tried memcpy and memmove, the result
is 3.1 GB/s, which is better but still less than half of 6.4GB/s.
Anyone has some idea about this? Is 3.1 the best I can get. I am reading
bcopy.s in darwinsource/10.4.3/xnu-792.6.22/osfmk/ppc/. Is it the right file to look
at?
Thanks!
_______________________________________________ Do not post admin requests to the list. They will be ignored. PerfOptimization-dev mailing list (email@hidden) Help/Unsubscribe/Update your Subscription: http://lists.apple.com/mailman/options/perfoptimization-dev/email@hidden This email sent to email@hidden
| Home | Archives | FAQ | Terms/Conditions | Contact | RSS | Lists | About |
Visit the Apple Store online or at retail locations.
1-800-MY-APPLE
Contact Apple | Terms of Use | Privacy Policy
Copyright © 2007 Apple Inc. All rights reserved.