Thanks, Terry. That's all very good advice and I'll take it into
consideration.
So now my code knows how to find out that the system it is running on
has N > 1 processors. Say, for example, it is running on a system
with N = 2 processors so it wants to use a two-thread version of some
heavily computational routine. This is only going to be efficient if
the two threads actually use different physical processes.
So here a few interesting questions arise:
1. Is there any way to specify that threads run on physically
different processors/cores? I know the OS is supposed to be smart
but maybe not that smart because the thread-creating code isn't aware
of the context of the program being computed.
2. Particularly if the answer to the above question is YES, is it
guaranteed that a thread will spend its entire life within the
physical processor it first started on? I am not terribly familiar
with the low-level stuff but, when a thread has exhausted its time
slice, could it be scheduled on a different physical processor the
next time around?
3. Depending on what the answers of 1. and 2. are, it seems to be it
would be useful to have a mechanism for "locking" a thread to a
processor? Sort of to make the maximum use of the available
hardware? Particularly in the case when a process wants to start M
threads where M <= N available processors.
I don't think my specific situations are very complicated but am
trying to use the opportunity to teach myself a programming style
which is certainly very new to me. My situations involve mostly high-
complexity matrix and graph computations that can be partitioned and
parallelized easily. Nothing extra fancy (like data-races, and the
necessity for communication, locking, etc.). It's pretty safe to say
that the N parallel threads can proceed full speed ahead -- depending
on what hardware is available -- I basically need a lot of
computational cycles available.
Finally, I am particularly interested in knowing the answers for Mac
OS X, but it would be lovely to know what the POSIX ways (if any) are
of doing these things.
Thanks for any pointers.
Cheers,
--
ivan
On Dec 19, 2005, at 10:10 PM, Terry Lambert wrote:
On Dec 18, 2005, at 9:58 PM, Eric Albert wrote:
On Dec 18, 2005, at 5:45 PM, Ivan S. Kourtev wrote:
First, I tried sysconf() but it doesn't seem capable of doing
what I need under Mac OS? The two variables
_SC_NPROCESSORS_CONF and _SC_NPROCESSORS_ONLN are undefined in
unistd.h -- am I missing something?
They don't seem to be defined on Mac OS X. I'd suggest filing a
bug report with Apple (<http://bugreport.apple.com>) if you'd like
to see them added.
They are non-standard extensions tot he sysconf namespace. They
are unlikely to be included even if a bug report is filed, since
that particular namespace belongs to the standards committee; if
they come up with the same name that meant something else, we
wouldn't be able to implement it correctly because of binary
backward compatibility issues, so it's better if we don't add it.
The reason it's in the man page is that our man page is cribbed
from FreeBSD, and FreeBSD implements them. Our manual page there
is fairly out of data, but man page fixes are unlikely to make it
into a software update, for various reasons. The authoritative
reference is the contents of unistd.h.
The only ones you can actually use portably between platforms are
the ones defined by POSIX (assuming your other platforms are POSIX
compliant).
I also looked into sysctl as per Daniel's and Eric's suggestions
-- I noticed even the sysconf manpage suggests that the sysctl
interface is much richer. On Mac OS X, I got some code working
right away (attached at end) but I haven't figured out how to get
it to go under redhat (everything I do must work under both Mac
OS X and redhat at least). redhat has a sys/sysctl.h but it only
contains the declaration of sysctl() and none of the keywords. I
realize this may be a little off-topic, but any clues?
This sounds like a great job for a configure script. This is
hardly the only difference between Mac OS X and Linux. :) Another
alternative is to do something like
#ifdef HW_NCPU
...do the sysctl thing...
#elif defined(_SC_NPROCESSORS_CONF)
...do the sysconf thing...
#else
#error Uh oh.
#endif
sysctl is the way it should be done. As otherwise noted in this
thread already, these particular sysctl entries are generally
portable between 4.4BSD based systems.
By the way, in the code below, what are the HW_ keywords (if any)
that correspond to the commented out entries in the data[]
array? I really only need HW_NCPU and HW_AVAILCPU for now but
just out of curiosity?
Not all sysctl entries have numeric items to go along with their
names. Sometimes you just have to use sysctlbyname.
And in fact you should use names everywhere you can, rather than
OIDs, for forward code compatibility. We are likely to change
things in the future, particularly in this area of sysctl, and
sysctlbyname will be less fragile. I expect the current values
won't change (i.e. suddenly stop working for already compiled
code), but the sysctlbyname() is the preferred interface going
forward.
-- Terry
_______________________________________________
Do not post admin requests to the list. They will be ignored.
Unix-porting mailing list (email@hidden)
Help/Unsubscribe/Update your Subscription:
http://lists.apple.com/mailman/options/unix-porting/email@hidden