Mailing Lists: Apple Mailing Lists

Image of Mac OS face in stamp
 
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Mac OS X Server Authentication and AFP services hang



Unfortunately, my situation has degraded a bit since my original posting.

Since then, I did an upgrade installation to Tiger Server.  I was then having different hangs and spontaneous reboots.  These also had no log entries that clued me in to the actual cause of the problem.  I then did a wiped-drive install of Tiger Server and imported my user information using dsimport which made me REALLY popular with my users (  8-(   ).  The following Monday (I did the install on a weekend), the server spontaneously panicked/rebooted midday.  I talked with an Apple tech who suggested that perhaps some hardware was to blame and suggested that I remove all of my external devices.  

I had a couple SCSI tape autochangers, a FW800 drive, the USB cable to the APC UPS and the USB cable to my KVM.  All were disconnected and the thing chugged along fine until this past Saturday.  That afternoon, the server decided to go into a panic/reboot loop.  I went into the office and tried swapping in some known good memory, but the panic/reboot loop kept happening.  I called Apple and the tech suggested that I narrow down hardware vs software by installing to a FW disk and seeing if the panics persist, which I did.  The panics did not occur with the OS on the FW disk, so I booted from my internal system disk using "Safe Mode" (hold down shift key during boot) and the system stayed up for me to archive my OD information and save some Server Admin settings tear-offs.  I then re-installed (again) to the system disk after formatting the disk, updated everything back to 10.4.2, restored my OD and reconfigured everything.

I haven't had a reboot yet, but it's only been a few days.  Apple has since dispatched a new mainboard and pair of CPUs.

That's where I stand now.  From my perspective and my boss's, the situation is grim.  I've been shopping for an alternative server environment as a backup plan.  I'm pretty upset about the whole thing because Mac OS X Server (on paper anyway) supports all the client types that I've got natively and the administrator tools are really easy to use.  I just can't take the instabilities that I've been dealing with much longer.

I hope you have better luck.

On Sep 19, 2005, at 1:25 PM, Eric Redlinger wrote:


I have a similar thing happening about once a week.  In my case an entire classroom will get disconnected from the server (networked home directories) and the server machine will be inaccessible via ssh until hard-rebooted...  afterwards all fine again, with nothing in the logs to show for what the problem might have been except for entries similar to the below ones (bunches of users simultaneously "timed out").  I'm also running SMB with DNS handled by the campus...  For me it also happens under "high load", but CPU is never above 60% (for a dual 2ghz G5 xserve)

Any suggestions of other 'smoking guns' to look for?

-Eric






.:.:hunter college..FM:.ICIT.::.



On Sep 1, 2005, at 4:53 PM, Peter Schwenk wrote:

I've got a Panther (10.3.9) Server file server (Xserve G5 DP) that seems to hang up its authentication and AFP services, and I'm having a hell of a time figuring out why.  Of course, it's getting more and more frequent.  I've poked around in a bunch of log files, including system.log, AppleFileServiceAccess.log and AppleFileServiceError.log, but nothing telling jumps out at me.  There are plenty of after the fact symptoms in the logs like (from system.log):

Sep  1 10:29:17 file /usr/share/servermgrd/cgi-bin/servermgr_smb: [11564-a000b2a4] SIGALRM: process timed out
Sep  1 10:34:13 file /usr/share/servermgrd/cgi-bin/servermgr_info: [11620-a000b2a4] SIGALRM: process timed out
Sep  1 10:35:46 file /usr/share/servermgrd/cgi-bin/servermgr_print: [11633-a000b2a4] SIGALRM: process timed out
Sep  1 10:37:16 file /usr/share/servermgrd/cgi-bin/servermgr_smb: [11647-a000b2a4] SIGALRM: process timed out
Sep  1 10:39:13 file /usr/share/servermgrd/cgi-bin/servermgr_info: [11672-a000b2a4] SIGALRM: process timed out
Sep  1 10:40:46 file /usr/share/servermgrd/cgi-bin/servermgr_print: [11687-a000b2a4] SIGALRM: process timed out

and this (from AppleFileServiceAccess.log, I put the S, X, Y, Z in the IPs):

IP 128.175.S.X - - [01/Sep/2005:10:34:39 -0500] "Client no response timeout: hsiao" 0 0 0
IP 128.175.S.X - - [01/Sep/2005:10:34:39 -0500] "Saved for Reconnect User: hsiao" 1125500866 46 0
IP 128.175.S.X - - [01/Sep/2005:10:34:39 -0500] "Client no response timeout: <Guest>" 0 0 0
IP 128.175.S.X - - [01/Sep/2005:10:34:39 -0500] "Saved for Reconnect User: <Guest>" 1125500866 49 0
IP 128.175.S.Y - - [01/Sep/2005:10:34:39 -0500] "Client no response timeout: irwin" 0 0 0
IP 128.175.S.Y - - [01/Sep/2005:10:34:39 -0500] "Saved for Reconnect User: irwin" 1125500866 52 0
IP 128.175.S.Y - - [01/Sep/2005:10:34:39 -0500] "Client no response timeout: <Guest>" 0 0 0
IP 128.175.S.Y - - [01/Sep/2005:10:34:39 -0500] "Saved for Reconnect User: <Guest>" 1125500866 55 0
IP 128.175.S.Z - - [01/Sep/2005:10:34:39 -0500] "Client no response timeout: bacuta" 0 0 0
IP 128.175.S.Z - - [01/Sep/2005:10:34:39 -0500] "Saved for Reconnect User: bacuta" 1125500866 94 0

.  There's nothing the CrashReporter logs that anywhere near coincide with the times of the hangs.

Here's some more details about the server.  It's an OD master.  It fileserves AFP, NFS, and SMB.  My NFS clients mount shares statically, no automounting.  The AFP clients use automounting, and the mount records are in the OD. The home directory share (/Volumes/Homes) is shared all three ways so that all the types of clients I have can get to their home directories.  It's an SMB PDC.  DNS is handled by the campus, and it works in reverse and forward.  The server has one static IP address, and a name is associated with it.  I'm using SSL with the LDAP directory for my Linux clients with a purchased certificate.  All the users are type OD.  So far, as usual, the hangs occur during the middle of the day when usage is probably heaviest, but there's no sign of failure to keep up in the logs, and the load average is always low (well under .5).

I've called Apple support, and, at their suggestion, I've already reindexed the LDAP directory using some combinations of 'slapconfig' and 'slapindex' that I don't remember.  I've also reinitialized my PasswordServer database using some NeST commands, which made me very popular with my users.  :-(

Today, Apple support pointed me toward knowledge base article number 107899, which talks about a situation where authentication can hang due to lookupd maxing out its threads.  I've been running 'top -l0 | grep lookupd' for hours now, and the thread count for lookupd hasn't gone past 6, so I'm reluctant to believe that this is my issue.

If anyone has been in a similar boat and has some suggestions for clues to look for that would be most appreciated.  Thanks a bunch in advance for your help.


--
- Peter Schwenk
- CITA-3, Systems Administrator
- Mathematical Sciences
- University of Delaware
- 437 Ewing Hall, Newark, Delaware 19716-2553 USA
- (302) 831-0437 (v); (302) 831-4511 (f)
- schwenk _at_ math _dot_ udel _dot_ edu



_______________________________________________
Do not post admin requests to the list. They will be ignored.
Macos-x-server mailing list      (email@hidden)
Help/Unsubscribe/Update your Subscription:

This email sent to email@hidden





--
- Peter Schwenk
- CITA-3, Systems Administrator
- Mathematical Sciences
- University of Delaware
- (302) 831-0437
- schwenk _at_ math _dot_ udel _dot_ edu

 _______________________________________________
Do not post admin requests to the list. They will be ignored.
Macos-x-server mailing list      (email@hidden)
Help/Unsubscribe/Update your Subscription:
http://lists.apple.com/mailman/options/macos-x-server/email@hidden

This email sent to email@hidden

References: 
 >Mac OS X Server Authentication and AFP services hang (From: Peter Schwenk <email@hidden>)



Visit the Apple Store online or at retail locations.
1-800-MY-APPLE

Contact Apple | Terms of Use | Privacy Policy

Copyright © 2007 Apple Inc. All rights reserved.