AFP548

OD Crash…

Hi We have a problem... two machines display the same behaviour. Both machines have been stripped and rebuilt from scratch... but still a crash occurs... BEfore and after the alterations made below... Xenon Quad core 10.5.6... What could it be ??? *Environment:* OpenDirectory Master that has had it's LDAP directory imported from a previous Tiger server instance. PDC for windows network. WINS server. AFP home folder for macs. SMB shares for PC's and macs. NFS shares for Linux servers and workstations. HFS+ file systems on fibre Unity RAID array. * Factors leading up to a crash:* Increased load on AFP and SMB shares, usually reaching their peak of ~40+ AFP connections and ~90+ SMB connections. We have witnessed this crash during periods of low load also. *Results of the crash:* DirectoryServices daemon running continuously at 100+%. Slow to ID a user, directory listing are slow as a result. Slow to login via SSH. Samba fails to allow current new connections to work. CPU lights on the xserve are showing high levels of activity. Server can be come progressively worse until it is almost unresponsive. *Steps to recover:* Reload DirectoryServices using these commands: launchctl unload /System/Library/LaunchDaemons/com.apple.DirectoryServices.plist launchctl load /System/Library/LaunchDaemons/com.apple.DirectoryServices.plist Sometimes samba needs restarting after running the above commands in order to get access working properly again. Sometimes kerberos authentication fails for specific external servers requiring us to reboot OS X server to fix this. A hard reboot is sometimes required to get other services working properly again. *Steps taken to prevent the crash:* In an attempt to fix the vm growth error (see syslog) we made the following changes: Edited /etc/sysctl.conf and added the following lines kern.maxproc=2128 kern.maxprocperuid=400 Also this change: echo "limit maxproc 1500 2500" | sudo tee -a /etc/launchd.conf *Frequency of occurrence:* Usualy daily, resulting in up to ten minutes loss of access for 40 pc's and 100+ proxy users sys log... xserve DirectoryService[28]: Potential VM growth in DirectoryService since client PID: 0, has 800 open references when the warning limit is 500. Mar 26 12:51:41 xserve DirectoryService[28]: Potential VM growth in DirectoryService since client PID: 0, has 775 open references when the warning limit is 500. Mar 26 12:51:42: --- last message repeated 1 time --- Mar 26 12:51:42 xserve DirectoryService[28]: Potential VM growth in DirectoryService since client PID: 0, has 800 open references when the warning limit is 500. Mar 26 12:51:53 xserve DirectoryService[28]: Potential VM growth in DirectoryService since client PID: 0, has 775 open references when the warning limit is 500. Mar 26 12:52:06: --- last message repeated 2 times --- Mar 26 12:52:06 xserve DirectoryService[28]: Potential VM growth in DirectoryService since client PID: 0, has 800 open references when the warning limit is 500. Crash Log Exception Type: EXC_BAD_ACCESS (SIGBUS) Exception Codes: KERN_PROTECTION_FAILURE at 0x0000000000000030 Crashed Thread: 6 Thread 0: 0 libSystem.B.dylib 0x90bc11da write$NOCANCEL$UNIX2003 + 10 1 libSystem.B.dylib 0x90bc105f __sflush + 79 2 libSystem.B.dylib 0x90bca511 fflush + 106 3 ...ectoryServiceCore.Framework 0x00162998 CFile::write(void const*, int) + 284 4 ...ectoryServiceCore.Framework 0x00164797 CFile::write(char const*, int) + 31 5 ...ectoryServiceCore.Framework 0x00163a13 CLog::Append(CString const&) + 299 6 ...ectoryServiceCore.Framework 0x00163ba6 SrvrLog + 98 7 DirectoryService 0x000171fd main + 2821 8 DirectoryService 0x000166da start + 54 Thread 1:
Exit mobile version