Home › Forums › OS X Server and Client Discussion › Questions and Answers › netboot problem
- This topic has 13 replies, 10 voices, and was last updated 17 years, 6 months ago by
hdjong.
-
AuthorPosts
-
May 20, 2007 at 12:53 pm #369086
coolpor2
ParticipantHi All
I’m hoping someone can help with a netboot related issue. i’m trying to
deploy an osx 10.4.9 image to a number of imac G5’s. the netboot service is running on OSX server 10.4.9 on an xserve G5. All services are running as expected (afp, nfs, OD). Both a netinstall and netboot image were created from .dmg file. system image utility carried out image creation without error and the netboot service is up and runing.On the client side. the globe icon appears for about 20 seconds, then the spinning globe w/ apple icon appears for a few seconds before the spinning progress indicator appears and then nothing happens – it doesn’t advance beyond this stage.
I’ve done the basic checks – the target machines can get ip’s from our dhcp server. When logged in locally to the clients the network images are available as options in the startup disk pane. The netboot server logs don’t appear to indicate anything unusual (the clients and server exchange bootpd (bpsd) communication). What is strange is that when the clients are set to boot from the network image the system doesn’t allow me to enter verbose mode on startup – so i’m unable to see what problems the client is having. I’m guessing this is unusual.
Any ideas ?
May 21, 2007 at 1:52 pm #369092coolpor2
ParticipantHi
Thanks for the reply. I can connect to the server via tftp from the terminal on one of the clients. I can also
mount the NetBootSP0 share via NFS so no problems there.I’m still having difficulty getting verbose output for the NB startup. I’ve tried pressing CMD V and N a the same time but
all I get is the verbose ouput for the normal boot process, and this doesn’t really tell me anything.May 22, 2007 at 4:23 am #369100ingenious7
ParticipantHi,
Did you create the image and netinstall set on an G5 iMac? This can cause problems if you have created the image on say a G4.
I remember having a similar problem when we upgraded to new iMac G5s.
May 22, 2007 at 9:20 am #369101coolpor2
ParticipantMactroll;
I’ve managed to get verbose output from the NB startup on the client. It’s gets an IP from our dhcp server, locates the path of the image on the server
but then spits out RPC timeout error. On Bombich.com he says that this is likely a NFS issue, but I’ve managed to run the mnt commands he suggest
to check nfs connectivity when booted locally on the client machines. aaargh!!! this problem is driving me round th bend.Also, to the last poster I have created the image from an imac G5 running 10.4.9
May 22, 2007 at 11:26 am #369104factor
Participant[QUOTE][u]Quote by: coolpor2[/u][p]Mactroll;
I’ve managed to get verbose output from the NB startup on the client. It’s gets an IP from our dhcp server, locates the path of the image on the server
but then spits out RPC timeout error. On Bombich.com he says that this is likely a NFS issue, but I’ve managed to run the mnt commands he suggest
to check nfs connectivity when booted locally on the client machines. aaargh!!![/p][/QUOTE]I would guess that your mach.macosx.mkext in the i386 directory on the server (that gets sucked down by TFTP) doesn’t have all the required networking type bits in it.
That is — you are getting the kernel initial boot image via TFTP, and the kernel has started to spit some stuff out (boot loader has relinquished control, kernel has taken over), it then goes looking for the “/” root directory. The kernel cannot do the NFS connection bit though because some part of the network card driver or network filesystem driver is missing (kext cache needs to be built specially to support NFS boot — see the bottom of the Bombich article I think).
May 24, 2007 at 9:49 am #369134coolpor2
ParticipantThanks for everyones responses. It appears that the main problem was with SIU modifying the dmg disk image during the netinstall image creation process.
It managed to skip certain files….
I’m not quite sure if the RPC timeout error was related but I didn’t get that error again, but the upshot was that I had to reimage the reference machine. But this time I did a direct capture within SIU via firewire rather than my original attempt which was :Create image using diskutil then create netinstall image in SIU from image file created in diskutil.
It appears SIU is a bit hit and miss if you’re not creating a netboot/netinstall image from a FW target-disk-mode mounted volume.
It’s all very well Apple talking about wiki servers and all the other new features in leopard server, but I really hope they make significant improvements in Netboot and SIU – so that we don’t have to turn to third-party tools for deployment in preference because things that should ‘just work’ in 10.4 don’t. I think fundamentally they’re good tools – but as I say very temperamental. Apple have done a good job with the improvements in ARD 3 now let’s hope they don’t sit still with NB/SIU in leopard.
Anyway, i’ve managed to deploy the OS….eventually. Thanks for everyones input.
May 29, 2007 at 1:01 am #369158jacobpearson
ParticipantI’m having almost the exact same problem,
I had two images enabled on the server, but the the netinstall process defaulted to the older image, even though I select the newer image. It did however install perfectly. I didn’t think to try a verbose install before I disabled the older image.
With the older image disabled, the newer image would hang there with the ticker flicking over. When I viewed the verbose output, it created the RAM discs, then:
[code]netinfod local[26]: unable to bind to parent – RPC: Timed Out
lookupd [43]: NetInfo connection failed on server 127.0.0.1/local
kern.maxunodes: 2500 -> 2864[/code]later in the output it said:
[code]lookupd[96] Can’t load /system/library/PrivateFrameworks/NetInfo.framework/Resources/lookupd/Agents/DS.bundle/DS
/System/Library/Frameworks/ApplicationServices.framework/Frameworks/ATS.framework/Support/ATSServer: (-1) Cannot use ATS Persistent Store – switching to old FODB cache code
/etc/rc.cdrom.postWS: line 35: kill: (108) – No such process[/code]it also reported that the files for quartz weren’t available, then [code]Network Installer has quit due to an unexpected error. (exit code 133) Please restart your computer[/code]
Interesting that without being in verbose mode, I’d never have seen these messages, the GUI doesn’t produce any notifications at all.
Now the annoying part: I’ve opened up both the old and new images, and those files that verbose mode marked as missing or not available are indeed not there, but are missing in both the working and not working images.
And the really annoying part: coolpor2 your way of fixing the image is exactly the way that I did it in the first place. I created the system on my G5, then started it as a firewire drive and hooked it up to my xServe, and created the image directly on the xServe with SIU. So I’m now trying the way that failed for you, and if that doesn’t work – and nobody here replies with a suggestion – I’ll be using Mike’s tools.
May 29, 2007 at 11:06 am #369160coolpor2
ParticipantMy main advice is once you find a way that finally works stick with it often minor things are significant. And don’t be afraid to restart from scratch. i.e set up a new deloyment server (this would be last resort in your case as it seems it’s screwing up during actual image creation), and reinstall everything on the reference computer starting with the OS if need be.
If it helps I was running osx server 10.4.9, and the image was created from a G5 running 10.4.7. And I’m sticking with those versions because now for us everything just works.
If you’re sure that it isn’t a network issue i.e the usual suspects dhcp, dns and nfs are behaving then just be really methodical with recreating the image.
If you are going to create an install image from a dmg file make sure you check and verify permiissions on the volume…I’m surprised the direct firewire capture didn’t work the reason i avoided that method in the first place was because i was being impatient and a lazy ass.
Once i got everything working something else cropped up to put a spanner in the works. When I tried to deploy to some machines the following day it would hang at the blue screen. It turns out I needed to a quick re-erase AND re-format of the volume on each affected machine using my trusted osx statup disks – this cleared it but be prepared as this might happen to you.
HTH
June 27, 2007 at 8:25 pm #369380jdonaldson
ParticipantI was having the same problem with a client 10.4.10 image. I finally gave up and called Apple and was informed that the client image must be at 10.4.8 due to a bug introduced in 10.4.9. Once the image is created, I was told I can use the System Image Utility to apply the combo update to the image, but the image must be created with a 10.4.8 client! Was told that the problem is in engineering at the moment and the tech didn’t know if it would be fixed in at 10.4.x release or 10.5.x.
I am now rebuilding my image using 10.4.8 operating system.
Joel
July 18, 2007 at 4:16 pm #369553RobertHammen
MemberJust ran into this issue this morning. Found an apparent solution on the Apple Discussion forums. I say “apparent” because my NetBoot no longer errors out immediately, and it’s imaging a test Intel machine right now, so I don’t know for sure if it works until it’s done. It does appear to be working, though…
Mount a Universal 10.4.9 or 10.4.10 drive or machine (i.e. via FireWire – the drive you made the NetInstall image from).
Mount the image from your NetInstall set (i.e. /Library/NetBoot/NetBootSP0/ImageXXXXXXX.nbi/Install.dmg – it should come up as Read/Write.
Copy the /System/Library/Frameworks/CoreVideo.framework from the 10.4.9 or 10.4.10 drive to the mounted Install.dmg image (in the same location – it shouldn’t ask to replace the file, as the problem is the file’s not there/it’s missing from the image).
Now, eject the mounted Install.dmg image.
Try to NetInstall from a target machine (use Command-V verbose mode to debug). Does it work? It is apparently doing so for me…
July 20, 2007 at 1:22 pm #369580rveety
ParticipantRobert,
You rock, that worked perfectly. Thanks for the post.
September 27, 2007 at 11:02 am #370069hdjong
Participant😀 😀 😀 😀 😀 😀 😀
It worked for me too!
Unbelievable such a BUG!I have used bombich CCC and netrestore for a long time. Also do I use netinstall on my Intel XServe. The last images I made suddenly didn’t work on my XServe. I tried different thing and methods. Luckily I found the solution in this posting.
Great work!
-
AuthorPosts
- You must be logged in to reply to this topic.
Comments are closed