From owner-freebsd-stable@FreeBSD.ORG Fri May 20 20:30:21 2005 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 0F53616A4CE for ; Fri, 20 May 2005 20:30:21 +0000 (GMT) Received: from carver.gumbysoft.com (carver.gumbysoft.com [66.220.23.50]) by mx1.FreeBSD.org (Postfix) with ESMTP id B9D1C43D95 for ; Fri, 20 May 2005 20:30:20 +0000 (GMT) (envelope-from dwhite@gumbysoft.com) Received: by carver.gumbysoft.com (Postfix, from userid 1000) id AD94472DD4; Fri, 20 May 2005 13:30:20 -0700 (PDT) Received: from localhost (localhost [127.0.0.1]) by carver.gumbysoft.com (Postfix) with ESMTP id A804372DCB; Fri, 20 May 2005 13:30:20 -0700 (PDT) Date: Fri, 20 May 2005 13:30:20 -0700 (PDT) From: Doug White To: =?ISO-8859-2?Q?S=B3awek_=AFak?= In-Reply-To: <787bbe1c05051903346376988b@mail.gmail.com> Message-ID: <20050520132841.I8229@carver.gumbysoft.com> References: <787bbe1c05051903346376988b@mail.gmail.com> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=X-UNKNOWN Content-Transfer-Encoding: QUOTED-PRINTABLE cc: freebsd-stable@freebsd.org Subject: Re: Diskless boot problem X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 20 May 2005 20:30:21 -0000 Please try to avoid sending your message bodies base64 encoded :) My mail client got really confused by it and didn't quote the message properly. Also stripping hackers cc:. On Thu, 19 May 2005, [ISO-8859-2] S=B3awek =AFak wrote: > Hi, > I have a problem with booting Dell 2850 over network. The machine reads > kernel over net, boots upto mounting / from NFS and then crashes. What is the NFS server? It seems to think the NFS handle we pulled the kernel with is no longer valid. Does PXE and the system itself end up pulling different IP addresses? (rest of original email follows) Tcpdump output: 12:15:58.919683 arp who-has 10.158.190.73 tell 10.158.190.74 12:15:58.919702 arp reply 10.158.190.73 is-at 00:11:43:d3:6e:e1 12:15:58.920058 IP 10.158.190.74.475209176 > 10.158.190.73.2049: 92 getattr [|nfs] 12:15:58.920134 IP 10.158.190.73.2049 > 10.158.190.74.475209176: reply ok 28 getattr ERROR: Stale NFS file handle 12:15:58.920432 arp who-has 10.158.190.73 tell 10.158.190.74 12:15:58.920442 arp reply 10.158.190.73 is-at 00:11:43:d3:6e:e1 12:15:58.920681 IP 10.158.190.74.475209177 > 10.158.190.73.2049: 100 lookup [|nfs] 12:15:58.920707 IP 10.158.190.73.2049 > 10.158.190.74.475209177: reply ok 28 lookup ERROR: Stale NFS file handle 12:15:58.920932 IP 10.158.190.74.475209178 > 10.158.190.73.2049: 100 lookup [|nfs] 12:15:58.920963 IP 10.158.190.73.2049 > 10.158.190.74.475209178: reply ok 28 lookup ERROR: Stale NFS file handle 12:15:58.952180 IP 10.158.190.74.475209179 > 10.158.190.73.2049: 100 lookup [|nfs] 12:15:58.952277 IP 10.158.190.73.2049 > 10.158.190.74.475209179: reply ok 28 lookup ERROR: Stale NFS file handle 12:15:58.984785 IP 10.158.190.74.475209180 > 10.158.190.73.2049: 100 lookup [|nfs] 12:15:58.984866 IP 10.158.190.73.2049 > 10.158.190.74.475209180: reply ok 28 lookup ERROR: Stale NFS file handle 12:15:59.020500 IP 10.158.190.74.475209181 > 10.158.190.73.2049: 104 lookup [|nfs] 12:15:59.020573 IP 10.158.190.73.2049 > 10.158.190.74.475209181: reply ok 28 lookup ERROR: Stale NFS file handle 12:15:59.054130 IP 10.158.190.74.475209182 > 10.158.190.73.2049: 104 lookup [|nfs] 12:15:59.054224 IP 10.158.190.73.2049 > 10.158.190.74.475209182: reply ok 28 lookup ERROR: Stale NFS file handle I wonder where the `Stale NFS handle' error comes from, as the client doesn't seem to have mounted the filesystem over NFS from what I can see. On the console of the diskless I have this: NFS ROOT: 10.158.190.73:/var/www/FreeBSD-5.4-x86-PXE em0: Link is up 100 Mbps Half Duplex exec /sbin/init: error 70 exec /sbin/oinit: error 70 exec /sbin/init.bak: error 70 exec /rescue/init: error 70 exec /stand/sysinstall: error 70 init: not found in path /sbin/init:/sbin/oinit:/sbin/init.bak:/rescue/init:/stand/sysinstall panic: no init Uptime: 55s Cannot dump. No dump device defined. Automatic reboot in 15 seconds - press a key on the console to abort The speed for em0 is obviously wrong. Setting on the switch is 100 full-duplex. Our network wizards can f***kup autonegotiation on Cisco Catalyst, so it must stay that way. Intel em-s tend to hang for a couple of seconds before getting on the net so it might be the problem. On the other hand kernel loads just fine over TFTP. Any thoughts? Thanks, /S --=20 S=B3awek =AFak / UNIX Systems Administrator --=20 Doug White | FreeBSD: The Power to Serve dwhite@gumbysoft.com | www.FreeBSD.org