Skip site navigation (1)Skip section navigation (2)
Date:      Fri, 12 Jul 2013 09:01:12 +0300
From:      Konstantin Belousov <kostikbel@gmail.com>
To:        Andre Albsmeier <Andre.Albsmeier@siemens.com>
Cc:        "freebsd-stable@freebsd.org" <freebsd-stable@freebsd.org>, John Baldwin <jhb@freebsd.org>
Subject:   Re: FreeBSD-9.1: machine reboots during snapshot creation, LORs found
Message-ID:  <20130712060112.GY91021@kib.kiev.ua>
In-Reply-To: <20130712052440.GA97779@bali>
References:  <201305311051.03157.jhb@freebsd.org> <20130616063942.GA72803@bali> <201306171530.31208.jhb@freebsd.org> <20130704051409.GA22021@bali> <20130704052440.GG91021@kib.kiev.ua> <20130704052659.GA23398@bali> <20130704061550.GI91021@kib.kiev.ua> <20130704142919.GA1798@bali> <20130704172528.GL91021@kib.kiev.ua> <20130712052440.GA97779@bali>

next in thread | previous in thread | raw e-mail | index | archive | help

--YACgKm7aulUA0QKG
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
Content-Transfer-Encoding: quoted-printable

On Fri, Jul 12, 2013 at 07:24:40AM +0200, Andre Albsmeier wrote:
> On Thu, 04-Jul-2013 at 19:25:28 +0200, Konstantin Belousov wrote:
> > On Thu, Jul 04, 2013 at 04:29:19PM +0200, Andre Albsmeier wrote:
> > > OK, patch is applied. I will reboot the machine later
> > > and see what happens tomorrow in the morning. However,
> > > it might take a few days since the last 2 weeks all was
> > > fine.
> > >=20
> > > BTW, should this patch be used in general or is it just
> > > for debugging? My understanding is that it is something
> > > which could stay in the code...
> >=20
> > Patch is to improve debugging.
> >=20
> > I probably commit it after the issue is closed.  Arguments against
> > the commit is that the change imposes small performance penalty
> > due to save and restore of the %ebp (I doubt that this is measureable
> > by any means).  Also, arguably, such change should be done for all
> > functions in support.s, but bcopy() is the hot spot.
>=20
> Got a new one, 2 hours old ;-)
>=20
> GNU gdb 6.1.1 [FreeBSD]
> Copyright 2004 Free Software Foundation, Inc.
> GDB is free software, covered by the GNU General Public License, and you =
are
> welcome to change it and/or distribute copies of it under certain conditi=
ons.
> Type "show copying" to see the conditions.
> There is absolutely no warranty for GDB.  Type "show warranty" for detail=
s.
> This GDB was configured as "i386-marcel-freebsd"...
>=20
> Unread portion of the kernel message buffer:
>=20
>=20
> Fatal trap 12: page fault while in kernel mode
> fault virtual address   =3D 0xcd5ec000
> fault code              =3D supervisor write, page not present
> instruction pointer     =3D 0x20:0xc07cb2fe
> stack pointer           =3D 0x28:0xd82e45cc
> frame pointer           =3D 0x28:0xd82e45d4
> code segment            =3D base 0x0, limit 0xfffff, type 0x1b
>                         =3D DPL 0, pres 1, def32 1, gran 1
> processor eflags        =3D interrupt enabled, resume, IOPL =3D 0
> current process         =3D 18714 (mksnap_ffs)
> trap number             =3D 12
> panic: page fault
> KDB: stack backtrace:
> db_trace_self_wrapper(c08207eb,d82e4418,c05fdfc9,c081df13,c08a82e0,...) a=
t db_trace_self_wrapper+0x26/frame 0xd82e43e8
> kdb_backtrace(c081df13,c08a82e0,c0801bfa,d82e4424,d82e4424,...) at kdb_ba=
cktrace+0x29/frame 0xd82e43f4
> panic(c0801bfa,c0845a01,c2b067d4,1,1,...) at panic+0xc9/frame 0xd82e4418
> trap_fatal(c0ff6000,cd5ec000,2,0,c08b6bf4,...) at trap_fatal+0x353/frame =
0xd82e4458
> trap_pfault(baa8454b,21510,0,c2b06620,c08b6bf0,...) at trap_pfault+0x2d7/=
frame 0xd82e44a0
> trap(d82e458c) at trap+0x41a/frame 0xd82e4580
> calltrap() at calltrap+0x6/frame 0xd82e4580
> --- trap 0xc, eip =3D 0xc07cb2fe, esp =3D 0xd82e45cc, ebp =3D 0xd82e45d4 =
---
> bcopy(c36ed000,cd5e6000,8000,8000,c281b980,...) at bcopy+0x1a/frame 0xd82=
e45d4
> ffs_snapshot(c2b35a90,c2ed0400,0,0,0,...) at ffs_snapshot+0x2933/frame 0x=
d82e490c
> ffs_mount(c2b35a90,c322e200,ff,d82e4c08,c2ccbc8c,...) at ffs_mount+0x15ee=
/frame 0xd82e4a3c
> vfs_donmount(c2b06620,10313108,0,c2b74d80,c2b74d80,...) at vfs_donmount+0=
x196b/frame 0xd82e4c2c
> sys_nmount(c2b06620,d82e4ccc,c2b06908,d82e4c6c,c0605015,...) at sys_nmoun=
t+0x63/frame 0xd82e4c50
> syscall(d82e4d08) at syscall+0x2ce/frame 0xd82e4cfc
> Xint0x80_syscall() at Xint0x80_syscall+0x21/frame 0xd82e4cfc
> --- syscall (378, FreeBSD ELF32, sys_nmount), eip =3D 0x180bdf37, esp =3D=
 0xbfbfd65c, ebp =3D 0xbfbfddd8 ---
> Uptime: 4d20h0m44s
> Physical memory: 503 MB
> Dumping 104 MB: 89 73 57 41 25 9
>=20
> No symbol "stopped_cpus" in current context.
> No symbol "stoppcbs" in current context.
> #0  doadump (textdump=3D1) at pcpu.h:249
> 249     pcpu.h: No such file or directory.
>         in pcpu.h
> (kgdb) where
> #0  doadump (textdump=3D1) at pcpu.h:249
> #1  0xc05fdddd in kern_reboot (howto=3D260) at /src/src-9/sys/kern/kern_s=
hutdown.c:449
> #2  0xc05fe028 in panic (fmt=3D<value optimized out>) at /src/src-9/sys/k=
ern/kern_shutdown.c:637
> #3  0xc07cd1d3 in trap_fatal (frame=3D0xd82e458c, eva=3D3445538816)
>     at /src/src-9/sys/i386/i386/trap.c:1044
> #4  0xc07cd4b7 in trap_pfault (frame=3D0xd82e458c, usermode=3D0, eva=3D34=
45538816)
>     at /src/src-9/sys/i386/i386/trap.c:957
> #5  0xc07ce05a in trap (frame=3D0xd82e458c) at /src/src-9/sys/i386/i386/t=
rap.c:555
> #6  0xc07ba88c in calltrap () at /src/src-9/sys/i386/i386/exception.s:170
> #7  0xc07cb2fe in bcopy () at /src/src-9/sys/i386/i386/support.s:198
> #8  0xc072be13 in ffs_snapshot (mp=3D0xc2b35a90, snapfile=3D0xc2ed0400 "s=
5-2013.07.12-03.15.01")
>     at /src/src-9/sys/ufs/ffs/ffs_snapshot.c:793
> #9  0xc0748e8e in ffs_mount (mp=3D0xc2b35a90) at /src/src-9/sys/ufs/ffs/f=
fs_vfsops.c:483
> #10 0xc068a72b in vfs_donmount (td=3D0xc2b06620, fsflags=3D271659272, fso=
ptions=3D0xc2b74d80)
>     at /src/src-9/sys/kern/vfs_mount.c:948
> #11 0xc068a8e3 in sys_nmount (td=3D0xc2b06620, uap=3D0xd82e4ccc) at /src/=
src-9/sys/kern/vfs_mount.c:417
> #12 0xc07cd7ae in syscall (frame=3D0xd82e4d08) at subr_syscall.c:135
> #13 0xc07ba8f1 in Xint0x80_syscall () at /src/src-9/sys/i386/i386/excepti=
on.s:270
> #14 0x00000033 in ?? ()
> Previous frame inner to this frame (corrupt stack?)

Please show me the first 100 lines of the output of dumpfs(8) on the
filesystem where snapshot creation caused the panic.

--YACgKm7aulUA0QKG
Content-Type: application/pgp-signature

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2.0.20 (FreeBSD)

iQIcBAEBAgAGBQJR35uoAAoJEJDCuSvBvK1BrNsP/j3lkZ95daE6qdrV1sp7sdDz
FdqXHqPj20W0aog9f7/5kf3Q6+LZM1Jlx43+fhfuXkUZ0+vWBvfo9npaoseNjho7
GZyH3EyTS9TBJSQfPJwZxQ0DRHosy5YCoqYbFCqf/LIzdFD9dR93dcfZDI7WIdwA
3F84k3193CbREKNdII5CqJuq3GqilVvyWbX5mDXIpvPraHvDiMUZLJbCA6aW43RS
2Ij8MzZFOjwpAuOeP+5yeOGC2QGkMdKD9orWT1QDaYr7f9Wv8wm6qhPqMDj1gFLz
I84EC7kzzWkbv8DbnkVaxd7buZQIj28SVQ6ASPRWz4rERTP9JJlvV9Q3okf/Urll
8d+bTfVNidKaR5Hot5XUHF9C9g6R/AqNGJ7/iz0tF4dL0rG3zcr4a2au5OQFm03D
fNGl47mFbaxy+L9DGH0QxSAtqS2G3LHHME+oGY4LTHslq7nEojn+5c/gJR0hI6EA
CPj6GU9oRFIpoV+6JeIYhhxWT7curPCtEmjXiOtUYFFNtYfWsAWTYuNDZ9amHAxO
OOfwldpSogOmS4EomoWLY6FXmYOCcj7iJ9XlSeUAbbyLNx4GIJmUeY49eSvkvZ9c
Pgu0Mp6I44zYEHNYKpzbCA53sXTDhMycLtura4Pe+rcy5ooXc/eg2AueAmeUIF23
F+LqalUcINN4iWcdvFEq
=Bd04
-----END PGP SIGNATURE-----

--YACgKm7aulUA0QKG--



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20130712060112.GY91021>