Date:      Fri, 16 Jul 2010 12:23:41 +0100
From:      Anton Shterenlikht <mexas@bristol.ac.uk>
To:        freebsd-ia64@freebsd.org
Subject:   Re: gpart segfault and fatal kernel trap
Message-ID:  <20100716112341.GA99205@mech-cluster241.men.bris.ac.uk>
In-Reply-To: <20100716110802.GA99033@mech-cluster241.men.bris.ac.uk>
References:  <20100716110802.GA99033@mech-cluster241.men.bris.ac.uk>

On Fri, Jul 16, 2010 at 12:08:02PM +0100, Anton Shterenlikht wrote:
> On r209586 I have 2 fibre disks (da1, da2).
> I added another and did 'camcontrol rescan all'.
> I got the disk detected as da3.
> I did "gpart -c gpt da3" successfully.
> I added a UFS partition successfully:
> 
> mech-as28# gpart add -t freebsd-ufs da3
> da3p1 added
> 
> However "gpart show" would segfault.
> 
> In addition devices /dev/da1 and /dev/da2 vanished.
> 
> I decided to reboot and got this:
> 
> - - - - - - - - - - - - Live Console - - - - - - - - - - - -
> 
> fatal kernel trap (cpu 0):
> 
>     trap vector = 0x14 (Page Not Present)
>     cr.iip      = 0xe00000000439f2c0
>     cr.ipsr     = 0x1210080a6010 (mfl,ic,i,dt,dfh,rt,cpl=0,it,ri=1,bn)
>     cr.isr      = 0xa0400000000 (code=0,vector=0,r,ei=1,ed)
>     cr.ifa      = 0x18
>     curthread   = 0xe000000010f9c3c0
>         pid = 2, comm = g_event
> 
> [ thread pid 2 tid 100008 ]
> Stopped at      g_access+0x41:  [M1]    ld8 r36=[r14]
> db> bt
> Tracing pid 2 tid 100008 td 0xe000000010f9c3c0
> g_access(0x0, 0xffffffffffffffff, 0xffffffffffffffff, 0x0) at g_access+0x41
> swapgeom_close_ev(0x0, 0xe0000000043977f0, 0x814, 0xd9) at swapgeom_close_ev+0x30
> g_run_events(0xe000000011443b00, 0x0, 0xe000000004aae500, 0xe000000004925888, 0xe000000004aaba38) at g_run_events+0x6c0
> g_event_procbody(0xe000000004a9c084, 0xe00000000491cd10, 0xe000000004a9b8c8, 0xe000000004401ca0) at g_event_procbody+0xf0
> fork_exit(0xe000000004977400, 0x0, 0xa0000000eaf1b550) at fork_exit+0x110
> enter_userland() at enter_userland
> db> 
> 
> I'll complete the reboot and see from there.

Everything seems OK after the reboot:

ZEEV> gpart show
=>       34  143374671  da0  GPT  (68G)
         34     409600    1  efi  (200M)
     409634   33554432    2  freebsd-ufs  (16G)
   33964066  109410639       - free -  (52G)

=>       34  142255508  da1  GPT  (68G)
         34   33554432    1  freebsd-swap  (16G)
   33554466  108701076    2  freebsd-ufs  (52G)

=>       34  142255508  da2  GPT  (68G)
         34   33554432    1  freebsd-swap  (16G)
   33554466  108701076    2  freebsd-ufs  (52G)

=>       34  860232488  da3  GPT  (410G)
         34  860232488    1  freebsd-ufs  (410G)

ZEEV> 

/var/log/messages contains this fragment,
produced by my "camcontrol rescan all":

Jul 16 11:50:17 mech-as28 kernel: (da1:isp0:0:0:1): lost device
Jul 16 11:50:17 mech-as28 kernel: (da2:isp0:0:0:2): lost device
Jul 16 11:50:17 mech-as28 kernel: da3 at isp0 bus 0 scbus2 target 0 lun 4
Jul 16 11:50:17 mech-as28 kernel: da3: <COMPAQ MSA1000 VOLUME 4.32> Fixed Direct Access SCSI-4 device
Jul 16 11:50:17 mech-as28 kernel: da3: 200.000MB/s transfers WWNN 0x500805f3000ec220 WWPN 0x500805f3000ec221 PortID 0x10000
Jul 16 11:50:17 mech-as28 kernel: da3: Command Queueing enabled
Jul 16 11:50:17 mech-as28 kernel: da3: 420035MB (860232555 512 byte sectors: 255H 63S/T 53547C)
Jul 16 11:50:29 mech-as28 kernel: pid 40031 (gpart), uid 0: exited on signal 11 (core dumped)
Jul 16 11:50:35 mech-as28 kernel: pid 40032 (gpart), uid 0: exited on signal 11 (core dumped)
Jul 16 11:52:42 mech-as28 kernel: pid 40042 (gpart), uid 0: exited on signal 11 (core dumped)


Marcel, is this of any interest?

many thanks
anton 

-- 
Anton Shterenlikht
Room 2.6, Queen's Building
Mech Eng Dept
Bristol University
University Walk, Bristol BS8 1TR, UK
Tel: +44 (0)117 331 5944
Fax: +44 (0)117 929 4423


