Date:      Mon, 11 Oct 2010 11:30:51 -0400
From:      "Michael W. Lucas" <mwlucas@blackhelicopters.org>
To:        fs@freebsd.org
Subject:   hast crash
Message-ID:  <20101011153051.GA15699@bewilderbeast.blackhelicopters.org>

Hi,

I upgraded my HAST cluster to 8.1-STABLE on 6 October 2010, and hastd is
now crashing.  The hastd debug output shows:

...
[DEBUG][2] [mirror] (secondary) recv: (0x8013ecc40) Got request header: WRITE(11752701952, 131072).
[DEBUG][2] [mirror] (secondary) recv: (0x8013ecc40) Moving request to the disk queue.
[DEBUG][2] [mirror] (secondary) disk: (0x8013ecc40) Got request: WRITE(11752701952, 131072).
[DEBUG][2] [mirror] (secondary) recv: Taking free request.
[DEBUG][2] [mirror] (secondary) recv: (0x8013ecbf0) Got request.
[ERROR] [mirror] (secondary) Unable to receive request header: RPC version wrong.
[DEBUG][1] Unable to receive event header: Socket is not connected.
[DEBUG][1] Accepting connection to tcp4://0.0.0.0:8457.
[INFO] Connection from tcp4://192.168.0.1:21493 to tcp4://192.168.0.2:8457.
[DEBUG][2] tcp4://192.168.0.1:21493: resource=mirror
[DEBUG][1] [mirror] (secondary) Initial connection from tcp4://192.168.0.1:21493.
[DEBUG][1] [mirror] (secondary) Worker process exists (pid=8826), stopping it.
[ERROR] [mirror] (secondary) Worker process exited ungracefully (pid=8826, exitcode=75).
Assertion failed: (conn != NULL), function proto_close, file /usr/src/sbin/hastd/proto.c, line 287.
Abort (core dumped)

Both machines are running on VMware ESXi.  The second machine is a
clone of the first.

Any thoughts, folks?

Thanks,
==ml

-- 
Michael W. Lucas 	mwlucas@BlackHelicopters.org
http://www.MichaelWLucas.com/, http://blather.MichaelWLucas.com/
New book available: Network Flow Analysis
http://www.networkflowanalysis.com/


