Date:      Fri, 12 Mar 2010 09:10:35 +0100
From:      Alexander Leidinger <Alexander@Leidinger.net>
To:        Borja Marcos <borjam@sarenet.es>
Cc:        freebsd-fs@FreeBSD.org, Pawel Jakub Dawidek <pjd@FreeBSD.org>, FreeBSD@FreeBSD.org, Stable <freebsd-stable@FreeBSD.org>
Subject:   Re: Many processes stuck in zfs
Message-ID:  <20100312091035.961823cwec0rxijo@webmail.leidinger.net>
In-Reply-To: <35ADB9B1-F571-4EE4-9089-5363ACEBE159@sarenet.es>
References:  <864468D4-DCE9-493B-9280-00E5FAB2A05C@lassitu.de> <20100309122954.GE3155@garage.freebsd.pl> <EC9BC6B4-8D0E-4FE3-852F-0E3A24569D33@sarenet.es> <20100309125815.GF3155@garage.freebsd.pl> <CB854F58-03AF-46DD-8153-85FA96037C21@sarenet.es> <BFF1E2D6-B48A-4A5E-ACEE-8577FDB07820@sarenet.es> <20100310110202.GA1715@garage.freebsd.pl> <E04F91AA-B2C4-4166-A24A-74F1BEF01519@sarenet.es> <20100310173143.GD1715@garage.freebsd.pl> <20100311084527.2934034895hvgxaw@webmail.leidinger.net> <764BD545-B86C-47DC-9004-964EB2216AF0@sarenet.es> <20100311150822.107231cvjvgs9gsg@webmail.leidinger.net> <35ADB9B1-F571-4EE4-9089-5363ACEBE159@sarenet.es>

Quoting Borja Marcos <borjam@sarenet.es> (from Thu, 11 Mar 2010  
18:26:09 +0100):

> Of course CPUs have bugs, I don't doubt it. I was just wondering how
> I could reproduce the problem with different hardware :) That's why
> I said it was unlikely.
>
> Besides, such a low level fault should produce many more problems  
> than such a well defined failure mode, as far as I know.

In my case I had a 7.1 system which was running fine. After updating
to 7.2 I got deadlocks within minutes with UFS. Switching to ZFS
for the main data partition extended the time until a deadlock to
3-4 hours. After updating ZFS in 7-stable with the code from 8-stable
this was extended to 6 hours (periodic daily triggered the problem
faster). After switching to exclusive locks instead of shared locks
in ZFS, the system survived a night with several jails running
periodic daily, but I had to reboot in the morning because apache was
not able to serve data anymore. Everything else was working
correctly. I would say this was a very narrow problem case.

Bye,
Alexander.

-- 
Adult, n.:
	One old enough to know better.

http://www.Leidinger.net    Alexander @ Leidinger.net: PGP ID = B0063FE7
http://www.FreeBSD.org       netchild @ FreeBSD.org  : PGP ID = 72077137


