Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 13 Oct 2005 12:26:32 +0930
From:      Greg 'groggy' Lehey <grog@MySQL.com>
To:        NMH <drumslayer2@yahoo.com>, Kris Kennaway <kris@FreeBSD.org>, justin.bastedo@gmail.com
Cc:        FreeBSD-DataBase <freebsd-database@freebsd.org>, questions <freebsd-questions@freebsd.org>
Subject:   Re: Mysql server not able to stay running on anything but Linux?
Message-ID:  <20051013025632.GN49168@wantadilla.lemis.com>
In-Reply-To: <20051012225544.83673.qmail@web32901.mail.mud.yahoo.com> <20051012214154.17245.qmail@web32913.mail.mud.yahoo.com> <20051012210757.GA97184@xor.obsecurity.org> <20051012201622.49704.qmail@web32905.mail.mud.yahoo.com>
References:  <20051012225544.83673.qmail@web32901.mail.mud.yahoo.com> <20051012210757.GA97184@xor.obsecurity.org> <20051012214154.17245.qmail@web32913.mail.mud.yahoo.com> <20051012201622.49704.qmail@web32905.mail.mud.yahoo.com> <20051012210757.GA97184@xor.obsecurity.org> <20051012201622.49704.qmail@web32905.mail.mud.yahoo.com>

next in thread | previous in thread | raw e-mail | index | archive | help

--GTjM2GI4zMe3xHZ5
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline

Note the From: address.

On Wednesday, 12 October 2005 at 13:16:22 -0700, NMH wrote:
>
> I am stuck with a delema and I feel like a damn troll. But.. I have
> a Mysql Database that I posted about earlier. It seems that it is
> only able to not die by running on BSD 4.11 with Linux Threads. My
> boss is convinced this means that Linux is better for MySQL and
> wants that installed now.
>
> We even got a support contact from Mysql that so far has gotten us
> nothing for almost a month while our production database server died
> up to 3 times a day.  (and lots of we're looking into it's)

One of the reasons why you haven't got much more than "we're looking
into it"s is because we haven't been able to reproduce the problem;
you acknowledge this in follow-up mail quoted below.

As you know from various threads on the FreeBSD lists, including this
one, the typical answer is "works fine for me".  That doesn't mean
that we're not taking your problems seriously, but we do have a
significant issue just reproducing the problem.  We have a number of
choices:

1.  Try different hardware or a different version of FreeBSD.  It's
    conceivable that there's something about your specific hardware,
    or about the combination of i386 kernel on amd64 in general, that
    triggers the problem.

2.  Do debugging on your production servers.  This isn't really a
    choice at all: it would involve even more down time.

3.  Get you to run a more stable version of FreeBSD while we
    investigate the problem.  This is the method we chose.  I haven't
    heard from you since the weekend, so I hope I'm correct in
    understanding that you currently don't have stability problems.
    On our side, we have installed FreeBSD 5.4 on one of our internal
    machines, and we're trying to reproduce the problem there.

> We were running fine but a little slow on FreeBSD 5.1-P11 and MySQL
> 4.0.18.(apperantly before a big Lib change) We had to move quicker
> than we wanted to a new server running FreeBSD 5.4 and MySQL 4.11
> (becouse of a dual HD death) Under production load the new 5.4
> server fell over regulary. It has only now become stable by wiping
> it and running it on FreeBSD 4.11 with Linux Threads. (it regularly
> has over 400+ threads)

Kris obviously understood that by this statement you meant a kernel
crash.  My understanding is that only the mysqld server is crashing.
Is this still correct?

> I want to try FreeBSD 5.4 AMD64 (the machines are Opteron) or 6.0
> but my boss feels that would be a waste given that MYSQL doesn't
> support Mysql on AMD64 well enough.

I think it would be a good idea to try this.  It's one of the things
that we intend to do in-house as soon as we can reproduce the problem
at all.

> Can anyone help or offer assistance to help track this down? Perhaps
> also any annecdotes or examples I can show my boss that other people
> have as busy MYSQL databases on BSD 5.X. We paid 3K to Mysql for
> help and so far they have been unable to offer any clues as to why
> ours will not stay stable on anything but Linux threads.

Have you had any kind of crash under 4.x?  I don't think that the
issue is so much linuxthreads as 5.x.

> As I say I only manage the server, I don't program the databases. Is
> there anything I should/could look for database wise that could
> trigger such things?

So far we've had the machine up in-house and have not reproduced the
problem.  If you have a spare machine that we could run under more
typical conditions on your premises, this might help.

On Wednesday, 12 October 2005 at 17:07:57 -0400, Kris Kennaway wrote:
>
> Unfortunately you'll need to provide details of how it "fell over"
> (e.g. panic messages + backtraces).

As I mention above, I think this is only a server crash.  I mentioned
this on the list a couple of weeks ago: all the backtraces I have seen
have been a SIGSEGV out of mutex_unlock_common.

>> Can anyone help or offer assistance to help track this down?
>> Perhaps also any annecdotes or examples I can show my boss that
>> other people have as busy MYSQL databases on BSD 5.X. We paid 3K to
>> Mysql for help and so far they have been unable to offer any clues
>> as to why ours will not stay stable on anything but Linux
>> threads. I feel really sad that so far no one has responded to my
>> posts and it feels like a victory for linux.
>
> If I was your boss I'd be asking why mysql hasn't delivered on their
> support contract.

Indeed.  I think we have, though.  There's a certain class of bugs
which are almost impossible to fix because they're so hard to chase
down.  This is one of them.

On Wednesday, 12 October 2005 at 14:41:54 -0700, NMH wrote:
> --- Kris Kennaway <kris@obsecurity.org> wrote:
>
>> If I was your boss I'd be asking why mysql hasn't delivered on
>> their support contract.
>

> Well I think support has many meanings. I decided to look at what he
> paid for and it says:
>  We get "access" to the mysql devlopers...
>  We get "access" to certified binaries. (none of which
> are FreeBSD)
>
> So, sad as it may seem, if your running on FreeBSD, $3000.00 buys
> you someone to talk to. It doesn't mean they have to say anything
> meaningful back or within any reasonable time. :(

We try to handle all problems within a reasonable time.  The fact that
you're running FreeBSD does mean that you don't get certified
binaries, but that's the only drawback.  And the fact that the time
for this problem has been unreasonable has nothing to do with the fact
you're running FreeBSD: it's because it's a bugger to track down.

> Also it doesn't cover tuning or code review etc. So if our problem
> is caused by anything like that.. they don't even have to tell us I
> would imagine.

FWIW, my personal opinion (I don't speak for MySQL, but I don't think
the official line would be much different) is that a database server
shouldn't crash.  If it does, it's a bug.  The question we need to
clarify is not whether your code is a problem, but whether the crash
is due to FreeBSD or MySQL.  Currently the evidence is pointing to
FreeBSD, but we're keeping an open mind on that.

> So.. that is why as always, these lists and fellow FreeBSD people
> are their own best support.

Apparently we've given you the impression that we're not doing
anything.  I'm sorry about that, and it's certainly not the case.  So
far all these lists have done is to provide the valuable information
that this is not a general problem.

On Wednesday, 12 October 2005 at 15:55:44 -0700, NMH wrote:
>  --- Justin Bastedo <justin.bastedo@gmail.com> wrote:
>
>
>  Also no one from Mysql seemed to think going to STABLE would do
>  anything. It's a very very tough thing to play lets see what
>  happens if.. On production servers. The time it takes us just to
>  bounce from one machine to another is pretty decent. Let alone
>  downtime etc.

This is puzzling.  The bug report is visible to the world at
http://bugs.mysql.com/bug.php?id=12251 : it was entered against
5.4-STABLE.  I did express the opinion, based on code analysis, that
upgrading wouldn't make much difference.

>> I'm definately interested to see what the final solution is on this
>> matter. I can easily forsee our db load getting close to that level
>> within the next couple of months depending on how these contracts
>> play out.

Whether or not you have a contract, I'd be very interested in hearing
of any problems you have with high load.  Obviously you'll get better
support if you do have a contract, Nicole's impressions
notwithstanding.

>  Well, I will post whatever happens. As I said the most annoying
>  part to all this is that so far we have been unable to crash the
>  5.4 servers via any testing means we (and Mysql) could come up
>  with. Yet in production they fell over as soon as the load started
>  to climb and points in between. But never on the slope of
>  decreasing load for the day. (always felt that was odd) IE it would
>  die on the bell curve increases but never at the coresponding
>  downward cycle of the daily busy curve.

I noticed that too; I suspect it's not statistically relevant.

>   If anyone knows of any particulary evil testing programs I would
>  love to know. It's tough to simulate 30+ servers asking lots
>  questions and forcing tons of writes.

If you (anyone) do have useful input, please keep me in the loop too.

Greg
--
Greg Lehey, Senior Software Engineer
MySQL AB, http://www.mysql.com/
Echunga, South Australia
See complete headers for address and phone numbers.

Are you MySQL certified?  http://www.mysql.com/certification/

--GTjM2GI4zMe3xHZ5
Content-Type: application/pgp-signature
Content-Disposition: inline

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.1 (FreeBSD)

iD8DBQFDTczgIubykFB6QiMRAs+yAJ9YsDJE1iooFZzplYwcmm9KFJC7JwCcCtMy
FfcmSvl0PU3T9UD0EWgm5mY=
=PmgN
-----END PGP SIGNATURE-----

--GTjM2GI4zMe3xHZ5--



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20051013025632.GN49168>