Date:      03 Jun 2003 18:40:32 +0200
From:      Kern Sibbald <kern@sibbald.com>
To:        "Justin T. Gibbs" <gibbs@scsiguy.com>
Cc:        mjacob@feral.com
Subject:   Re: SCSI tape data loss
Message-ID:  <1054658432.13630.252.camel@rufus>
In-Reply-To: <882210000.1054657530@aslan.btc.adaptec.com>
References:  <3EDB31AB.16420.C8964B7D@localhost> <3EDB59A4.27599.C93270FB@localhost> <20030602110836.H71034@beppo> <20030602131225.F71034@beppo> <1054645616.13630.161.camel@rufus>  <20030603072944.U44880@beppo> <1054652678.13630.209.camel@rufus> <882210000.1054657530@aslan.btc.adaptec.com>

Yes, I probably should move the clrerror() and the
check/set of errno inside the check for "stat == -1".
However, the code, though odd, is correct, since
I do not use errno unless the status is -1.
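
A minimal sketch of what moving the errno handling inside
the "stat == -1" check could look like. This is not the
actual Bacula code: write_block() and its return convention
are hypothetical, and mapping a zero errno or a short write
to ENOSPC is an assumption that mirrors the behavior
described in the quoted message below.

```cpp
#include <cassert>
#include <cerrno>
#include <cstddef>
#include <unistd.h>   // write(), ssize_t

// Hypothetical helper, not Bacula's real code.
// Returns 0 when all wlen bytes were written, otherwise an errno-style code.
int write_block(int fd, const void *buf, size_t wlen)
{
   assert(buf != nullptr);
   ssize_t stat = write(fd, buf, wlen);
   if (stat == -1) {
      // errno is only meaningful here, right after the failed call.
      int err = errno;
      // Assumption: fall back to ENOSPC if errno is somehow 0.
      return err != 0 ? err : ENOSPC;
   }
   if (static_cast<size_t>(stat) != wlen) {
      // Short write: this call did not set errno, so it is not consulted.
      // Assumption: treat a short write as end of medium.
      return ENOSPC;
   }
   return 0;
}
```

With this shape, a stale errno left over from an earlier
call can never be mistaken for the result of this write().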

Our most recent tests are even more interesting.
We are getting the same data loss any time
Bacula switches tapes.  This means the data loss
does not have anything in particular to do with
the LEOM or PEOM status.

By the way, the funny casting is mandatory in C++,
because the ssize_t returned by write() is not the
same type as the size_t count that is written.
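
To illustrate the signed/unsigned issue, here is a small
sketch of two ways to compare the result of write() with
the requested length. Both function names are hypothetical,
and the uint32_t type for wlen is an assumption taken from
the cast in the quoted code. Compilers typically warn about
(rather than reject) a direct ssize_t to size_t comparison,
so an explicit cast or sign check keeps the intent visible.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <sys/types.h>   // ssize_t

// Hypothetical helper: test the sign first, then compare as unsigned.
bool wrote_all(ssize_t stat, size_t wlen)
{
   // -1 is rejected before it can be converted to a huge unsigned value.
   return stat >= 0 && static_cast<size_t>(stat) == wlen;
}

// Hypothetical helper: the single-cast form, as in the quoted code.
bool wrote_all_cast(ssize_t stat, uint32_t wlen)
{
   // A status of -1 becomes 0xFFFFFFFF after the cast, so this form is
   // only safe when wlen can never be UINT32_MAX.
   assert(wlen != UINT32_MAX);
   return static_cast<uint32_t>(stat) == wlen;
}
```

Either form reports a mismatch when write() returns -1;
the first simply does not depend on how -1 converts.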

More after I look at the most recent test results.

Best regards,

Kern

On Tue, 2003-06-03 at 18:25, Justin T. Gibbs wrote:
> > What is clear from the output is that the write()
> > is returning a -1 status. errno could possibly be 0,
> > in which case I set it to ENOSPC, if it is not 0
> > then it is ENOSPC judging by the error message that
> > is printed "Write error on device ...".
> > 
> > You may want to see more, but here is the basic code
> > that does the write:
> >    if ((uint32_t)(stat=write(dev->fd, block->buf, (size_t)wlen)) != wlen) {
> >       /* We should check for errno == ENOSPC, BUT many 
> >        * devices simply report EIO when it is full.
> >        * with a little more thought we may be able to check
> >        * capacity and distinguish real errors and EOT
> >        * conditions.  In any case, we probably want to
> >        * simulate an End of Medium.
> >        */
> >       clrerror_dev(dev, -1);
> 
> Apart from the funny casting, the only obvious bug is that you
> are expecting errno to be set on every syscall.  Errno is only
> valid if stat == -1 or you explicitly clear it prior to the
> syscall (or after the last time it was set).  You don't seem
> to be doing that here.
> 
> See the errno man page for details
> 
> --
> Justin


