Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 19 Oct 2017 17:17:00 -0400
From:      Ken Merry <ken@freebsd.org>
To:        Shiva Bhanujan <Shiva.Bhanujan@Quorum.com>
Cc:        "freebsd-scsi@freebsd.org" <freebsd-scsi@freebsd.org>
Subject:   Re: FreeBSD 10.3/11.0 SCSI errors with Symbios Logic SAS3008 PCI-Express Fusion-MPT SAS-3
Message-ID:  <32C157CE-A122-435F-8430-9531BEEB5914@freebsd.org>
In-Reply-To: <3E746990-8C6D-4CA1-BD79-B5566CFB07F4@freebsd.org>
References:  <3A5A10BE32AC9E45B4A22F89FC90EC0701C3665D5D@QLEXC01.Quorum.local> <3A5A10BE32AC9E45B4A22F89FC90EC0701C3665E8B@QLEXC01.Quorum.local> <20171016144231.GA94858@mithlond.kdm.org> <3A5A10BE32AC9E45B4A22F89FC90EC0701C366610E@QLEXC01.Quorum.local> <20171017023126.GA6559@mithlond.kdm.org> <3A5A10BE32AC9E45B4A22F89FC90EC0701C3666143@QLEXC01.Quorum.local> <3A5A10BE32AC9E45B4A22F89FC90EC0701C3666345@QLEXC01.Quorum.local> <3A5A10BE32AC9E45B4A22F89FC90EC0701C3666352@QLEXC01.Quorum.local> <3E746990-8C6D-4CA1-BD79-B5566CFB07F4@freebsd.org>

next in thread | previous in thread | raw e-mail | index | archive | help
By the way, the message you referenced is here:

=
https://lists.freebsd.org/pipermail/freebsd-scsi/2017-January/007237.html =
<https://lists.freebsd.org/pipermail/freebsd-scsi/2017-January/007237.html=
>

And there is more here:

https://bugs.freenas.org/issues/14517 =
<https://bugs.freenas.org/issues/14517>;

Ken
=E2=80=94=20
Ken Merry
ken@FreeBSD.ORG



> On Oct 19, 2017, at 5:15 PM, Ken Merry <ken@freebsd.org> wrote:
>=20
> What does sg_readcap =E2=80=9416 show for these drives?
>=20
> If it has type 2 protection turned on, check to see what the DCIPZ =
value is in the control mode page:
>=20
> camcontrol modepage daX -v -m 10
>=20
> If that is set to 0, add a -e to the above command line and see if you =
can set it to 1.  That may or may not help anything.
>=20
> The mpr(4) and mps(4) drivers try to support protection information if =
it is turned on on the drive.  So, they set the protection information =
if protection information is turned on in the drive.  For that reason, =
setting the DCIPZ bit may not fix it.
>=20
> There could be a problem with how that is implemented that=E2=80=99s =
causing the drives to reject the command, but I=E2=80=99m not sure.
>=20
> If it is, Steve (CCed) can help us debug it.
>=20
> Ken
> =E2=80=94=20
> Ken Merry
> ken@FreeBSD.ORG
>=20
>=20
>=20
>> On Oct 17, 2017, at 9:54 PM, Shiva Bhanujan =
<Shiva.Bhanujan@Quorum.com> wrote:
>>=20
>> Sorry.  perhaps I have failed to mention, the SCSI errors are only w/ =
the seagate drives.  These are of the model ST4000NM0023.
>>=20
>>=20
>> From: Shiva Bhanujan
>> Sent: Tuesday, October 17, 2017 6:53 PM
>> To: Kenneth D. Merry
>> Cc: freebsd-scsi@freebsd.org <mailto:freebsd-scsi@freebsd.org>
>> Subject: RE: FreeBSD 10.3/11.0 SCSI errors with Symbios Logic SAS3008 =
PCI-Express Fusion-MPT SAS-3
>>=20
>> Please note, that this isn't an issue w/ Toshiba drives.  is this a =
firmware issue by any chance?
>>=20
>>=20
>>=20
>> From: owner-freebsd-scsi@freebsd.org =
<mailto:owner-freebsd-scsi@freebsd.org> [owner-freebsd-scsi@freebsd.org =
<mailto:owner-freebsd-scsi@freebsd.org>] on behalf of Shiva Bhanujan =
[shiva.bhanujan@quorum.net <mailto:shiva.bhanujan@quorum.net>]
>> Sent: Tuesday, October 17, 2017 6:08 AM
>> To: Kenneth D. Merry
>> Cc: freebsd-scsi@freebsd.org <mailto:freebsd-scsi@freebsd.org>
>> Subject: RE: FreeBSD 10.3/11.0 SCSI errors with Symbios Logic SAS3008 =
PCI-Express Fusion-MPT SAS-3
>>=20
>> Since I started having the SCSI errors, I ended up running sg_format =
to format the disks. I've found that once the disks are formatted using =
sg_format, there are no SCSI errors. The errors that show up during the =
format are towards the end of the dmesg output.
>>=20
>> (da0:mpr0:0:8:0): SCSI sense: NOT READY asc:4,4 (Logical unit not =
ready, format in progress)
>> (da0:mpr0:0:8:0): Progress: 9% (6256/65536) complete
>>=20
>> once the format is done, I can successfully format and partition =
using gpart.
>>=20
>>=20
>> The errors that show up when I try to run gpart for the first time =
are as follows:
>>=20
>> (da9:mpr0:0:17:0): READ(10). CDB: 28 00 00 00 00 00 00 01 00 00
>> (da9:mpr0:0:17:0): CAM status: SCSI Status Error
>> (da9:mpr0:0:17:0): SCSI status: Check Condition
>> (da9:mpr0:0:17:0): SCSI sense: ILLEGAL REQUEST asc:20,0 (Invalid =
command operation code)
>> (da9:mpr0:0:17:0): Error 22, Unretryable error
>>=20
>>=20
>> It seems that it's the read that is failing, and is being tagged as =
an illegal request. While sg_format will address the issue at hand, this =
isn't an option for us, because there are appliances that were formatted =
using FreeBSD 10.2, and an upgrade to 10.3 or 11.x might be an issue?
>>=20
>>=20
>>=20
>>=20
>> ________________________________
>> From: Kenneth D. Merry [ken@FreeBSD.ORG <mailto:ken@FreeBSD.ORG>]
>> Sent: Monday, October 16, 2017 7:31 PM
>> To: Shiva Bhanujan
>> Cc: freebsd-scsi@freebsd.org <mailto:freebsd-scsi@freebsd.org>
>> Subject: Re: FreeBSD 10.3/11.0 SCSI errors with Symbios Logic SAS3008 =
PCI-Express Fusion-MPT SAS-3
>>=20
>> On Tue, Oct 17, 2017 at 01:19:27 +0000, Shiva Bhanujan wrote:
>>> Hi Ken,
>>>=20
>>> I've attached the output of dmesg. Here's the SCSI CDB for a sample =
drive, da3.
>>>=20
>>> (da3:mpr0:0:11:0): READ(10). CDB: 28 00 00 00 00 00 00 01 00 00
>>> (da3:mpr0:0:11:0): READ(10). CDB: 28 00 00 00 02 00 00 01 00 00
>>> (da3:mpr0:0:11:0): READ(16). CDB: 88 00 00 00 00 01 d1 c0 ba 00 00 =
00 01 00 00 00
>>> (da3:mpr0:0:11:0): READ(16). CDB: 88 00 00 00 00 01 d1 c0 bc 00 00 =
00 01 00 00 00
>>> (da3:mpr0:0:11:0): READ(10). CDB: 28 00 00 a0 00 22 00 01 00 00
>>> (da3:mpr0:0:11:0): READ(10). CDB: 28 00 00 a0 02 22 00 01 00 00
>>> (da3:mpr0:0:11:0): READ(16). CDB: 88 00 00 00 00 01 d1 c0 ba 22 00 =
00 01 00 00 00
>>> (da3:mpr0:0:11:0): READ(16). CDB: 88 00 00 00 00 01 d1 c0 bc 22 00 =
00 01 00 00 00
>>> (da3:mpr0:0:11:0): READ(10). CDB: 28 00 00 00 00 22 00 01 00 00
>>> (da3:mpr0:0:11:0): READ(10). CDB: 28 00 00 00 02 22 00 01 00 00
>>> (da3:mpr0:0:11:0): READ(10). CDB: 28 00 00 9f fc 22 00 01 00 00
>>> (da3:mpr0:0:11:0): READ(10). CDB: 28 00 00 9f fe 22 00 01 00 00
>>> (da3:mpr0:0:11:0): READ(10). CDB: 28 00 00 00 00 00 00 01 00 00
>>> (da3:mpr0:0:11:0): READ(10). CDB: 28 00 00 00 02 00 00 01 00 00
>>> (da3:mpr0:0:11:0): READ(16). CDB: 88 00 00 00 00 01 d1 c0 ba 00 00 =
00 01 00 00 00
>>> (da3:mpr0:0:11:0): READ(16). CDB: 88 00 00 00 00 01 d1 c0 bc 00 00 =
00 01 00 00 00
>>> (da3:mpr0:0:11:0): READ(10). CDB: 28 00 00 a0 00 22 00 01 00 00
>>> (da3:mpr0:0:11:0): READ(10). CDB: 28 00 00 a0 02 22 00 01 00 00
>>> (da3:mpr0:0:11:0): READ(16). CDB: 88 00 00 00 00 01 d1 c0 ba 22 00 =
00 01 00 00 00
>>> (da3:mpr0:0:11:0): READ(16). CDB: 88 00 00 00 00 01 d1 c0 bc 22 00 =
00 01 00 00 00
>>> (da3:mpr0:0:11:0): READ(10). CDB: 28 00 00 00 00 22 00 01 00 00
>>> (da3:mpr0:0:11:0): READ(10). CDB: 28 00 00 00 02 22 00 01 00 00
>>> (da3:mpr0:0:11:0): READ(10). CDB: 28 00 00 9f fc 22 00 01 00 00
>>> (da3:mpr0:0:11:0): READ(10). CDB: 28 00 00 9f fe 22 00 01 00 00
>>>=20
>>>=20
>>> My understanding is that FreeBSD 11.1 contains the mpr(4) driver? =
I've tried this w/ 11.1, w/ the same results.
>>>=20
>>=20
>> Yes, the mpr(4) driver is in all recent FreeBSD releases.
>>=20
>> In looking at the dmesg, this is telling:
>>=20
>> (da0:mpr0:0:8:0): WRITE(6). CDB: 0a 00 00 00 01 00
>> (da0:mpr0:0:8:0): CAM status: SCSI Status Error
>> (da0:mpr0:0:8:0): SCSI status: Check Condition
>> (da0:mpr0:0:8:0): SCSI sense: NOT READY asc:4,4 (Logical unit not =
ready, format in progress)
>> (da0:mpr0:0:8:0): Progress: 9% (6256/65536) complete
>> (da0:mpr0:0:8:0): Error 16, Unretryable error
>>=20
>> If the drives are in the process of formatting, I guess it may make =
sense
>> for them to reject read commands. Otherwise, it makes no sense for a =
hard
>> drive to reject reads.
>>=20
>> Are you able to check the status of the format? You should be able to =
send
>> a test unit ready and figure out how far along the format is:
>>=20
>> camcontrol tur da0 -v
>>=20
>> And so on for each of the drives.
>>=20
>> Ken
>>=20
>>> ________________________________
>>> From: Kenneth D. Merry [ken@FreeBSD.ORG =
<mailto:ken@FreeBSD.ORG><mailto:ken@FreeBSD.ORG> =
<mailto:ken@FreeBSD.ORG>>]
>>> Sent: Monday, October 16, 2017 7:42 AM
>>> To: Shiva Bhanujan
>>> Cc: freebsd-scsi@freebsd.org =
<mailto:freebsd-scsi@freebsd.org><mailto:freebsd-scsi@freebsd.org> =
<mailto:freebsd-scsi@freebsd.org>>
>>> Subject: Re: FreeBSD 10.3/11.0 SCSI errors with Symbios Logic =
SAS3008 PCI-Express Fusion-MPT SAS-3
>>>=20
>>> On Fri, Oct 13, 2017 at 20:12:02 +0000, Shiva Bhanujan wrote:
>>>> Hello,
>>>>=20
>>>> I have a FreeBSD 10.3 install in a HVM on XenServer 6.5. The HBA330 =
SAS-3 controller is in pcipassthrough mode to the FreeBSD VM. When I try =
to access the disks (/dev/da0...) using gpart, I get SCSI errors, like =
the following:
>>>>=20
>>>>=20
>>>>=20
>>>> (da0:mpr0:0:0:0): CAM status: SCSI Status Error
>>>>=20
>>>> (da0:mpr0:0:0:0): SCSI status: Check Condition
>>>>=20
>>>> (da0:mpr0:0:0:0): SCSI sense: ILLEGAL REQUEST asc:20,0 (Invalid =
command operation code)
>>>>=20
>>>=20
>>> The error message above is missing the SCSI CDB. We need that in =
order to
>>> figure out what command the drive is complaining about.
>>>=20
>>> The error message means that FreeBSD is sending a SCSI command that =
the
>>> drive doesn't support. That can be benign, or it can cause a =
problem.
>>>=20
>>> So, what error does gpart give you when you have this problem?
>>>=20
>>>> I get the same errors w/ FreeBSD 11.0 also. Running 10.3 natively =
also has the same result.
>>>>=20
>>>=20
>>>=20
>>>> Please note, that these errors don't show up on a Fusion-MPT SAS-2 =
controller, or a MegaRAID SAS 2208 controller. Additionally, FreeBSD =
10.2 doesn't have any SCSI errors on the HBA330 SAS-3 controller either.
>>>>=20
>>>>=20
>>>>=20
>>>>=20
>>>> Is there a different version of the mpr driver I should be using? I =
haven't checked the differences between the mpr driver in 10.2 vs 10.3 =
and 11.0. I do see that there are others who have experienced these =
issues. Can somebody please provide me some pointers
>>>> as to why this is occurring? Or if there are some driver changes =
that I might be able to incorporate?
>>>>=20
>>>=20
>>> In general, the latest mpr(4) driver is the best one. The driver =
itself
>>> generally doesn't send SCSI commands (there are a few exceptions), =
but
>>> rather passes them through from the upper layers of CAM.
>>>=20
>>>> Please note, that I have gone through the mail titled "scsi error =
at SEAGATE ST1200MM0088 TT31" and have started sg_format on all the =
SEAGATE disks. Having said that, I still need to figure out what would =
happen, if the disks were written to using FreeBSD 10.2, which doesn't =
seem to have SCSI errors, and when I try to upgrade to 10.3. Any help is =
appreciated.
>>>>=20
>>>=20
>>> Send the full error messages, and we may be able to figure out =
what's going
>>> on.
>>>=20
>>> Ken
>>> --
>>> Kenneth Merry
>>> ken@FreeBSD.ORG <mailto:ken@FreeBSD.ORG><mailto:ken@FreeBSD.ORG> =
<mailto:ken@FreeBSD.ORG>><mailto:ken@FreeBSD.ORG> =
<mailto:ken@FreeBSD.ORG>>
>>> ________________________________
>>=20
>>=20
>> --
>> Kenneth Merry
>> ken@FreeBSD.ORG <mailto:ken@FreeBSD.ORG><mailto:ken@FreeBSD.ORG> =
<mailto:ken@FreeBSD.ORG>>
>> ________________________________
>> _______________________________________________
>> freebsd-scsi@freebsd.org <mailto:freebsd-scsi@freebsd.org> mailing =
list
>> https://lists.freebsd.org/mailman/listinfo/freebsd-scsi =
<https://lists.freebsd.org/mailman/listinfo/freebsd-scsi>;
>> To unsubscribe, send any mail to =
"freebsd-scsi-unsubscribe@freebsd.org =
<mailto:freebsd-scsi-unsubscribe@freebsd.org>"
>=20
> _______________________________________________
> freebsd-scsi@freebsd.org mailing list
> https://lists.freebsd.org/mailman/listinfo/freebsd-scsi
> To unsubscribe, send any mail to =
"freebsd-scsi-unsubscribe@freebsd.org"




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?32C157CE-A122-435F-8430-9531BEEB5914>