Date:      Thu, 6 Mar 2014 11:12:06 GMT
From:      Borja Marcos <borjam@sarenet.es>
To:        freebsd-gnats-submit@FreeBSD.org
Subject:   kern/187312: LSI Logic Invader card fails at boot
Message-ID:  <201403061112.s26BC61N020678@cgiserv.freebsd.org>
Resent-Message-ID: <201403061120.s26BK1tT043394@freefall.freebsd.org>


>Number:         187312
>Category:       kern
>Synopsis:       LSI Logic Invader card fails at boot
>Confidential:   no
>Severity:       non-critical
>Priority:       low
>Responsible:    freebsd-bugs
>State:          open
>Quarter:        
>Keywords:       
>Date-Required:
>Class:          sw-bug
>Submitter-Id:   current-users
>Arrival-Date:   Thu Mar 06 11:20:01 UTC 2014
>Closed-Date:
>Last-Modified:
>Originator:     Borja Marcos
>Release:        FreeBSD 10-STABLE
>Organization:
>Environment:
FreeBSD elibm 10.0-STABLE FreeBSD 10.0-STABLE #1: Tue Mar  4 09:36:48 UTC 2014     root@:/usr/obj/usr/src/sys/GENERIC  amd64

>Description:
There is some trouble with this LSI Logic RAID card:

mfi0 Adapter:
    Product Name: ServeRAID M5210e
   Serial Number: 3CJ0SG          
        Firmware: 24.0.2-0013
     RAID Levels: JBOD, RAID0, RAID1, RAID10
  Battery Backup: not present
           NVRAM: 32K
  Onboard Memory: 0M
  Minimum Stripe: 64K
  Maximum Stripe: 64K


It's an Invader series card. When booting from install media (for example, a memstick) FreeBSD detects the RAID volume (/dev/mfid0) without issues and the installation completes. There's just a warning message:

However, once the system is installed, FreeBSD is unable to boot. The loader works, the kernel starts, but it fails when trying to mount the root file system from /dev/mfid0p2.

It fails with 10-RELEASE and 10-STABLE.


The errors are:

mfi0: Adapter RESET condition is detected
mfi0: First step of FW reset initialized…
mfi0: ADP_RESET_TBOLT: Hostdiag=0xa0
mfi0: First stage of reset complete, second stage initialized…
mfi0: APJ ****requeue command 0xfffffe000104dfc0 index=120
mfi0: second stage of reset complete, FW is ready now.
mfi0: 11032 (boot + 3s/0x0020/info) - Firmware initialization started (PCI ID 005d/1000/045b/1014)
mfi0: 11033 (boot + 3s/0x0020/info) - Firmware version 4.200.21-2840
…

I haven't copied all the errors; the last time I tried, it ended up in an endless loop.

Interestingly, IT WORKS with 11-CURRENT. I can successfully install 11-CURRENT and boot normally.

>How-To-Repeat:
Create a RAID 1 volume (it's the only option I have tried) and install 10-RELEASE or 10-STABLE on /dev/mfid0, standard installation, no frills.

>Fix:
It works with 11-CURRENT.

Comparing the mfi driver between CURRENT and STABLE, I noticed that three files have changes:

sys/dev/mfi/mfi_pci.c
sys/dev/mfi/mfi_tbolt.c
sys/dev/mfi/mfivar.h

http://svnweb.freebsd.org/base/head/sys/dev/mfi/mfi_pci.c?revision=261535&view=markup
http://svnweb.freebsd.org/base/head/sys/dev/mfi/mfi_tbolt.c?revision=261535&view=markup
http://svnweb.freebsd.org/base/head/sys/dev/mfi/mfivar.h?revision=261535&view=markup

I copied the three files into a -STABLE source tree checked out yesterday, built a kernel and tried it. The -STABLE kernel with mfi compiled from these new files works. It can boot from /dev/mfid0 and mount /dev/mfid0p2 as the root file system.
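
For anyone wanting to repeat the test, the copy step could be scripted roughly as follows. This is only a sketch: the function name copy_mfi_files, the ".orig" backup suffix and the two source-root arguments are illustrative assumptions, not part of any FreeBSD tool. It only copies the three files listed above; the kernel still has to be rebuilt and installed afterwards in the usual way.

```python
import shutil
from pathlib import Path

# The three driver files named in this report, relative to the source root.
MFI_FILES = (
    "sys/dev/mfi/mfi_pci.c",
    "sys/dev/mfi/mfi_tbolt.c",
    "sys/dev/mfi/mfivar.h",
)


def copy_mfi_files(head_root, stable_root, backup_suffix=".orig"):
    """Copy the listed files from a head checkout into a -STABLE checkout.

    Both roots are hypothetical local paths supplied by the caller.
    Any file that would be overwritten is first saved next to it with
    backup_suffix appended. Returns the list of destination paths.
    """
    copied = []
    for rel in MFI_FILES:
        src = Path(head_root) / rel
        dst = Path(stable_root) / rel
        if not src.is_file():
            raise FileNotFoundError(src)
        dst.parent.mkdir(parents=True, exist_ok=True)
        if dst.exists():
            # Keep the original -STABLE version so the change is easy to revert.
            shutil.copy2(dst, dst.with_name(dst.name + backup_suffix))
        shutil.copy2(src, dst)
        copied.append(dst)
    return copied
```

Keeping a backup of each overwritten file makes it simple to return to the stock -STABLE driver if the test kernel misbehaves.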

mfi0: <Invader> port 0x4f00-0x4fff mem 0x913f0000-0x913fffff,0x91400000-0x914fffff irq 34 at device 0.0 on pci22
mfi0: Using MSI
mfi0: Megaraid SAS driver Ver 4.23 
mfi0: FW MaxCmds = 240, limiting to 128
mfi0: MaxCmd = 240, Drv MaxCmd = 128, MaxSgl = 70, state = 0xb73c00f0
mfi0: 11019 (447416787s/0x0020/info) - Shutdown command received from host
mfi0: 11020 (boot + 10s/0x0020/info) - Firmware initialization started (PCI ID 005d/1000/045b/1014)
mfi0: 11021 (boot + 10s/0x0020/info) - Firmware version 4.200.21-2840
mfi0: 11022 (boot + 12s/0x0020/info) - Package version 24.0.2-0013
mfi0: 11023 (boot + 12s/0x0020/info) - Board Revision  00AL055
mfi0: 11024 (boot + 33s/0x0002/info) - Inserted: PD 02(e0x3e/s25)
mfi0: 11025 (boot + 33s/0x0002/info) - Inserted: PD 02(e0x3e/s25) Info: enclPd=3e, scsiType=0, portMap=00, sasAddr=4433221102000000,0000000000000000
mfi0: 11026 (boot + 33s/0x0002/info) - PD 02(e0x3e/s25) FRU is 90Y8649
mfi0: 11027 (boot + 33s/0x0002/info) - Inserted: PD 03(e0x3e/s24)
mfi0: 11028 (boot + 33s/0x0002/info) - Inserted: PD 03(e0x3e/s24) Info: enclPd=3e, scsiType=0, portMap=01, sasAddr=4433221103000000,0000000000000000
mfi0: 11029 (boot + 33s/0x0002/info) - PD 03(e0x3e/s24) FRU is 90Y8649
mfi0: 11030 (447416847s/0x0020/info) - Time established as 03/06/14 10:27:27; (33 seconds since power on)

mfi0: 11031 (447416924s/0x0020/info) - Time established as 03/06/14 10:28:44; (111 seconds since power on)
mfi0: 11032 (boot + 3s/0x0020/info) - Firmware initialization started (PCI ID 005d/1000/045b/1014)
mfi0: 11033 (boot + 3s/0x0020/info) - Firmware version 4.200.21-2840
mfi0: 11034 (447417030s/0x0020/CRIT) - Controller encountered a fatal error and was reset
mfi0: 11035 (447417032s/0x0020/info) - Package version 24.0.2-0013
mfi0: 11036 (447417032s/0x0020/info) - Board Revision  00AL055
mfi0: 11037 (447417052s/0x0002/info) - Inserted: PD 02(e0x3e/s25)
mfi0: 11038 (447417052s/0x0002/info) - Inserted: PD 02(e0x3e/s25) Info: enclPd=3e, scsiType=0, portMap=00, sasAddr=4433221102000000,0000000000000000
mfi0: 11039 (447417052s/0x0002/info) - PD 02(e0x3e/s25) FRU is 90Y8649
mfi0: 11040 (447417052s/0x0002/info) - Inserted: PD 03(e0x3e/s24)
mfi0: 11041 (447417052s/0x0002/info) - Inserted: PD 03(e0x3e/s24) Info: enclPd=3e, scsiType=0, portMap=01, sasAddr=4433221103000000,0000000000000000
mfi0: 11042 (447417052s/0x0002/info) - PD 03(e0x3e/s24) FRU is 90Y8649
mfi0: 11043 (447417052s/0x0020/info) - Controller Hot Plug detected
mfi0: 11044 (boot + 10s/0x0020/info) - Firmware initialization started (PCI ID 005d/1000/045b/1014)

mfi0: 11045 (boot + 10s/0x0020/info) - Firmware version 4.200.21-2840
mfi0: 11046 (boot + 12s/0x0020/info) - Package version 24.0.2-0013
mfi0: 11047 (boot + 12s/0x0020/info) - Board Revision  00AL055
mfi0: 11048 (boot + 32s/0x0002/info) - Inserted: PD 02(e0x3e/s25)
mfi0: 11049 (boot + 32s/0x0002/info) - Inserted: PD 02(e0x3e/s25) Info: enclPd=3e, scsiType=0, portMap=00, sasAddr=4433221102000000,0000000000000000
mfi0: 11050 (boot + 32s/0x0002/info) - PD 02(e0x3e/s25) FRU is 90Y8649
mfi0: 11051 (boot + 32s/0x0002/info) - Inserted: PD 03(e0x3e/s24)
mfi0: 11052 (boot + 32s/0x0002/info) - Inserted: PD 03(e0x3e/s24) Info: enclPd=3e, scsiType=0, portMap=01, sasAddr=4433221103000000,0000000000000000
mfi0: 11053 (boot + 32s/0x0002/info) - PD 03(e0x3e/s24) FRU is 90Y8649
mfi0: 11054 (447417194s/0x0020/info) - Time established as 03/06/14 10:33:14; (32 seconds since power on)
mfi0: 11055 (447417277s/0x0020/info) - Time established as 03/06/14 10:34:37; (115 seconds since power on)

mfid0 on mfi0
mfid0: 121116MB (248045568 sectors) RAID volume (no label) is optimal
mfi0: Failed to get command
mfi0: <Invader> port 0x4f00-0x4fff mem 0x913f0000-0x913fffff,0x91400000-0x914fffff irq 34 at device 0.0 on pci22
mfi0: Using MSI
mfi0: Megaraid SAS driver Ver 4.23 
mfi0: FW MaxCmds = 240, limiting to 128
mfi0: MaxCmd = 240, Drv MaxCmd = 128, MaxSgl = 70, state = 0xb73c00f0
mfi0: 11056 (447417497s/0x0020/info) - Shutdown command received from host
mfi0: 11057 (boot + 10s/0x0020/info) - Firmware initialization started (PCI ID 005d/1000/045b/1014)
mfi0: 11058 (boot + 10s/0x0020/info) - Firmware version 4.200.21-2840
mfi0: 11059 (boot + 12s/0x0020/info) - Package version 24.0.2-0013
mfi0: 11060 (boot + 12s/0x0020/info) - Board Revision  00AL055
mfi0: 11061 (boot + 33s/0x0002/info) - Inserted: PD 02(e0x3e/s25)



mfi0: 11062 (boot + 33s/0x0002/info) - Inserted: PD 02(e0x3e/s25) Info: enclPd=3e, scsiType=0, portMap=00, sasAddr=4433221102000000,0000000000000000
mfi0: 11063 (boot + 33s/0x0002/info) - PD 02(e0x3e/s25) FRU is 90Y8649
mfi0: 11064 (boot + 33s/0x0002/info) - Inserted: PD 03(e0x3e/s24)
mfi0: 11065 (boot + 33s/0x0002/info) - Inserted: PD 03(e0x3e/s24) Info: enclPd=3e, scsiType=0, portMap=01, sasAddr=4433221103000000,0000000000000000
mfi0: 11066 (boot + 33s/0x0002/info) - PD 03(e0x3e/s24) FRU is 90Y8649
mfi0: 11067 (447417556s/0x0020/info) - Time established as 03/06/14 10:39:16; (33 seconds since power on)
mfi0: 11068 (447417632s/0x0020/info) - Time established as 03/06/14 10:40:32; (110 seconds since power on)
mfid0 on mfi0
mfid0: 121116MB (248045568 sectors) RAID volume (no label) is optimal
Trying to mount root from ufs:/dev/mfid0p2 [rw]...




There's still some problem with the driver (see the "fatal error" message) but at least it works. I'm going to do some checks and run some heavy I/O to see whether it is reliable.
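
To pick entries such as the "fatal error" one out of a long console log, a small filter along these lines may help. It is a sketch based only on the line layout visible in the logs above (device, sequence number, then time/hex code/severity in parentheses, then the message). The field names, including calling the hex value "code", are my own labels and assumptions, not names taken from the driver.

```python
import re
from typing import NamedTuple

# Layout assumed from the log lines in this report:
#   mfi0: 11034 (447417030s/0x0020/CRIT) - Controller encountered ...
EVENT_RE = re.compile(
    r"^(?P<dev>mfi\d+): (?P<seq>\d+) "
    r"\((?P<when>[^/]+)/(?P<code>0x[0-9a-fA-F]+)/(?P<level>\w+)\) - (?P<msg>.*)$"
)


class MfiEvent(NamedTuple):
    dev: str    # controller name, e.g. "mfi0"
    seq: int    # event sequence number
    when: str   # "boot + 3s" or "447417030s", kept as text
    code: int   # hex field between time and severity (meaning assumed)
    level: str  # severity as printed, e.g. "info" or "CRIT"
    msg: str    # event description


def parse_event(line):
    """Return an MfiEvent for a firmware event line, or None otherwise."""
    m = EVENT_RE.match(line.strip())
    if m is None:
        return None
    return MfiEvent(m["dev"], int(m["seq"]), m["when"],
                    int(m["code"], 16), m["level"], m["msg"])


def non_info_events(lines):
    """Keep only parsed events whose severity is something other than info."""
    events = (parse_event(line) for line in lines)
    return [e for e in events if e is not None and e.level.lower() != "info"]


if __name__ == "__main__":
    sample = [
        "mfi0: 11033 (boot + 3s/0x0020/info) - Firmware version 4.200.21-2840",
        "mfi0: 11034 (447417030s/0x0020/CRIT) - Controller encountered a fatal error and was reset",
        "mfi0: Using MSI",
    ]
    for e in non_info_events(sample):
        print(e.seq, e.level, e.msg)
```

Run against the three sample lines taken from this report, it prints only event 11034, the CRIT entry. Driver messages that do not follow the event layout, such as "mfi0: Using MSI", are skipped.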

If this solves the issue, could the above-mentioned changes be backported to 10-STABLE?



>Release-Note:
>Audit-Trail:
>Unformatted:


