Skip site navigation (1)Skip section navigation (2)
Date:      Fri, 05 Sep 2014 17:17:02 -0300
From:      Marcelo Gondim <gondim@bsdinfo.com.br>
To:        Adrian Chadd <adrian@freebsd.org>
Cc:        FreeBSD Net <freebsd-net@freebsd.org>
Subject:   Re: ixgbe CRITICAL: ECC ERROR!! Please Reboot!!
Message-ID:  <540A1A3E.2040306@bsdinfo.com.br>
In-Reply-To: <CAJ-VmomnzTfHzw2i0C01kmfg1rKLePcTHhZqbPWsAezg%2Be9q0g@mail.gmail.com>
References:  <5408F23C.2030309@bsdinfo.com.br>	<CAJ-VmomOECsnRt98ULijSkj9JmwnaLEnZ1ny7fqazsM3WJDNvA@mail.gmail.com>	<54091607.9010100@bsdinfo.com.br>	<5409CA44.8070203@bsdinfo.com.br> <CAJ-VmomnzTfHzw2i0C01kmfg1rKLePcTHhZqbPWsAezg%2Be9q0g@mail.gmail.com>

next in thread | previous in thread | raw e-mail | index | archive | help
On 05/09/2014 16:49, Adrian Chadd wrote:
> Hi,
>
> But is the airflow in the unit sufficient?
>
> I had this problem at a previous job - the box was running fine, the
> room was very cold, but the internal fans in the server were set to
> "be very quiet". It wasn't enough to keep the ixgbe NICs happy. I had
> to change the fan settings to "just always run full speed".
>
> The fan temperature feedback loop was based on sensors on the CPU,
> _not_ on the peripherals.
Hi Adrian,

Ummm. I'll check it and improve internal cooling.  :)
She is not happy and I'm also not. rsrsrsr

Cheers,

>
>
> -a
>
>
> On 5 September 2014 07:35, Marcelo Gondim <gondim@bsdinfo.com.br> wrote:
>> Hi Adrian,
>>
>> I confirmed with the support staff of the room where the server is, that the
>> ambient temperature was normal.
>>
>>
>> On 04/09/2014 22:46, Marcelo Gondim wrote:
>>> On 04/09/2014 20:48, Adrian Chadd wrote:
>>>> Hi,
>>>>
>>>> The only time this has happened to me is because the card overheated.
>>>> Can you check that?
>>> Hi Adrian,
>>>
>>> The room where the equipment is located is very cold but I'll check it
>>> out.
>>> Also seen at the time of the problem, a lot of dropped packets.
>>>
>>> # netstat -idn
>>> ...
>>> ix0    1500 <Link#9>      a0:36:9f:2a:6d:ac 18446743423829095869   159
>>> 750924631703 53285910688     0 0     0
>>> ix0       - fe80::a236:9f fe80::a236:9fff:f        0     - -        2
>>> -     -     -
>>> ix1    1500 <Link#10>     a0:36:9f:2a:6d:ae 18446743954328745465     0
>>> 119550050209 20178077451     0 0     0
>>> ix1       - fe80::a236:9f fe80::a236:9fff:f        0     - -        1
>>> -     -     -
>>> ...
>>>
>>> 119550050209 droped packets on ix1 and 750924631703 droped on ix0
>>>
>>> Could be interesting I upgrade to10.1-PRERELEASE?
>>> Could there be a problem with the driver?
>>>
>>> Traffic on ix0: 1.4Gbps output / 600Mbps input
>>> Traffic on ix1: 1.2Gbps output
>>>
>>> PPS on ix0: 163Kpps output / 215Kpps input
>>> PPS on ix1: 131Kpps output
>>>
>>> Thanks for your help.
>>>>
>>>>
>>>> -a
>>>>
>>>>
>>>> On 4 September 2014 16:14, Marcelo Gondim <gondim@bsdinfo.com.br> wrote:
>>>>> Hi All,
>>>>>
>>>>> I have an Intel X520-SR2and today was working when all traffic stopped.
>>>>> I looked in the logs and found this message:
>>>>>
>>>>> Sep  4 18:29:53 rt01 kernel: ix1:
>>>>> Sep  4 18:29:53 rt01 kernel: CRITICAL: ECC ERROR!! Please Reboot!!
>>>>>
>>>>> # uname -a
>>>>> FreeBSD rt01.xxxxx.com.br 10.0-STABLE FreeBSD 10.0-STABLE #10 r267839:
>>>>> Thu
>>>>> Jul 10 15:35:04 BRT 2014
>>>>> root@rt01.xxxxx.com.br:/usr/obj/usr/src/sys/GONDIM10  amd64
>>>>>
>>>>> # netstat -m
>>>>> 98324/53476/151800 mbufs in use (current/cache/total)
>>>>> 98301/44951/143252/1014370 mbuf clusters in use
>>>>> (current/cache/total/max)
>>>>> 98301/44897 mbuf+clusters out of packet secondary zone in use
>>>>> (current/cache)
>>>>> 0/421/421/507184 4k (page size) jumbo clusters in use
>>>>> (current/cache/total/max)
>>>>> 0/0/0/150276 9k jumbo clusters in use (current/cache/total/max)
>>>>> 0/0/0/84530 16k jumbo clusters in use (current/cache/total/max)
>>>>> 221183K/104955K/326138K bytes allocated to network (current/cache/total)
>>>>> 0/0/0 requests for mbufs denied (mbufs/clusters/mbuf+clusters)
>>>>> 0/0/0 requests for mbufs delayed (mbufs/clusters/mbuf+clusters)
>>>>> 0/0/0 requests for jumbo clusters delayed (4k/9k/16k)
>>>>> 0/0/0 requests for jumbo clusters denied (4k/9k/16k)
>>>>> 0 requests for sfbufs denied
>>>>> 0 requests for sfbufs delayed
>>>>> 0 requests for I/O initiated by sendfile
>>>>>
>>>>> Best regards,
>>>>>
>>>>> Gondim




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?540A1A3E.2040306>