From owner-freebsd-ports@freebsd.org Tue Apr 7 08:27:47 2020
Subject: Re: amdgpu panics
To: freebsd-ports@freebsd.org
From: Grzegorz Junka
Message-ID: <2c1d5679-811f-0b01-f032-7261e9f57259@gjunka.com>
In-Reply-To: <16501c75-24b0-54f6-972c-1a03dfe50276@selasky.org>
Date: Tue, 7 Apr 2020 08:27:43 +0000
List-Id: Porting software to FreeBSD

On 06/04/2020 23:49, Hans Petter Selasky wrote:
> On 2020-04-07 00:07, Grzegorz Junka wrote:
>>
>> Is it possible to at least gather some debug info where this happens?
>> I don't think there is any core dumped if the system doesn't panic?
>
> Can you SSH to this machine and get dmesg?
>

I sent the post-boot dmesg privately, as it was quite long.

One interesting thing I just noticed is that the halt is not a complete halt. The system responds to ping, and an existing ssh user session stayed active, in the sense that I could run ls -l and get a response. But it hung as soon as I tried su. The same happens when initiating any new ssh session: the system responds with a password prompt, but after that nothing happens.

This is the content of the messages log starting at the moment when I try to load the modules:

Apr  7 07:54:30 venus kernel: [drm] amdgpu kernel modesetting enabled.
Apr  7 07:54:30 venus kernel: drmn0: on vgapci0
Apr  7 07:54:30 venus kernel: vgapci0: child drmn0 requested pci_enable_io
Apr  7 07:54:30 venus syslogd: last message repeated 1 times
Apr  7 07:54:30 venus kernel: [drm] initializing kernel modesetting (VEGA10 0x1002:0x687F 0x1002:0x0B36 0xC0).
Apr  7 07:54:30 venus kernel: [drm] register mmio base: 0xFD100000
Apr  7 07:54:30 venus kernel: [drm] register mmio size: 524288
Apr  7 07:54:30 venus kernel: [drm] PCI I/O BAR is not found.
Apr  7 07:54:30 venus kernel: drmn0: successfully loaded firmware image with name: amdgpu/vega10_gpu_info.bin
Apr  7 07:54:30 venus kernel: [drm] probing gen 2 caps for device 1022:1471 = 700d03/e
Apr  7 07:54:30 venus kernel: [drm] probing mlw for device 1002:687f = 400d03
Apr  7 07:54:30 venus kernel: [drm] UVD is enabled in VM mode
Apr  7 07:54:30 venus kernel: [drm] UVD ENC is enabled in VM mode
Apr  7 07:54:30 venus kernel: [drm] VCE enabled in VM mode
Apr  7 07:54:30 venus kernel: ATOM BIOS: 113-D0500500-104
Apr  7 07:54:30 venus kernel: [drm] vm size is 262144 GB, 4 levels, block size is 9-bit, fragment size is 9-bit
Apr  7 07:54:30 venus kernel: drmn0: VRAM: 8176M 0x000000F400000000 - 0x000000F5FEFFFFFF (8176M used)
Apr  7 07:54:30 venus kernel: drmn0: GTT: 256M 0x000000F600000000 - 0x000000F60FFFFFFF
Apr  7 07:54:30 venus kernel: Successfully added WC MTRR for [0xe0000000-0xefffffff]: 0;
Apr  7 07:54:30 venus kernel: [drm] Detected VRAM RAM=8176M, BAR=256M
Apr  7 07:54:30 venus kernel: [drm] RAM width 2048bits HBM
Apr  7 07:54:30 venus kernel: [TTM] Zone  kernel: Available graphics memory: 33495488 kiB
Apr  7 07:54:30 venus kernel: [TTM] Zone   dma32: Available graphics memory: 2097152 kiB
Apr  7 07:54:30 venus kernel: [TTM] Initializing pool allocator
Apr  7 07:54:30 venus kernel: [drm] amdgpu: 8176M of VRAM memory ready
Apr  7 07:54:30 venus kernel: [drm] amdgpu: 8176M of GTT memory ready.
Apr  7 07:54:30 venus kernel: i_size_write unimplemented
Apr  7 07:54:30 venus kernel: [drm] GART: num cpu pages 65536, num gpu pages 65536
Apr  7 07:54:30 venus kernel: [drm] PCIE GART of 256M enabled (table at 0x000000F400800000).
Apr  7 07:54:31 venus kernel: drmn0: successfully loaded firmware image with name: amdgpu/vega10_sos.bin
Apr  7 07:54:32 venus kernel: drmn0: successfully loaded firmware image with name: amdgpu/vega10_asd.bin
Apr  7 07:54:32 venus kernel: drmn0: successfully loaded firmware image with name: amdgpu/vega10_acg_smc.bin
Apr  7 07:54:33 venus kernel: drmn0: successfully loaded firmware image with name: amdgpu/vega10_pfp.bin
Apr  7 07:54:33 venus kernel: drmn0: successfully loaded firmware image with name: amdgpu/vega10_me.bin
Apr  7 07:54:34 venus kernel: drmn0: successfully loaded firmware image with name: amdgpu/vega10_ce.bin
Apr  7 07:54:34 venus kernel: drmn0: successfully loaded firmware image with name: amdgpu/vega10_rlc.bin
Apr  7 07:54:35 venus kernel: drmn0: successfully loaded firmware image with name: amdgpu/vega10_mec.bin
Apr  7 07:54:35 venus kernel: drmn0: successfully loaded firmware image with name: amdgpu/vega10_mec2.bin
Apr  7 07:54:35 venus kernel: i_size_write unimplemented
Apr  7 07:54:35 venus syslogd: last message repeated 9 times
Apr  7 07:54:36 venus kernel: drmn0: successfully loaded firmware image with name: amdgpu/vega10_sdma.bin
Apr  7 07:54:36 venus kernel: drmn0: successfully loaded firmware image with name: amdgpu/vega10_sdma1.bin
Apr  7 07:54:36 venus kernel: [drm] use_doorbell being set to: [true]
Apr  7 07:54:36 venus kernel: i_size_write unimplemented
Apr  7 07:54:36 venus kernel: [drm] use_doorbell being set to: [true]
Apr  7 07:54:36 venus kernel: i_size_write unimplemented
Apr  7 07:54:37 venus kernel: drmn0: successfully loaded firmware image with name: amdgpu/vega10_uvd.bin
Apr  7 07:54:37 venus kernel: [drm] Found UVD firmware Version: 65.29 Family ID: 17
Apr  7 07:54:37 venus kernel: [drm] PSP loading UVD firmware
Apr  7 07:54:37 venus kernel: i_size_write unimplemented
Apr  7 07:54:37 venus syslogd: last message repeated 2 times
Apr  7 07:54:37 venus kernel: drmn0: successfully loaded firmware image with name: amdgpu/vega10_vce.bin
Apr  7 07:54:37 venus kernel: [drm] Found VCE firmware Version: 57.4 Binary ID: 4
Apr  7 07:54:37 venus kernel: [drm] PSP loading VCE firmware
Apr  7 07:54:37 venus kernel: i_size_write unimplemented
Apr  7 07:54:37 venus syslogd: last message repeated 2 times
Apr  7 07:54:38 venus kernel: [drm] Display Core initialized with v3.1.27!
Apr  7 07:54:38 venus kernel: [drm] Connector DP-1: get mode from tunables:
Apr  7 07:54:38 venus kernel: [drm]   - kern.vt.fb.modes.DP-1
Apr  7 07:54:38 venus kernel: [drm]   - kern.vt.fb.default_mode
Apr  7 07:54:38 venus kernel: [drm] Connector DP-2: get mode from tunables:
Apr  7 07:54:38 venus kernel: [drm]   - kern.vt.fb.modes.DP-2
Apr  7 07:54:38 venus kernel: [drm]   - kern.vt.fb.default_mode
Apr  7 07:54:38 venus kernel: [drm] Connector DP-3: get mode from tunables:
Apr  7 07:54:38 venus kernel: [drm]   - kern.vt.fb.modes.DP-3
Apr  7 07:54:38 venus kernel: [drm]   - kern.vt.fb.default_mode

GrzegorzJ