Welcome, Guest. Please login or register.
Did you miss your activation email?

Author Topic: [EN] AMD driver bug since 6.3.9  (Read 2960 times)

Offline Helix

  • Newbie
  • Posts: 2
[EN] AMD driver bug since 6.3.9
« on: 2023/07/04, 20:52:12 »
Since kernel version 3.6.9 up until 6.4.1, the kernel have a null pointer dereference when running on AMD graphics cards.

Here is my dmesg:
Code: [Select]
[    4.550426] BUG: kernel NULL pointer dereference, address: 000000000000000a
[    4.550431] #PF: supervisor read access in kernel mode
[    4.550433] #PF: error_code(0x0000) - not-present page

I've gotten bite by this bug and other people too: https://gitlab.freedesktop.org/drm/amd/-/issues/2669

Work-around: use some older kernel. Here, I'm using 6.3.4.

Online towo

  • Administrator
  • User
  • *****
  • Posts: 2.939
Re: AMD driver bug since 6.3.9
« Reply #1 on: 2023/07/04, 20:56:18 »
Code: [Select]
~
towo:Defiant> LANG=C dmesg | grep BUG

~
towo:Defiant> LANG=C dmesg | grep -i null

Code: [Select]
~
towo:Defiant> inxi -SG
System:    Host: Defiant Kernel: 6.4.0-0-siduction-amd64 arch: x86_64 bits: 64 Desktop: KDE Plasma v: 5.27.5 Distro: siduction 18.3.0 Patience - kde - (202010061355)
Graphics:  Device-1: Advanced Micro Devices [AMD/ATI] Navi 23 [Radeon RX 6600/6600 XT/6600M] driver: amdgpu v: kernel
           Display: wayland server: X.org v: 1.21.1.7 with: Xwayland v: 23.1.1 compositor: kwin_wayland driver: X: loaded: amdgpu
             unloaded: fbdev,modesetting,radeon,vesa dri: radeonsi gpu: amdgpu resolution: 1: 1920x1080 2: 1920x1080
           API: OpenGL v: 4.6 Mesa 23.2.0-devel renderer: AMD Radeon RX 6600 XT (navi23 LLVM 15.0.7 DRM 3.52 6.4.0-0-siduction-amd64)

So please, no generalisations.
Ich gehe nicht zum Karneval, ich verleihe nur manchmal mein Gesicht.

Offline Helix

  • Newbie
  • Posts: 2
Re: AMD driver bug since 6.3.9
« Reply #2 on: 2023/07/05, 19:18:13 »
Sorry for the generalization, towo.

It seems to be an issue related to some integrated graphics in AMD chips, not so much related to discrete GPUs.

People using Manjaro have too reported instability in recent kernel versions: https://forum.manjaro.org/t/unstable-update-2023-07-02-mesa-kernels-amdgpu-stuff/143425

People using Endeavor too: https://forum.endeavouros.com/t/if-you-have-problems-with-kernel-6-4-1-report-here-please/42756/26

Here is the remaining log, for documentation:
Code: [Select]
Jul  4 10:58:43 helixbox kernel: BUG: kernel NULL pointer dereference, address: 0000000000000208
Jul  4 10:58:43 helixbox kernel: #PF: supervisor instruction fetch in kernel mode
Jul  4 10:58:43 helixbox kernel: #PF: error_code(0x0010) - not-present page
Jul  4 10:58:43 helixbox kernel: PGD 0 P4D 0
Jul  4 10:58:43 helixbox kernel: Oops: 0010 [#2] PREEMPT SMP NOPTI
Jul  4 10:58:43 helixbox kernel: CPU: 2 PID: 768 Comm: systemd-logind Tainted: G      D    O       6.3.11-1-siduction-amd64 #1  siduction 6.3-11
Jul  4 10:58:43 helixbox kernel: Hardware name: LENOVO 20U6002TBO/20U6002TBO, BIOS R19ET38W (1.22 ) 11/07/2021
Jul  4 10:58:43 helixbox kernel: RIP: 0010:0x208
Jul  4 10:58:43 helixbox kernel: Code: Unable to access opcode bytes at 0x1de.
Jul  4 10:58:43 helixbox kernel: RSP: 0018:ffffc90001047d80 EFLAGS: 00010202
Jul  4 10:58:43 helixbox kernel: RAX: 0000000000000000 RBX: ffff8881049631b0 RCX: 00000000004deb02
Jul  4 10:58:43 helixbox kernel: RDX: 0000000000000208 RSI: ffff8881f671c000 RDI: ffff8881049631b0
Jul  4 10:58:43 helixbox kernel: RBP: ffff8881f671c000 R08: 000000000000000d R09: ffff8882f671c236
Jul  4 10:58:43 helixbox kernel: R10: ffffffffffffffff R11: 00000000ffffffff R12: ffff888104b1a000
Jul  4 10:58:43 helixbox kernel: R13: ffff8881f671c000 R14: 0000000000000001 R15: 0000000000000001
Jul  4 10:58:43 helixbox kernel: FS:  00007f5ab4ac14c0(0000) GS:ffff8883ff480000(0000) knlGS:0000000000000000
Jul  4 10:58:43 helixbox kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul  4 10:58:43 helixbox kernel: CR2: 0000000000000208 CR3: 000000010a2ba000 CR4: 0000000000350ee0
Jul  4 10:58:43 helixbox kernel: Call Trace:
Jul  4 10:58:43 helixbox kernel:  <TASK>
Jul  4 10:58:43 helixbox kernel:  ? __die+0x1a/0x60
Jul  4 10:58:43 helixbox kernel:  ? page_fault_oops+0x158/0x460
Jul  4 10:58:43 helixbox kernel:  ? exc_page_fault+0x346/0x590
Jul  4 10:58:43 helixbox kernel:  ? asm_exc_page_fault+0x22/0x30
Jul  4 10:58:43 helixbox kernel:  ? dev_uevent+0xa5/0x200
Jul  4 10:58:43 helixbox kernel:  ? uevent_show+0x86/0xf0
Jul  4 10:58:43 helixbox kernel:  ? dev_attr_show+0x13/0x50
Jul  4 10:58:43 helixbox kernel:  ? sysfs_kf_seq_show+0x9e/0xf0
Jul  4 10:58:43 helixbox kernel:  ? seq_read_iter+0x11d/0x470
Jul  4 10:58:43 helixbox kernel:  ? vfs_read+0x1ef/0x2c0
Jul  4 10:58:43 helixbox kernel:  ? ksys_read+0x5e/0xe0
Jul  4 10:58:43 helixbox kernel:  ? do_syscall_64+0x3a/0x90
Jul  4 10:58:43 helixbox kernel:  ? entry_SYSCALL_64_after_hwframe+0x63/0xcd
Jul  4 10:58:43 helixbox kernel:  </TASK>
Jul  4 10:58:43 helixbox kernel: Modules linked in: hid_generic hidp rfcomm snd_seq(+) snd_seq_device cpufreq_userspace vboxdrv(O) cpufreq_conservative cpufreq_powersave ccm des_generic libdes md4 qrtr ip6t_REJECT nf_reject_ipv6 xt_hl cmac bnep ip6_tables ip6t_rt ipt_REJECT nf_reject_ipv4 xt_LOG nf_log_syslog xt_multiport nft_limit xt_limit xt_addrtype xt_tcpudp xt_conntrack nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nft_compat nf_tables binfmt_misc nfnetlink nls_utf8 nls_cp437 vfat fat joydev btusb btrtl btbcm btintel btmtk bluetooth snd_ctl_led snd_hda_codec_realtek snd_hda_codec_generic snd_hda_codec_hdmi intel_rapl_msr amdgpu intel_rapl_common snd_hda_intel rtw89_8852ae snd_pci_ps snd_intel_dspcfg rtw89_8852a snd_rpl_pci_acp6x snd_intel_sdw_acpi rtw89_pci edac_mce_amd iommu_v2 snd_hda_codec snd_acp_pci gpu_sched rtw89_core snd_pci_acp6x snd_hda_core drm_buddy snd_pci_acp5x kvm_amd i2c_algo_bit snd_rn_pci_acp3x drm_ttm_helper snd_hwdep kvm irqbypass snd_pcm_oss mac80211 think_lmi snd_acp_config libarc4 thinkpad_acpi tpm_crb rapl
Jul  4 10:58:43 helixbox kernel:  snd_mixer_oss firmware_attributes_class wmi_bmof pcspkr ttm snd_soc_acpi nvram snd_pcm ledtrig_audio drm_display_helper snd_pci_acp3x k10temp ipmi_devintf snd_timer platform_profile cfg80211 cec snd ucsi_acpi ipmi_msghandler typec_ucsi soundcore ac tpm_tis roles rfkill tpm_tis_core typec acpi_cpufreq button evdev serio_raw uvcvideo uvc videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 videodev videobuf2_common mc msr parport_pc ppdev lp nfsd parport auth_rpcgss nfs_acl lockd grace fuse configfs sunrpc tpm rng_core ip_tables x_tables raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx raid1 raid0 multipath linear md_mod sdhci_pci xhci_pci r8169 cqhci xhci_pci_renesas polyval_clmulni polyval_generic psmouse realtek sdhci ehci_pci mdio_devres i2c_piix4 xhci_hcd mmc_core libphy ehci_hcd video battery wmi i2c_scmi [last unloaded: vboxnetflt(O)]
Jul  4 10:58:43 helixbox kernel: CR2: 0000000000000208
Jul  4 10:58:43 helixbox kernel: ---[ end trace 0000000000000000 ]---
Jul  4 10:58:43 helixbox kernel: RIP: 0010:internal_create_groups+0x10/0xa0
Jul  4 10:58:43 helixbox kernel: Code: 86 fc ff ff 66 0f 1f 44 00 00 48 89 f2 be 01 00 00 00 e9 73 fc ff ff 0f 1f 00 41 56 41 55 41 54 55 53 48 85 d2 74 78 49 89 d5 <48> 8b 12 48 85 d2 74 6d 49 89 fc 41 89 f6 31 ed eb 10 83 c5 01 48
Jul  4 10:58:43 helixbox kernel: RSP: 0018:ffffc90001227d00 EFLAGS: 00010206
Jul  4 10:58:43 helixbox kernel: RAX: 0000000000000000 RBX: ffffffffa0c79440 RCX: 0000000000000000
Jul  4 10:58:43 helixbox kernel: RDX: 000000000000000a RSI: 0000000000000000 RDI: ffffffffa0c79440
Jul  4 10:58:43 helixbox kernel: RBP: 0000000000000000 R08: 0000000000000228 R09: ffff88810004ad90
Jul  4 10:58:43 helixbox kernel: R10: ffff88812ec1ddd0 R11: 00000000ffffffff R12: 0000000000000000
Jul  4 10:58:43 helixbox kernel: R13: 000000000000000a R14: 0000000000000000 R15: ffffffffa0c73280
Jul  4 10:58:43 helixbox kernel: FS:  00007f5ab4ac14c0(0000) GS:ffff8883ff480000(0000) knlGS:0000000000000000
Jul  4 10:58:43 helixbox kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul  4 10:58:43 helixbox kernel: CR2: 0000000000000208 CR3: 000000010a2ba000 CR4: 0000000000350ee0
« Last Edit: 2023/07/05, 19:25:19 by Helix »