Welcome, Guest. Please login or register.
Did you miss your activation email?

Author Topic: [EN] kernel 6.12 not working with nvidia-tesla-470  (Read 1652 times)

Offline michaa7

  • User
  • Posts: 2.415
[EN] kernel 6.12 not working with nvidia-tesla-470
« on: 2024/12/06, 20:49:36 »
Kernel 6.12 is not working with nvidia-tesla-470 driver. System boots with nvidia errors and without X.

I don't remember the path to the  nvidia-470-kernel-xyz-make.log and can't provide any make error until someone gives me a hint where to look.

Found it @ /var/lib/dkms/nvidia-tesla-470/470.256.02/build/make.log:
Quote
/var/lib/dkms/nvidia-tesla-470/470.256.02/build/nvidia-drm/nvidia-drm-drv.c:171:6: error: ‘const struct drm_mode_config_funcs’ has no member named ‘output_poll_changed’
  171 |     .output_poll_changed = nv_drm_output_poll_changed,
      |      ^~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia-tesla-470/470.256.02/build/nvidia-drm/nvidia-drm-drv.c:171:28: error: initialization of ‘struct drm_atomic_state * (*)(struct drm_device *)’ from incompatible pointer type ‘void (*)(struct drm_device *)’ [-Wincompatible-pointer-types]
  171 |     .output_poll_changed = nv_drm_output_poll_changed,


Kernel 6.11 still working.

EDIT:

At nvidia forum there is a related posting, still unanswered:
https://forums.developer.nvidia.com/t/driver-470-256-02-o9n-linux-6-12-0-vanilla/313761

And somewhere else, there is a patch, but I don't know how reliable the source is:
https://forum.siduction.org/index.php?action=post;msg=75869;topic=9532.0

I wonder whether this patch will become "official", as nvidia has stopped support for the tesla-470 serie with driver-version 470.256.02 .


« Last Edit: 2024/12/07, 13:54:44 by michaa7 »
Ok, you can't code, but you still might be able to write a bug report for Debian's sake

Offline DeepDayze

  • User
  • Posts: 479
Re: kernel 6.12 not working with nvidia-tesla-470
« Reply #1 on: 2024/12/08, 04:37:55 »
Seems there needs to be an update to nvidia drivers so that they can work on 6.12 kernels. For now I locked my system at 6.11 so that there be no further updates to kernel till this is resolved.

Offline michaa7

  • User
  • Posts: 2.415
Re: kernel 6.12 not working with nvidia-tesla-470
« Reply #2 on: 2024/12/08, 11:24:52 »
It seems it won't be resolved for nvidia-tesla-470. It has reached EOL  :(
Ok, you can't code, but you still might be able to write a bug report for Debian's sake

Offline towo

  • Administrator
  • User
  • *****
  • Posts: 3.019
Re: kernel 6.12 not working with nvidia-tesla-470
« Reply #3 on: 2024/12/08, 14:18:03 »
I have uploaded patched 470 nvidia packages to our fixes repo.
Since i can't test it, let me know, if they are working.
Ich gehe nicht zum Karneval, ich verleihe nur manchmal mein Gesicht.

Offline dibl

  • siduction community member
  • Global Moderator
  • User
  • *****
  • Posts: 2.420
    • Land of the Buckeye
Re: kernel 6.12 not working with nvidia-tesla-470
« Reply #4 on: 2024/12/08, 14:26:17 »
For the benefit of others, Nvidia driver 550.135 is working well with recent kernels for the applicable graphics hardware:

Code: [Select]
System:
  Host: Hibiscus Kernel: 6.12.3-1-siduction-amd64 arch: x86_64 bits: 64
  Desktop: KDE Plasma v: 6.2.4 Distro: siduction 22.1.2 Masters_of_War -
    kde - (202303151559)
CPU:
  Info: single core Intel Core i7-4770 [N/A] speed (MHz): 3501
    min/max: 800/3900
Graphics:
  Device-1: NVIDIA GM107 [GeForce GTX 750 Ti] driver: nvidia v: 550.135
  Display: wayland server: X.org v: 1.21.1.14 with: Xwayland v: 24.1.4
    compositor: kwin_wayland driver: X: loaded: nvidia
    unloaded: fbdev,modesetting,nouveau,vesa gpu: nvidia resolution: 1366x768
  API: EGL v: 1.5 drivers: nvidia,swrast,zink
    platforms: gbm,wayland,x11,surfaceless,device
  API: OpenGL v: 4.6.0 compat-v: 4.5 vendor: nvidia mesa v: 550.135
    renderer: NVIDIA GeForce GTX 750 Ti/PCIe/SSE2
  API: Vulkan v: 1.3.296 drivers: N/A surfaces: xcb,xlib,wayland
 
 
Code: [Select]
System:
  Host: hibiscus Kernel: 6.12.3-1-siduction-amd64 arch: x86_64 bits: 64
  Desktop: KDE Plasma v: 6.2.4 Distro: siduction 2023.1.1 giants - kde -
    (202405290730)
CPU:
  Info: quad core Intel Core i7-7740X [MT MCP] speed (MHz): avg: 800
    min/max: 800/4500
Graphics:
  Device-1: NVIDIA GP106 [GeForce GTX 1060 6GB] driver: nvidia v: 550.135
  Display: wayland server: X.org v: 1.21.1.14 with: Xwayland v: 24.1.4
    compositor: kwin_wayland driver: X: loaded: nvidia
    unloaded: fbdev,modesetting,nouveau,vesa gpu: nvidia resolution:
    1: 1920x1200 2: 1920x1080
  API: EGL v: 1.5 drivers: nvidia,swrast,zink
    platforms: gbm,wayland,x11,surfaceless,device
  API: OpenGL v: 4.6.0 compat-v: 4.5 vendor: nvidia mesa v: 550.135
    renderer: NVIDIA GeForce GTX 1060 6GB/PCIe/SSE2
  API: Vulkan v: 1.3.296 drivers: N/A surfaces: xcb,xlib,wayland
System76 Oryx Pro, Intel Core i7-11800H, SSD 970 EVO Plus;  Asus ROG STRIX X299-E, Core i7-7740X, Nvidia GTX-1060, dual monitors, SSD 860 EVO

Offline michaa7

  • User
  • Posts: 2.415
Re: kernel 6.12 not working with nvidia-tesla-470
« Reply #5 on: 2024/12/08, 16:13:04 »
I have uploaded patched 470 nvidia packages to our fixes repo.
Since i can't test it, let me know, if they are working.

Thanks, towo, for your work.

Unfortunatly they are not working here. The build make.log shows no errors, but the modul can't be loaded during boot, and boot does not start X.

The make.loig path has chaned to
Quote
/var/lib/dkms/nvidia-tesla-470/470.256.02/6.12.3-amd64/x86_64/log/make.log
, the last lines show no mention of an error:
Quote
# BTF [M] /var/lib/dkms/nvidia-tesla-470/470.256.02/build/nvidia-peermem.ko
  if [ ! -f vmlinux ]; then printf "Skipping BTF generation for %s due to unavailability of vmlinux
" /var/lib/dkms/nvidia-tesla-470/470.256.02/build/nvidia-peermem.ko 1>&2; else LLVM_OBJCOPY="x86_64-linux-gnu-objcopy" pahole -J -j --btf_features=encode_force,var,float,enum64,decl_tag,type_tag,optimized_func,consistent_func,decl_tag_kfuncs -j --lang_exclude=rust --btf_features=distilled_base --btf_base vmlinux /var/lib/dkms/nvidia-tesla-470/470.256.02/build/nvidia-peermem.ko; ./tools/bpf/resolve_btfids/resolve_btfids -b vmlinux /var/lib/dkms/nvidia-tesla-470/470.256.02/build/nvidia-peermem.ko; fi;
Skipping BTF generation for /var/lib/dkms/nvidia-tesla-470/470.256.02/build/nvidia-peermem.ko due to unavailability of vmlinux
# BTF [M] /var/lib/dkms/nvidia-tesla-470/470.256.02/build/nvidia-drm.ko
  if [ ! -f vmlinux ]; then printf "Skipping BTF generation for %s due to unavailability of vmlinux
" /var/lib/dkms/nvidia-tesla-470/470.256.02/build/nvidia-drm.ko 1>&2; else LLVM_OBJCOPY="x86_64-linux-gnu-objcopy" pahole -J -j --btf_features=encode_force,var,float,enum64,decl_tag,type_tag,optimized_func,consistent_func,decl_tag_kfuncs -j --lang_exclude=rust --btf_features=distilled_base --btf_base vmlinux /var/lib/dkms/nvidia-tesla-470/470.256.02/build/nvidia-drm.ko; ./tools/bpf/resolve_btfids/resolve_btfids -b vmlinux /var/lib/dkms/nvidia-tesla-470/470.256.02/build/nvidia-drm.ko; fi;
Skipping BTF generation for /var/lib/dkms/nvidia-tesla-470/470.256.02/build/nvidia-drm.ko due to unavailability of vmlinux
make[2]: Verzeichnis „/usr/src/linux-headers-6.12.3-amd64“ wird verlassen
make[1]: Verzeichnis „/usr/src/linux-headers-6.12.3-common“ wird verlassen
+ [ modules = modules ]
+ [ -x split-object-file.sh ]
+ [ -x split-object-file.sh ]
+ [ -x split-object-file.sh ]
+ [ -x split-object-file.sh ]
+ [ -x split-object-file.sh ]

If you need the whole make.log let me know where and how to poste it.

Thanks / Danke.
Ok, you can't code, but you still might be able to write a bug report for Debian's sake

Offline towo

  • Administrator
  • User
  • *****
  • Posts: 3.019
Re: kernel 6.12 not working with nvidia-tesla-470
« Reply #6 on: 2024/12/08, 17:44:10 »
The make log is useless, if the module was build and is loadable.
Ich gehe nicht zum Karneval, ich verleihe nur manchmal mein Gesicht.

Offline michaa7

  • User
  • Posts: 2.415
Re: kernel 6.12 not working with nvidia-tesla-470
« Reply #7 on: 2024/12/08, 18:49:29 »
The make log is useless, if the module was build and is loadable.

Ich mache ich auf deutsch weiter:

Das Modul wird gebaut, aber  so wie ich das sehe nicht geladen. Das gilt für den reinen Debian Kernel ebenso wie für den siduction kernel.

Ich lande immer im Terminal, nicht im loginmanager. Ich habe eben mit "startx" einen versuch gemacht. Damit startet eine X-session *ohne login*, ich lande jedoch im falschen DE (lxqt) mit root privilegien!!! Standart wäre fluxbox.

Ok, ich kannn start ich vom terminalpromt auch als user erfolgreich ausführen. Das startet aber auch nciht mein standartsession. Ich konnte jedoch aus "journalctl | grep nvidia" folgendes kopieren:
Quote
Dez 08 18:51:20 dellopti systemd-modules-load[370]: modprobe: FATAL: Module nvidia-tesla-470 not found in directory /lib/modules/6.12.3-1-siduction-amd64
Dez 08 18:51:20 dellopti systemd-modules-load[368]: modprobe: ERROR: ../libkmod/libkmod-module.c:1072 command_do() Error running install command 'modprobe -i nvidia-tesla-470 ' for module nvidia: retcode 1
Dez 08 18:51:20 dellopti systemd-modules-load[368]: modprobe: ERROR: could not insert 'nvidia': Invalid argument
Dez 08 18:51:20 dellopti systemd-modules-load[371]: modprobe: FATAL: Module nvidia-tesla-470-modeset not found in directory /lib/modules/6.12.3-1-siduction-amd64
Dez 08 18:51:20 dellopti systemd-modules-load[366]: modprobe: ERROR: ../libkmod/libkmod-module.c:1072 command_do() Error running install command 'modprobe nvidia ; modprobe -i nvidia-tesla-470-modeset ' for module nvidia_modeset: retcode 1
Dez 08 18:51:20 dellopti systemd-modules-load[366]: modprobe: ERROR: could not insert 'nvidia_modeset': Invalid argument
Dez 08 18:51:20 dellopti systemd-modules-load[372]: modprobe: FATAL: Module nvidia-tesla-470-drm not found in directory /lib/modules/6.12.3-1-siduction-amd64
Dez 08 18:51:20 dellopti systemd-modules-load[275]: Error running install command 'modprobe nvidia-modeset ; modprobe -i nvidia-tesla-470-drm ' for module nvidia_drm: retcode 1
Dez 08 18:51:20 dellopti systemd-modules-load[275]: Failed to insert module 'nvidia_drm': Invalid argument
Dez 08 18:51:20 dellopti systemd-udevd[379]: modprobe: FATAL: Module nvidia-tesla-470 not found in directory /lib/modules/6.12.3-1-siduction-amd64
Dez 08 18:51:20 dellopti (udev-worker)[347]: Error running install command 'modprobe -i nvidia-tesla-470 ' for module nvidia: retcode 1
Dez 08 18:51:21 dellopti systemd-modules-load[479]: modprobe: FATAL: Module nvidia-tesla-470 not found in directory /lib/modules/6.12.3-1-siduction-amd64
Dez 08 18:51:21 dellopti systemd-modules-load[477]: modprobe: ERROR: ../libkmod/libkmod-module.c:1072 command_do() Error running install command 'modprobe -i nvidia-tesla-470 ' for module nvidia: retcode 1
Dez 08 18:51:21 dellopti systemd-modules-load[477]: modprobe: ERROR: could not insert 'nvidia': Invalid argument
Dez 08 18:51:21 dellopti systemd-modules-load[480]: modprobe: FATAL: Module nvidia-tesla-470-modeset not found in directory /lib/modules/6.12.3-1-siduction-amd64
Dez 08 18:51:21 dellopti systemd-modules-load[475]: modprobe: ERROR: ../libkmod/libkmod-module.c:1072 command_do() Error running install command 'modprobe nvidia ; modprobe -i nvidia-tesla-470-modeset ' for module nvidia_modeset: retcode 1
Dez 08 18:51:21 dellopti systemd-modules-load[475]: modprobe: ERROR: could not insert 'nvidia_modeset': Invalid argument
Dez 08 18:51:21 dellopti systemd-modules-load[481]: modprobe: FATAL: Module nvidia-tesla-470-drm not found in directory /lib/modules/6.12.3-1-siduction-amd64
Dez 08 18:51:21 dellopti systemd-modules-load[467]: Error running install command 'modprobe nvidia-modeset ; modprobe -i nvidia-tesla-470-drm ' for module nvidia_drm: retcode 1
Dez 08 18:51:21 dellopti systemd-modules-load[467]: Failed to insert module 'nvidia_drm': Invalid argument
Dez 08 18:51:28 dellopti kernel: audit: type=1400 audit(1733680285.109:5): apparmor="STATUS" operation="profile_load" profile="unconfined" name="nvidia_modprobe" pid=549 comm="apparmor_parser"
Dez 08 18:51:28 dellopti kernel: audit: type=1400 audit(1733680285.109:6): apparmor="STATUS" operation="profile_load" profile="unconfined" name="nvidia_modprobe//kmod" pid=549 comm="apparmor_parser"
Dez 08 18:51:30 dellopti systemd[1]: Starting nvidia-persistenced.service - NVIDIA Persistence Daemon...
Dez 08 18:51:30 dellopti nvidia-persistenced[693]: Started (693)
Dez 08 18:51:30 dellopti nvidia-persistenced[693]: Failed to query NVIDIA devices. Please ensure that the NVIDIA device files (/dev/nvidia*) exist, and that user 118 has read and write permissions for those files.
Dez 08 18:51:30 dellopti nvidia-persistenced[693]: Shutdown (693)
Dez 08 18:51:30 dellopti nvidia-persistenced[689]: nvidia-persistenced failed to initialize. Check syslog for more details.
Dez 08 18:51:30 dellopti systemd[1]: nvidia-persistenced.service: Control process exited, code=exited, status=1/FAILURE
Dez 08 18:51:30 dellopti systemd[1]: nvidia-persistenced.service: Failed with result 'exit-code'.
Dez 08 18:51:30 dellopti systemd[1]: Failed to start nvidia-persistenced.service - NVIDIA Persistence Daemon.
mh@dellopti:~$

« Last Edit: 2024/12/08, 18:56:40 by michaa7 »
Ok, you can't code, but you still might be able to write a bug report for Debian's sake

Offline towo

  • Administrator
  • User
  • *****
  • Posts: 3.019
Re: kernel 6.12 not working with nvidia-tesla-470
« Reply #8 on: 2024/12/08, 21:08:31 »
... das Ganze benötigt wohl drm enabled.

For the 500x drivers, the kernel parameter "nvidia-drm.modeset=1" is required, in /etc/default/grub.
« Last Edit: 2024/12/08, 21:34:18 by dibl »
Ich gehe nicht zum Karneval, ich verleihe nur manchmal mein Gesicht.

Offline dibl

  • siduction community member
  • Global Moderator
  • User
  • *****
  • Posts: 2.420
    • Land of the Buckeye
Re: kernel 6.12 not working with nvidia-tesla-470
« Reply #9 on: 2024/12/08, 21:35:36 »
OOOppps, I did not mean to overwrite on towo's post. Entschuldigung, bitte.

(My ADMIN power should be removed, before I do some other damage!)
System76 Oryx Pro, Intel Core i7-11800H, SSD 970 EVO Plus;  Asus ROG STRIX X299-E, Core i7-7740X, Nvidia GTX-1060, dual monitors, SSD 860 EVO

Offline michaa7

  • User
  • Posts: 2.415
Re: kernel 6.12 not working with nvidia-tesla-470
« Reply #10 on: 2024/12/08, 22:22:29 »
... das Ganze benötigt wohl drm enabled.

For the 500x drivers, the kernel parameter "nvidia-drm.modeset=1" is required, in /etc/default/grub.

and for 470? Is there anything I can do?

And please, how do I start a dkms nvidia modul built manually?
Ok, you can't code, but you still might be able to write a bug report for Debian's sake

Offline dibl

  • siduction community member
  • Global Moderator
  • User
  • *****
  • Posts: 2.420
    • Land of the Buckeye
Re: kernel 6.12 not working with nvidia-tesla-470
« Reply #11 on: 2024/12/08, 23:05:28 »

And please, how do I start a dkms nvidia modul built manually?

If that is possible, it is way over my head. Sorry. If it was important to me, I would ask @damentz at liquorix.
System76 Oryx Pro, Intel Core i7-11800H, SSD 970 EVO Plus;  Asus ROG STRIX X299-E, Core i7-7740X, Nvidia GTX-1060, dual monitors, SSD 860 EVO

Offline michaa7

  • User
  • Posts: 2.415
Re: kernel 6.12 not working with nvidia-tesla-470
« Reply #12 on: 2024/12/09, 00:21:40 »

And please, how do I start a dkms nvidia modul built manually?

If that is possible, it is way over my head. Sorry. If it was important to me, I would ask @damentz at liquorix.

May be I am wrong, but I seem to recall that it's a one-liner which towo posted some time ago ... but one can trigger a module built by reinstalling the kernel. But I simply wanted to know how the elegant way of doing it would be ...
« Last Edit: 2024/12/09, 02:08:27 by michaa7 »
Ok, you can't code, but you still might be able to write a bug report for Debian's sake

Offline der_bud

  • User
  • Posts: 1.100
  • member
Re: kernel 6.12 not working with nvidia-tesla-470
« Reply #13 on: 2024/12/09, 08:17:26 »
Falls Du nicht "update-initramfs -u -k all" meinst (was glaube ich keinen dkms Neubau anstößt?) sondern wirklich dkms, kurze Übersicht hier:  https://blog.rabin.io/linux/manually-run-dkms-to-build-kernel-modules
Du lachst? Wieso lachst du? Das ist doch oft so, Leute lachen erst und dann sind sie tot.

Offline Pip

  • User
  • Posts: 192
Re: kernel 6.12 not working with nvidia-tesla-470
« Reply #14 on: 2024/12/14, 19:33:28 »
I have

Code: [Select]
dpkg -l | grep dkms
ii  dkms                                                     3.1.0-1                                  all          Dynamic Kernel Module System (DKMS)
ii  nvidia-tesla-470-kernel-dkms                             470.256.02-5.siduction.1                 amd64        NVIDIA binary kernel module DKMS source (Tesla 470 version)
ii  virtualbox-dkms                                          7.0.20-dfsg-1+b1                         amd64        x86 virtualization solution - kernel module sources for dkms

When I boot, the output of journalctl -xe | grep nvidia (from another terminal when the first freezes after "Reached target graphical.target - Graphical Interface" and "Finished checkrootkit.service - checkrootkit") is

Code: [Select]
sidomen kernel: audit: type=1400 audit(1734111349.795:5): apparmor="STATUS" operation="profile_load" profile="unconfined" name="nvidia_modprobe" pid=833 comm="apparmor_parser"
sidomen kernel: audit: type=1400 audit(1734111349.795:6): apparmor="STATUS" operation="profile_load" profile="unconfined" name="nvidia_modprobe//kmod" pid=833 comm="apparmor_parser"
sidomen kernel: nvidia-modeset: Loading NVIDIA Kernel Mode Setting Driver for UNIX platforms  470.256.02  Thu May  2 14:50:40 UTC 2024
sidomen kernel: [drm] [nvidia-drm] [GPU ID 0x00000100] Loading driver
sidomen kernel: [drm] Initialized nvidia-drm 0.0.0 for 0000:01:00.0 on minor 0
sidomen kernel: Failed to initialize the nv-hotplug-helper DRM client (ensure DRM kernel mode setting is enabled via nvidia-drm.modeset=1).
sidomen kernel: [drm] [nvidia-drm] [GPU ID 0x00000100] Unloading driver
sidomen systemd-modules-load[399]: Inserted module 'nvidia_drm'
sidomen systemd[1]: Starting nvidia-persistenced.service - NVIDIA Persistence Daemon...
░░ Subject: A start job for unit nvidia-persistenced.service has begun execution
░░ A start job for unit nvidia-persistenced.service has begun execution.
sidomen nvidia-persistenced[1021]: Started (1021)
sidomen nvidia-persistenced[1021]: NUMA: Failed ioctl call to get device NUMA Info: Invalid argument
sidomen nvidia-persistenced[1021]: device 0000:01:00.0 - NUMA: Failed to get device NUMA info
sidomen nvidia-persistenced[1021]: device 0000:01:00.0 - failed to online memory.
sidomen systemd[1]: Started nvidia-persistenced.service - NVIDIA Persistence Daemon.
░░ Subject: A start job for unit nvidia-persistenced.service has finished successfully
░░ A start job for unit nvidia-persistenced.service has finished successfully.

If I run startx from that terminal, the desktop appears.

Is there anything I should be doing to be able to boot into the log in screen? Should I be setting "nvidia-drm.modeset=1" somewhere?