-
Notifications
You must be signed in to change notification settings - Fork 41
Description
Hi, I am trying to get amdgpu-fan working on Debian 12 due to high GPU temps, I followed the steps outlined on the Debian Wiki.
When checking the status of the service after restarting the daemon and system entirely I find this:
× amdgpu-fan.service - Service for controling amdgpu fan.
Loaded: loaded (/etc/systemd/system/amdgpu-fan.service; enabled; preset: enabled)
Active: failed (Result: exit-code) since Tue 2024-01-23 19:04:12 GMT; 2s ago
Duration: 107ms
Process: 6776 ExecStart=/usr/local/bin/amdgpu-fan (code=exited, status=1/FAILURE)
Main PID: 6776 (code=exited, status=1/FAILURE)
CPU: 811ms
Jan 23 19:04:12 Computer amdgpu-fan[6776]: >
Jan 23 19:04:12 Computer amdgpu-fan[6776]: File "/usr/local/lib/python3.11/dist-packages/amdgpu_fan/lib/amdgpu.py", l>
Jan 23 19:04:12 Computer amdgpu-fan[6776]: return int(self.read_endpoint('fan1_input'))
Jan 23 19:04:12 Computer amdgpu-fan[6776]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Jan 23 19:04:12 Computer amdgpu-fan[6776]: File "/usr/local/lib/python3.11/dist-packages/amdgpu_fan/lib/amdgpu.py", l>
Jan 23 19:04:12 Computer amdgpu-fan[6776]: return e.read()
Jan 23 19:04:12 Computer amdgpu-fan[6776]: ^^^^^^^^
Jan 23 19:04:12 Computer amdgpu-fan[6776]: OSError: [Errno 95] Operation not supported
Jan 23 19:04:12 Computer systemd[1]: amdgpu-fan.service: Main process exited, code=exited, status=1/FAILURE
Jan 23 19:04:12 Computer systemd[1]: amdgpu-fan.service: Failed with result 'exit-code'.
Which is odd because I actually have managed to get the service to run, maybe this was with the config file not requiring modifications to the pwm1 file?
● amdgpu-fan.service - Service for controling amdgpu fan.
Loaded: loaded (/etc/systemd/system/amdgpu-fan.service; enabled; preset: enabled)
Active: active (running) since Tue 2024-01-23 18:45:51 GMT; 8ms ago
Main PID: 4461 (amdgpu-fan)
Tasks: 1 (limit: 18997)
Memory: 2.5M
CPU: 3ms
CGroup: /system.slice/amdgpu-fan.service
└─4461 /usr/bin/python3 /usr/local/bin/amdgpu-fan
I have inspected the /usr/local/lib/python3.11/dist-packages/amdgpu_fan/lib/amdgpu.py file myself but cannot seem to find any issues with it, other than the fact that these variables aren't pointing to the correct locations?
ROOT_DIR = '/sys/class/drm'
HWMON_DIR = 'device/hwmon'
My systemd service looks like this:
[Unit]
Description=Service for controling amdgpu fan.
After=syslog.service
[Service]
Type=simple
User=root
ExecStart=/usr/local/bin/amdgpu-fan
Restart=always
[Install]
WantedBy=multi-user.target
As mentioned I have been able to get the service to start but it has not modified the fan speeds for whatever reason (possibly due to the config file and current temps) but I have been able to use a different tool to control the fan speed so I know that it definitely can be done, I would just rather rely on this tool rather than that one.
The GPU in question
08:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Pitcairn XT [Radeon HD 7870 GHz Edition]
Subsystem: Micro-Star International Co., Ltd. [MSI] Pitcairn XT [Radeon HD 7870 GHz Edition]
Kernel driver in use: amdgpu
Kernel modules: radeon, amdgpu
As far as I know, the card (although old) should be supported by both amdgpu and amdgpu-fan.
I don't know if this helps/mean anything but after a system reboot, checking the status returns this:
× amdgpu-fan.service - Service for controling amdgpu fan.
Loaded: loaded (/etc/systemd/system/amdgpu-fan.service; enabled; preset: enabled)
Active: failed (Result: exit-code) since Tue 2024-01-23 19:42:35 GMT; 16min ago
Duration: 116ms
Process: 1177 ExecStart=/usr/local/bin/amdgpu-fan (code=exited, status=1/FAILURE)
Main PID: 1177 (code=exited, status=1/FAILURE)
CPU: 839ms
Jan 23 19:42:35 Computer systemd[1]: amdgpu-fan.service: Scheduled restart job, restart counter is at 5.
Jan 23 19:42:35 Computer systemd[1]: Stopped amdgpu-fan.service - Service for controling amdgpu fan..
Jan 23 19:42:35 Computer systemd[1]: amdgpu-fan.service: Start request repeated too quickly.
Jan 23 19:42:35 Computer systemd[1]: amdgpu-fan.service: Failed with result 'exit-code'.
Jan 23 19:42:35 Computer systemd[1]: Failed to start amdgpu-fan.service - Service for controling amdgpu fan..
Any help is much appreciated to save my GPU from frying.
Thanks in advance!
