coolgpus

Now it's working, but it doesn't actually have any effect

Open zkbutt opened this issue 5 years ago • 5 comments

image

zkbutt avatar Mar 23 '21 15:03 zkbutt

image The fan is still at 41% (RTX 2080, public edition).

zkbutt avatar Mar 23 '21 15:03 zkbutt

No idea. Try running it with the --verbose flag, which'll print the commands that the program's dispatching. Then you can try running those commands yourself in a separate window, see if you can figure out what's stopping them from working.
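One way to pull those commands out of the output, as a rough sketch. It assumes each dispatched command is printed on its own line after the text "Command launched: " (the format visible in the output posted later in this thread) and that the output was saved to a file. The function name `list_commands` and the filename `coolgpus.log` are made up for illustration.

```shell
# list_commands FILE
# Print every command recorded in a saved --verbose log, one per line.
# Assumes the "Command launched: " prefix; anything before it on the
# same line (for example a stray ^C) is dropped.
list_commands() {
    sed -n 's/^.*Command launched: //p' "$1"
}
```

For example, save the output with `sudo $(which coolgpus) --speed 99 99 --verbose 2>&1 | tee coolgpus.log`, then run `list_commands coolgpus.log` in a second window and paste the printed commands back in one at a time.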

andyljones avatar Mar 23 '21 18:03 andyljones

With the --verbose flag, the following output appears. I am connecting remotely via SSH. Please help me check it!

(base) feadre@e2680v2:~$ sudo $(which coolgpus) --speed 99 99 --verbose
Command launched: nvidia-smi --format=csv,noheader --query-gpu=pci.bus_id
00000000:03:00.0
Command finished
Command launched: pgrep Xorg
Command finished
No existing X servers, we're good to go
Starting xserver: Xorg :0 -once -config /tmp/cool-gpu-00000000:03:00.0avgi4g_3/xorg.conf
Started xserver
Command launched: nvidia-smi --format=csv,noheader --query-gpu=temperature.gpu -i 00000000:03:00.0

X.Org X Server 1.19.6
Release Date: 2017-12-20
X Protocol Version 11, Revision 0
Build Operating System: Linux 4.15.0-124-generic x86_64 Ubuntu
Current Operating System: Linux e2680v2 5.4.0-70-generic #78~18.04.1-Ubuntu SMP Sat Mar 20 14:10:07 UTC 2021 x86_64
Kernel command line: BOOT_IMAGE=/boot/vmlinuz-5.4.0-70-generic root=UUID=9027207a-2c9a-404c-9c36-2ff0e23c99ec ro text
Build Date: 30 November 2020 08:01:56PM
xorg-server 2:1.19.6-1ubuntu4.8 (For technical support please see http://www.ubuntu.com/support)
Current version of pixman: 0.34.0
Before reporting problems, check http://wiki.x.org to make sure that you have the latest version.
Markers: (--) probed, (**) from config file, (==) default setting, (++) from command line, (!!) notice, (II) informational, (WW) warning, (EE) error, (NI) not implemented, (??) unknown.
(==) Log file: "/var/log/Xorg.0.log", Time: Wed Mar 31 16:02:08 2021
(++) Using config file: "/tmp/cool-gpu-00000000:03:00.0avgi4g_3/xorg.conf"
(==) Using system config directory "/usr/share/X11/xorg.conf.d"
82
Command finished
GPU :0, 82C -> [99%-99%]. Setting speed to 99%
Command launched: nvidia-settings -a [gpu:0]/GPUFanControlState=1 -c :0
Unable to init server: Could not connect: Connection refused
libEGL warning: DRI2: failed to authenticate

Command finished
Command launched: nvidia-settings -a [fan:0]/GPUTargetFanSpeed=99 -c :0
Unable to init server: Could not connect: Connection refused
libEGL warning: DRI2: failed to authenticate

ERROR: Error assigning value 99 to attribute 'GPUTargetFanSpeed' (e2680v2:0[fan:0]) as specified in assignment '[fan:0]/GPUTargetFanSpeed=99' (Unknown Error).

Command finished
^CCommand launched: nvidia-settings -a [gpu:0]/GPUFanControlState=0 -c :0
Unable to init server: Could not connect: Connection refused
libEGL warning: DRI2: failed to authenticate

Command finished

zkbutt avatar Mar 31 '21 08:03 zkbutt

So that says one of

Xorg :0 -once -config /tmp/cool-gpu-00000000:03:00.0avgi4g_3/xorg.conf

or

nvidia-settings -a [gpu:0]/GPUFanControlState=1 -c :0

is failing. It's up to you to figure out which one's the culprit and fix it up.
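A minimal sketch for narrowing that down from a second window while coolgpus is still running. The helper names `try` and `xorg_errors` are made up for illustration; the log path and the "(EE)" error marker are the ones Xorg reports in the output above. Treat the exit status as a hint only: it is an assumption that nvidia-settings returns non-zero on every failure, so read what it prints as well.

```shell
# try COMMAND [ARGS...]
# Echo one command, run it, and report its exit status.
try() {
    printf '>>> %s\n' "$*"
    if "$@"; then
        echo "OK"
    else
        status=$?
        echo "FAILED (exit $status)"
        return "$status"
    fi
}

# xorg_errors [LOGFILE]
# Print the lines Xorg tagged as errors with the "(EE)" marker.
# Defaults to the log file named in the X server's startup output.
xorg_errors() {
    grep -F '(EE)' "${1:-/var/log/Xorg.0.log}"
}
```

For example: `try pgrep Xorg` shows whether the X server is still alive, `xorg_errors` shows what it complained about (the marker legend line itself also contains "(EE)", so one match is expected even in a clean log), and `try nvidia-settings -a '[gpu:0]/GPUFanControlState=1' -c :0` repeats the fan-control call on its own. The quotes around the bracketed attribute keep the shell from treating it as a glob pattern.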

andyljones avatar Mar 31 '21 08:03 andyljones

Have you solved your problem? I have the same issue as you.

zhaoedf avatar Oct 30 '21 06:10 zhaoedf