GPU parsing fails when no processes are running:
XPU.coerce('auto') fails to parse nvidia-smi in this instance:
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 418.87.01 Driver Version: 418.87.01 CUDA Version: 10.1 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
|===============================+======================+======================|
| 0 TITAN Xp Off | 00000000:05:00.0 Off | N/A |
| 32% 48C P0 63W / 250W | 0MiB / 12194MiB | 0% Default |
+-------------------------------+----------------------+----------------------+
| 1 TITAN Xp Off | 00000000:06:00.0 Off | N/A |
| 28% 45C P0 62W / 250W | 0MiB / 12196MiB | 0% Default |
+-------------------------------+----------------------+----------------------+
| 2 TITAN Xp Off | 00000000:09:00.0 Off | N/A |
| 25% 43C P0 61W / 250W | 0MiB / 12196MiB | 0% Default |
+-------------------------------+----------------------+----------------------+
| 3 TITAN Xp Off | 00000000:0A:00.0 Off | N/A |
| 21% 38C P0 55W / 250W | 0MiB / 12196MiB | 0% Default |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes: GPU Memory |
| GPU PID Type Process name Usage |
|=============================================================================|
| No running processes found |
+-----------------------------------------------------------------------------+
causes error:
Traceback (most recent call last):
File "/home/local/KHQ/cameron.johnson/campy3/lib/python3.5/site-packages/netharn/device.py", line 280, in coerce
return XPU.from_auto(**kwargs)
File "/home/local/KHQ/cameron.johnson/campy3/lib/python3.5/site-packages/netharn/device.py", line 161, in from_auto
gpu_num = find_unused_gpu(min_memory=min_memory)
File "/home/local/KHQ/cameron.johnson/campy3/lib/python3.5/site-packages/netharn/device.py", line 672, in find_unused_gpu
gpus = gpu_info()
File "/home/local/KHQ/cameron.johnson/campy3/lib/python3.5/site-packages/netharn/device.py", line 909, in gpu_info
proc = parse_proc_line(line)
File "/home/local/KHQ/cameron.johnson/campy3/lib/python3.5/site-packages/netharn/device.py", line 882, in parse_proc_line
index = int(parts[0])
ValueError: invalid literal for int() with base 10: 'No'
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "/usr/lib/python3.5/runpy.py", line 184, in _run_module_as_main
"__main__", mod_spec)
File "/usr/lib/python3.5/runpy.py", line 85, in _run_code
exec(code, run_globals)
File "/home/local/KHQ/cameron.johnson/campy3/lib/python3.5/site-packages/xviewharn-0.0.1.dev0-py3.5.egg/xviewharn/damage_fit.py", line 1022, in <module>
File "/home/local/KHQ/cameron.johnson/campy3/lib/python3.5/site-packages/xviewharn-0.0.1.dev0-py3.5.egg/xviewharn/damage_fit.py", line 632, in fit
File "/home/local/KHQ/cameron.johnson/campy3/lib/python3.5/site-packages/xviewharn-0.0.1.dev0-py3.5.egg/xviewharn/damage_fit.py", line 565, in setup_harn
File "/home/local/KHQ/cameron.johnson/campy3/lib/python3.5/site-packages/netharn/device.py", line 309, in coerce
raise ValueError('cannot cast to XPU. item={!r}. Caused by: {!r}'.format(item, ex))
ValueError: cannot cast to XPU. item='auto'. Caused by: ValueError("invalid literal for int() with base 10: 'No'",)