Nvidia cuda: система имеет неподдерживаемую комбинацию драйвера дисплея / драйвера cuda - PullRequest
0 голосов
/ 21 апреля 2020

Ubuntu 18.04

  • cuda драйвер 10.1
  • nvidia драйвер 435.21

Docker контейнер - nvidia/cuda:10.1-devel-ubuntu16.04

Совместимо драйверы cuda и nvidia в соответствии с:

  1. https://docs.nvidia.com/cuda/cuda-toolkit-release-notes/index.html
  2. https://docs.nvidia.com/deploy/cuda-compatibility/index.html
nvidia-smi
Tue Apr 21 02:57:24 2020
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 435.21       Driver Version: 435.21       CUDA Version: 10.1     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  Tesla T4            Off  | 00000000:00:1E.0 Off |                    0 |
| N/A   44C    P0    28W /  70W |      0MiB / 15109MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                       GPU Memory |
|  GPU       PID   Type   Process name                             Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+

Однако я все еще сталкиваюсь

With Error: CUDA: Check failed: e == cudaSuccess (803 vs. 0) : system has unsupported display driver / cuda driver combination
Stack trace:
  File "include/mxnet/base.h", line 447
  [bt] (0) ./lenet(dmlc::LogMessageFatal::~LogMessageFatal()+0x7f) [0x40812f]
  [bt] (1) /work/mxnet/lib/libmxnet.so(mxnet::Context::GetGPUCount()+0x23c) [0x7fb1e9220a5c]
  [bt] (2) /work/mxnet/lib/libmxnet.so(mxnet::Context::CudaLibChecks()+0x1f5) [0x7fb1e92f20c5]
  [bt] (3) /work/mxnet/lib/libmxnet.so(mxnet::Context::Create(mxnet::Context::DeviceType, int)+0x69) [0x7fb1e651e1c9]
  [bt] (4) /work/mxnet/lib/libmxnet.so(void CreateNDArray<unsigned int>(unsigned int const*, int, int, int, int, int, void**)+0x5d1) [0x7fb1e9222ba1]
  [bt] (5) /work/mxnet/lib/libmxnet.so(MXNDArrayCreateEx+0x4d) [0x7fb1e91f6d9d]
  [bt] (6) ./lenet() [0x40b10a]
  [bt] (7) ./lenet() [0x418914]
  [bt] (8) ./lenet() [0x41ded0]
  [bt] (9) ./lenet() [0x4056e9]

...