无法加载动态库“libcublasLt.so.11”; dlerror: libcublasLt.so.11: 无法打开共享对象文件: 没有这样的文件或目录

2024-02-15

我刚刚更新了我的显卡驱动器

sudo apt install nvidia-driver-470
sudo apt install cuda-drivers-470

我决定以这种方式安装它们,因为它们在尝试安装时受到阻碍sudo apt upgrade。然后我错误地做了sudo apt autoremove清理旧包。重新启动计算机以正确设置新驱动程序后,我无法再将 GPU 加速与 Tensorflow 结合使用。

import tensorflow as tf
tf.test.is_gpu_available()
WARNING:tensorflow:From <stdin>:1: is_gpu_available (from tensorflow.python.framework.test_util) is deprecated and will be removed in a future version.
Instructions for updating:
Use `tf.config.list_physical_devices('GPU')` instead.
2021-12-07 16:52:01.771391: I tensorflow/core/platform/cpu_feature_guard.cc:151] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  AVX2 FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2021-12-07 16:52:01.807283: I tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:939] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero
2021-12-07 16:52:01.807973: W tensorflow/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libcudart.so.11.0'; dlerror: libcudart.so.11.0: cannot open shared object file: No such file or directory
2021-12-07 16:52:01.808017: W tensorflow/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libcublas.so.11'; dlerror: libcublas.so.11: cannot open shared object file: No such file or directory
2021-12-07 16:52:01.808048: W tensorflow/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libcublasLt.so.11'; dlerror: libcublasLt.so.11: cannot open shared object file: No such file or directory
2021-12-07 16:52:01.856391: W tensorflow/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libcusolver.so.11'; dlerror: libcusolver.so.11: cannot open shared object file: No such file or directory
2021-12-07 16:52:01.856466: W tensorflow/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libcusparse.so.11'; dlerror: libcusparse.so.11: cannot open shared object file: No such file or directory
2021-12-07 16:52:01.857601: W tensorflow/core/common_runtime/gpu/gpu_device.cc:1850] Cannot dlopen some GPU libraries. Please make sure the missing libraries mentioned above are installed properly if you would like to use GPU. Follow the guide at https://www.tensorflow.org/install/gpu for how to download and setup the required libraries for your platform.
Skipping registering GPU devices...
False

你安装了吗cuda-toolkit?该错误表明未找到版本 11 的库。问题是cudatoolkit和cudnn版本可能与您的tensorflow版本不兼容。

如果您已经安装了正确版本的工具包,请直接执行步骤5。(您可以使用命令检查版本nvcc --version).

  1. 从以下位置下载安装程序https://developer.nvidia.com/cuda-11-4-4-download-archive?target_os=Linux https://developer.nvidia.com/cuda-11-4-4-download-archive?target_os=Linux(此版本与驱动程序兼容nvidia-470你安装了)。接下来的步骤具体针对runfile option.

  2. 因为你已经安装了nvidia-drivers, press Continue如果出现此消息。

  3. 接受条款。

  4. 同样,由于您已经安装了驱动程序,只需禁用驱动程序选项并按Install.

  5. 现在您需要配置二进制文件和库的路径。使用find命令搜索nvcc and libcublas.so.*:

    sudo find / -name 'nvcc'  # Path to binaries
    sudo find / -name 'libcublas.so.*'  # Path to libraries
    
  6. 最后,在文件末尾添加下一行~/.profile根据您上面找到的路径。 Cuda 安装于/usr/local/cuda-11.4在我的系统中。

    if [ -d "/usr/local/cuda-11.4" ]; then
        PATH=/usr/local/cuda-11.4/bin${PATH:+:${PATH}}
        LD_LIBRARY_PATH=/usr/local/cuda-11.4/targets/x86_64-linux/lib/${LD_LIBRARY_PATH:+:${LD_LIBRARY_PATH}}
    fi
    

如果更新~\.profile不起作用,尝试更新.bashrc or .zshrc(如果你使用zsh代替bash).

  1. 重新启动计算机。
本文内容由网友自发贡献,版权归原作者所有,本站不承担相应法律责任。如您发现有涉嫌抄袭侵权的内容,请联系:hwhale#tublm.com(使用前将#替换为@)

无法加载动态库“libcublasLt.so.11”; dlerror: libcublasLt.so.11: 无法打开共享对象文件: 没有这样的文件或目录 的相关文章

随机推荐