安装docker
sudo apt install docker.io
docker info
yeqiang@yeqiang-ms-7b23:~$ sudo docker info
client:
context: default
debug mode: false
server:
containers: 12
running: 0
paused: 0
stopped: 12
images: 67
server version: 20.10.21
storage driver: overlay2
backing filesystem: xfs
supports d_type: true
native overlay diff: true
userxattr: false
logging driver: json-file
cgroup driver: systemd
cgroup version: 2
plugins:
volume: local
network: bridge host ipvlan macvlan null overlay
log: awslogs fluentd gcplogs gelf journald json-file local logentries splunk syslog
swarm: inactive
runtimes: runc io.containerd.runc.v2 io.containerd.runtime.v1.linux
default runtime: runc
init binary: docker-init
containerd version:
runc version:
init version:
security options:
apparmor
seccomp
profile: default
cgroupns
kernel version: 5.19.0-50-generic
operating system: ubuntu 22.04.2 lts
ostype: linux
architecture: x86_64
cpus: 6
total memory: 31.28gib
name: yeqiang-ms-7b23
id: wbwu:ntcf:2b5d:vxmx:dyyi:puyq:wl2a:7saj:5zs2:lyiz:vm7a:dihq
docker root dir: /home/docker
debug mode: false
registry: https://index.docker.io/v1/
labels:
experimental: false
insecure registries:
127.0.0.0/8
live restore enabled: false
安装nvidia container toolkit
配置apt源
distribution=$(. /etc/os-release;echo $id$version_id)
curl -s -l https://nvidia.github.io/libnvidia-container/gpgkey | sudo apt-key add -
curl -s -l https://nvidia.github.io/libnvidia-container/$distribution/libnvidia-container.list | sudo tee /etc/apt/sources.list.d/nvidia-container-toolkit.list
安装nvidia-container-toolkit
sudo apt-get update && sudo apt-get install -y nvidia-container-toolkit
安装日志
命中:1 http://mirrors.tuna.tsinghua.edu.cn/ubuntu jammy inrelease
命中:2 http://packages.microsoft.com/repos/code stable inrelease
命中:3 https://dl.google.com/linux/chrome/deb stable inrelease
命中:4 http://mirrors.tuna.tsinghua.edu.cn/ubuntu jammy-updates inrelease
获取:5 https://nvidia.github.io/libnvidia-container/stable/ubuntu18.04/amd64 inrelease [1,484 b]
命中:6 http://mirrors.tuna.tsinghua.edu.cn/ubuntu jammy-backports inrelease
命中:7 http://mirrors.tuna.tsinghua.edu.cn/ubuntu jammy-security inrelease
命中:8 http://security.ubuntu.com/ubuntu jammy-security inrelease
命中:9 http://mirrors.tuna.tsinghua.edu.cn/ubuntu jammy-proposed inrelease
获取:10 https://nvidia.github.io/libnvidia-container/stable/ubuntu18.04/amd64 packages [29.2 kb]
已下载 30.7 kb,耗时 1秒 (28.0 kb/s)
正在读取软件包列表... 完成
w: https://nvidia.github.io/libnvidia-container/stable/ubuntu18.04/amd64/inrelease: 密钥存储在过时的 trusted.gpg 密钥环中(/etc/apt/trusted.gpg),请参见 apt-key(8) 的 deprecation 一节以了解详情。
正在读取软件包列表... 完成
正在分析软件包的依赖关系树... 完成
正在读取状态信息... 完成
将会同时安装下列软件:
libnvidia-container-tools libnvidia-container1 nvidia-container-toolkit-base
下列【新】软件包将被安装:
libnvidia-container-tools libnvidia-container1 nvidia-container-toolkit nvidia-container-toolkit-base
升级了 0 个软件包,新安装了 4 个软件包,要卸载 0 个软件包,有 71 个软件包未被升级。
需要下载 4,050 kb 的归档。
解压缩后会消耗 15.7 mb 的额外空间。
获取:1 https://nvidia.github.io/libnvidia-container/stable/ubuntu18.04/amd64 libnvidia-container1 1.13.5-1 [928 kb]
获取:2 https://nvidia.github.io/libnvidia-container/stable/ubuntu18.04/amd64 libnvidia-container-tools 1.13.5-1 [24.9 kb]
获取:3 https://nvidia.github.io/libnvidia-container/stable/ubuntu18.04/amd64 nvidia-container-toolkit-base 1.13.5-1 [2,244 kb]
获取:4 https://nvidia.github.io/libnvidia-container/stable/ubuntu18.04/amd64 nvidia-container-toolkit 1.13.5-1 [853 kb]
已下载 4,050 kb,耗时 12秒 (337 kb/s)
正在选中未选择的软件包 libnvidia-container1:amd64。
(正在读取数据库 ... 系统当前共安装有 263151 个文件和目录。)
准备解压 .../libnvidia-container1_1.13.5-1_amd64.deb ...
正在解压 libnvidia-container1:amd64 (1.13.5-1) ...
正在选中未选择的软件包 libnvidia-container-tools。
准备解压 .../libnvidia-container-tools_1.13.5-1_amd64.deb ...
正在解压 libnvidia-container-tools (1.13.5-1) ...
正在选中未选择的软件包 nvidia-container-toolkit-base。
准备解压 .../nvidia-container-toolkit-base_1.13.5-1_amd64.deb ...
正在解压 nvidia-container-toolkit-base (1.13.5-1) ...
正在选中未选择的软件包 nvidia-container-toolkit。
准备解压 .../nvidia-container-toolkit_1.13.5-1_amd64.deb ...
正在解压 nvidia-container-toolkit (1.13.5-1) ...
正在设置 nvidia-container-toolkit-base (1.13.5-1) ...
正在设置 libnvidia-container1:amd64 (1.13.5-1) ...
正在设置 libnvidia-container-tools (1.13.5-1) ...
正在设置 nvidia-container-toolkit (1.13.5-1) ...
正在处理用于 libc-bin (2.35-0ubuntu3.2) 的触发器 ...
验证安装
root@yeqiang-ms-7b23:~# systemctl restart docker
root@yeqiang-ms-7b23:~# docker run --rm -it --gpus all ubuntu:22.04 /bin/bash
root@cad0abb4936b:/# nvidia-smi
tue aug 1 00:57:29 2023
+-----------------------------------------------------------------------------+
| nvidia-smi 520.61.05 driver version: 520.61.05 cuda version: 11.8 |
|-------------------------------+----------------------+----------------------+
| gpu name persistence-m| bus-id disp.a | volatile uncorr. ecc |
| fan temp perf pwr:usage/cap| memory-usage | gpu-util compute m. |
| | | mig m. |
|===============================+======================+======================|
| 0 nvidia geforce ... off | 00000000:01:00.0 on | n/a |
| 0% 51c p8 21w / 160w | 528mib / 6144mib | 29% default |
| | | n/a |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| processes: |
| gpu gi ci pid type process name gpu memory |
| id id usage |
|=============================================================================|
+-----------------------------------------------------------------------------+
root@cad0abb4936b:/#
参考资料:
发表评论