Using GPUs from Kubernetes (k8s)
  TEZNKK3IfmPf, November 13, 2023
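Create a Deployment that can use the GPU

apiVersion: apps/v1
kind: Deployment
metadata:
  name: gpu-master
  namespace: gpu
spec:
  replicas: 1
  selector:
    matchLabels:
      app: gpu-master
  template:
    metadata:
      labels:
        app: gpu-master
    spec:
      hostname: gpu-master
      containers:
      - name: gpu-master
        image: 192.168.168.10:5000/library/pytorch-gpu:v3
        env:
        # Read by the NVIDIA container runtime: driver features and devices to expose
        - name: NVIDIA_DRIVER_CAPABILITIES
          value: compute,utility
        - name: NVIDIA_VISIBLE_DEVICES
          value: all
        securityContext:
          # Runs as a privileged container, as root
          privileged: true
          runAsUser: 0
        resources:
          # nvidia.com/gpu is an extended resource advertised by the NVIDIA device plugin;
          # for extended resources, requests must equal limits
          limits:
            nvidia.com/gpu: "1"
          requests:
            nvidia.com/gpu: "1"
        volumeMounts:
        - name: code-host-path
          mountPath: /persistent
      volumes:
      # Host directory with the code, mounted in the container at /persistent
      - name: code-host-path
        hostPath:
          path: /root/gpu/gpucode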
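Once the pod from this Deployment is running, a quick check is whether the container actually sees a GPU. Because `NVIDIA_DRIVER_CAPABILITIES` includes `utility`, `nvidia-smi` should normally be usable inside the container. The sketch below is not part of the original post: it assumes `nvidia-smi -L` prints one `GPU <index>: <name> (UUID: <uuid>)` line per device, and the names `parse_gpu_list` and `list_visible_gpus` are hypothetical. It sticks to `subprocess` arguments that exist in Python 3.6, since that is the interpreter version this post uses.

```python
import re
import shutil
import subprocess

# Assumed line format of `nvidia-smi -L`: "GPU 0: <name> (UUID: <uuid>)"
_GPU_LINE = re.compile(r"^GPU (\d+): (.+?) \(UUID: ([^)]+)\)$")


def parse_gpu_list(text):
    """Parse the text printed by `nvidia-smi -L` into a list of dicts."""
    gpus = []
    for line in text.splitlines():
        match = _GPU_LINE.match(line.strip())
        if match:
            gpus.append({
                "index": int(match.group(1)),
                "name": match.group(2),
                "uuid": match.group(3),
            })
    return gpus


def list_visible_gpus():
    """Run `nvidia-smi -L` if present; return [] when it is missing or fails."""
    if shutil.which("nvidia-smi") is None:
        return []
    result = subprocess.run(
        ["nvidia-smi", "-L"],
        stdout=subprocess.PIPE,
        stderr=subprocess.PIPE,
        universal_newlines=True,  # Python 3.6 spelling of text=True
    )
    if result.returncode != 0:
        return []
    return parse_gpu_list(result.stdout)


if __name__ == "__main__":
    for gpu in list_visible_gpus():
        print(gpu)
```

If the file were saved as `/root/gpu/gpucode/check_gpu.py` on the host (the file name is made up for illustration), it would appear in the container under `/persistent` and could be run with something like `kubectl exec -n gpu deploy/gpu-master -- /usr/local/anaconda2/envs/edu_pytorch/bin/python3.6 /persistent/check_gpu.py`. An empty result suggests the device plugin or the NVIDIA runtime is not set up on the node.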

Create a Job that can use the GPU

apiVersion: batch/v1
kind: Job
metadata:
  labels:
    app-name: gpu-job
    job-name: gpu-job
  name: gpu-job
  namespace: gpu
spec:
  # Retry a failing pod up to 6 times; run one pod at a time
  backoffLimit: 6
  parallelism: 1
  template:
    metadata:
      labels:
        app-name: gpu-job
        job-name: gpu-job
      name: gpu-job
    spec:
      containers:
      # Run the test script from the mounted code directory
      - command:
        - /bin/bash
        - -c
        - '/usr/local/anaconda2/envs/edu_pytorch/bin/python3.6 /persistent/test.py '
        image: 192.168.168.10:5000/library/pytorch-gpu:v3
        env:
        # Read by the NVIDIA container runtime: driver features and devices to expose
        - name: NVIDIA_DRIVER_CAPABILITIES
          value: compute,utility
        - name: NVIDIA_VISIBLE_DEVICES
          value: all
        imagePullPolicy: IfNotPresent
        name: gpu-job
        resources:
          # Extended resource from the NVIDIA device plugin; requests must equal limits
          limits:
            nvidia.com/gpu: "1"
          requests:
            nvidia.com/gpu: "1"
        securityContext:
          privileged: true
          procMount: Default
        volumeMounts:
        - name: code-host-path
          mountPath: /persistent
      dnsPolicy: ClusterFirst
      hostname: gpu-job
      restartPolicy: OnFailure
      schedulerName: default-scheduler
      securityContext: {}
      volumes:
      # Host directory with the code, mounted in the container at /persistent
      - name: code-host-path
        hostPath:
          path: /root/gpu/gpucode
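The Job runs `/persistent/test.py`, but the post does not show that script. The following is only a guess at what a minimal check could look like, assuming the image's `edu_pytorch` environment has PyTorch installed; the function names `gpu_report` and `main` are hypothetical. It reports the two `NVIDIA_*` variables set in the manifest and asks PyTorch whether CUDA is usable, falling back gracefully if `torch` cannot be imported.

```python
import os


def gpu_report():
    """Collect what this process can tell about GPU visibility."""
    report = {
        "NVIDIA_VISIBLE_DEVICES": os.environ.get("NVIDIA_VISIBLE_DEVICES"),
        "NVIDIA_DRIVER_CAPABILITIES": os.environ.get("NVIDIA_DRIVER_CAPABILITIES"),
        "torch_installed": False,
        "cuda_available": False,
        "device_count": 0,
        "device_names": [],
    }
    try:
        import torch  # assumption: PyTorch is present in the image
    except ImportError:
        return report

    report["torch_installed"] = True
    report["cuda_available"] = bool(torch.cuda.is_available())
    if report["cuda_available"]:
        count = torch.cuda.device_count()
        report["device_count"] = count
        report["device_names"] = [
            torch.cuda.get_device_name(i) for i in range(count)
        ]
    return report


def main():
    """Print the report; return 0 if CUDA is usable, 1 otherwise."""
    report = gpu_report()
    for key, value in report.items():
        print("{}: {}".format(key, value))
    return 0 if report["cuda_available"] else 1
```

To use this as the Job's script, the file would end with `raise SystemExit(main())`. A non-zero exit code marks the pod as failed, so with `restartPolicy: OnFailure` and `backoffLimit: 6` the Job retries and eventually fails instead of reporting success on a node without a working GPU.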
Last edited on November 13, 2023
