konduktor secret (see complex example here). The model deepseek-ai/DeepSeek-R1-Distill-Llama-8B does not require one.
Documentation Index
Fetch the complete documentation index at: /llms.txt
Use this file to discover all available pages before exploring further.
🎓 Pluto PRO promo: academic users and early-stage startups can email founders@trainy.ai for a PROMO code.
Example deployment yamls for vLLM deployments with konduktor serve
konduktor secret (see complex example here). The model deepseek-ai/DeepSeek-R1-Distill-Llama-8B does not require one.
$ ls
deployment.yaml
$ konduktor serve launch deployment.yaml
# no autoscaling + default port (8000) + single GPU
name: serving-vllm-simple
resources:
cpus: 4
memory: 32
accelerators: A100:1
image_id: vllm/vllm-openai:v0.7.1
labels:
kueue.x-k8s.io/queue-name: user-queue
serving:
min_replicas: 1
run: |
python3 -m vllm.entrypoints.openai.api_server \
--model deepseek-ai/DeepSeek-R1-Distill-Llama-8B \
--max-model-len 4096
Was this page helpful?