AI Inference Operator for Kubernetes
kubernetes
ai
k8s
whisper
autoscaler
openai-api
llm
vllm
faster-whisper
ollama
vllm-operator
ollama-operator
inference-operator
-
Updated
Nov 22, 2024 - Go