KServe

CNCF-incubating model inference platform (formerly KFServing) that provides Kubernetes CRDs for scalable predictive and generative workloads.

Introduction

Overview

KServe standardises ML model deployment on Kubernetes with an opinionated control plane and data plane built on Knative, Istio and Envoy.

Key Capabilities
  • InferenceService & ServingRuntime CRDs
  • Canary / blue-green rollout & autoscaling
  • GPU / multi-model support (Triton, vLLM)
  • Pluggable explainers, loggers and drift detectors
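
To make the first two capabilities concrete, here is a minimal sketch of an InferenceService manifest with a canary traffic split and replica bounds, built as a Python dict and printed as JSON (which kubectl also accepts). The helper name build_inference_service, the resource name sklearn-iris and the storage URI are illustrative assumptions, not taken from this page. The field names follow the serving.kserve.io/v1beta1 API as commonly documented, so check them against the KServe version you run.

```python
import json


def build_inference_service(name, model_format, storage_uri,
                            canary_percent=None, min_replicas=1, max_replicas=3):
    """Return an InferenceService manifest as a plain dict.

    Hypothetical helper for illustration; it only assembles the manifest
    and does not talk to a cluster.
    """
    if canary_percent is not None and not 0 <= canary_percent <= 100:
        raise ValueError("canary_percent must be between 0 and 100")
    if min_replicas > max_replicas:
        raise ValueError("min_replicas must not exceed max_replicas")

    predictor = {
        # Replica bounds used by the autoscaler.
        "minReplicas": min_replicas,
        "maxReplicas": max_replicas,
        "model": {
            "modelFormat": {"name": model_format},
            "storageUri": storage_uri,
        },
    }
    if canary_percent is not None:
        # Share of traffic routed to the newest revision during a canary rollout.
        predictor["canaryTrafficPercent"] = canary_percent

    return {
        "apiVersion": "serving.kserve.io/v1beta1",
        "kind": "InferenceService",
        "metadata": {"name": name},
        "spec": {"predictor": predictor},
    }


if __name__ == "__main__":
    manifest = build_inference_service(
        name="sklearn-iris",                      # assumed example name
        model_format="sklearn",
        storage_uri="gs://example-bucket/model",  # placeholder URI
        canary_percent=10,
    )
    print(json.dumps(manifest, indent=2))
```

Saving the printed JSON to a file and applying it with kubectl apply -f would be one way to try it. Leaving canary_percent unset omits the field, so the manifest carries no canary split.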

Information

Categories