Skip to main content
← Back to sources

DeepInfra Joins Hugging Face Inference Providers as Routed Serverless Backend

Published 2026-04-29Ingested 2026-05-05AI Infrastructure and ComputeMedium⭐ Timeline Candidate

Summary

Hugging Face added DeepInfra as a supported Inference Provider on April 29, 2026, opening another path for developers to run open-weight models through HF's Router with no markup over DeepInfra's direct rates. Initial coverage includes DeepSeek V4 Pro (862B), Kimi-K2.6 (1.1T multimodal), and GLM-5.1 (754B) for chat/text-generation, with text-to-image, text-to-video, and embeddings expected to follow. The integration supports both UI-driven configuration and the existing OpenAI-compatible Python/

Alignment: Reinforces current position
Related Positions: AI Infrastructure Strategy, Multi-Model Multi-Vendor Strategy
hugging-facedeepinfrainference-providersdeepseek-v4kimi-k2glm-5open-weightsmodel-routing
DeepInfra Joins Hugging Face Inference Providers as Routed Serverless Backend — Intelligence — Agentic Developer Tools Radar · Signal