image

TrueFoundry provides an enterprise-grade AI Gateway that encompasses an LLM Gateway, MCP Gateway, and Agent Gateway, enabling enterprises to securely connect, observe, and govern access to models, tools, guardrails, and agents from a single control plane. It lets teams orchestrate agentic AI workloads that are secure (solving key management, authentication, and authorization), efficient (optimizing cost, latency, and multi-region failovers), and future-safe (unified, composable connections across LLMs, MCPs, and guardrails from any provider).

Beyond the gateway, TrueFoundry is a full agentic deployment platform on a Kubernetes-native interface. Teams can host any AI model (LLMs, embeddings, custom models) on high-performance backends like vLLM, TGI, and Triton; fine-tune models on their own data and push checkpoints to production; deploy dedicated MCP servers; and serve agents built with LangGraph, CrewAI, AutoGen, or their own orchestration—fully containerized and production-ready.

Key capabilities:

  • AI Gateway — universal API across providers, virtual models, playground, weight/latency/priority-based routing, fallbacks, budget and rate limiting, simple and semantic caching.
  • MCP Gateway & Agent Skills Registry — register and govern MCP servers/tools, RBAC on MCPs, virtual MCP servers, schema validation, and access control.
  • Prompt Lifecycle Management — version, manage, and monitor prompts with variables.
  • Observability & Tracing — framework-agnostic, OpenTelemetry-compliant tracing from prompt to tool/model execution; GPU/CPU/cluster infra monitoring; plugs into Grafana, Datadog, Prometheus.
  • Governance & Security — granular RBAC, SSO, immutable audit logging, real-time policy enforcement (data residency, quotas, cost control).
  • GPU optimization — GPU orchestration and autoscaling, fractional GPU support (NVIDIA MIG and time slicing), automated rightsizing.
  • Flexible deployment — SaaS, VPC, on-prem, air-gapped, hybrid, or multi-cloud, so no data leaves your domain.

Pricing (7-day free trial, 24×7 support, setup assistance):

  • Developer — $0/month. 50K requests/mo, 3 users. For explorers and early builders prototyping AI workflows.
  • Pro — $499/month. 1M requests/mo, 10 users. Usage-based scaling with higher limits and essential governance (additional usage billed per-unit; e.g., 2M requests + 5 API keys for +$499/mo).
  • Pro Plus — $2,999/month. 1M requests/mo, 25 users. Stricter data controls, advanced account management, priority SLAs—without self-hosting.
  • Enterprise — Custom (10M+ requests/mo). VPC/on-prem/air-gapped deployment, SSO, audit logs, advanced governance, enterprise-grade SLA. AI deployment pricing via sales.
  • Self-hosting the Gateway (or Control + Gateway planes) adds ~$600–$1,000/month in infra costs; fully managed SaaS has no hosting cost.

Unique capabilities: Single control plane spanning LLM + MCP + Agent gateways; deploy anywhere including air-gapped; Kubernetes-native model/agent/MCP deployment plus fine-tuning in one platform; fractional GPU sharing; framework- and provider-agnostic throughout.

Try now

FAQs

Q What happens if I exceed my request limits?

You're billed for additional usage at transparent per-unit rates (e.g., 2M requests and 5 API keys for an additional $499/month). For Pro Plus overages, contact sales.

Q Does Truefoundry support on-prem or VPC deployments?

Yes. The Enterprise plan supports full VPC and air-gapped installations for both the control plane and gateway plane, so no data leaves your domain.

Q What are the deployment options?

Four modes: (1) AI Gateway SaaS only, (2) SaaS AI Gateway with data stored on your own infrastructure, (3) Gateway Plane only, and (4) Control Plane + Gateway Plane. Available as SaaS, VPC, on-prem, air-gapped, hybrid, or multi-cloud.

Q Will I have to bear additional infrastructure costs?

With the fully managed SaaS AI Gateway there are no hosting costs (only your own storage if you choose to store data on your infra). Self-hosting the Gateway, or both Control and Gateway planes, costs roughly $600–$1,000/month.

Q What's the difference between the Standard SLA and Enterprise SLA?

Standard SLA typically offers 24–48 hour response times; Enterprise SLA provides customizable response times and support levels based on the customer's premium tier.

Promote TrueFoundry

Write a review

Your Rating
angry
crying
sleeping
smily
cool
Browse

Your review recommended to be at least 140 characters long :)

image

Additional Details

  • Paid, Free Trial
image