Requesty — LLM Gateways Platform

LLM GatewaysLayer 1Usage-based (5% markup)

Founded 2023|London, United Kingdom|2-10

What is Requesty?

Requesty is a unified LLM gateway and router that provides access to 400+ models across 20+ providers through a single OpenAI-compatible API endpoint. Based in London, the company positions itself as 'Cloudflare for AI' — an infrastructure layer that sits between applications and LLM providers, handling intelligent routing, automatic failover, cost optimization, and enterprise governance.

Founded in 2023, Requesty pivoted from data analytics to the LLM gateway space in early 2025. The company raised a GBP 2M (~$3M) seed round led by 20VC (Harry Stebbings) with participation from Tapestry VC and others. At the time of funding, Requesty had reached $1.5M ARR and 25,000+ developers, growing to 50,000+ developers by early 2026. Notable enterprise clients include Shopify, Amadeus, Chargebee, and Pfizer.

Requesty differentiates through its transparent pricing model (flat 5% markup on base model costs), smart routing that automatically selects the optimal model per request, semantic caching that delivers up to 40% cost reduction, and enterprise governance features including PII detection, spending controls, and EU data residency. The platform claims 99.99% uptime SLA with failover in under 20ms.

Key Features

✓400+ model access
✓Intelligent routing & failover (<20ms)
✓Cost optimization & spending controls
✓PII redaction
✓Prompt caching

Pros & Cons

Pros

+Extreme cost savings with smart routing and semantic caching delivering 40-80% API cost reduction
+Near-zero integration friction — OpenAI-compatible API requires only a single base_url change
+Best-in-class reliability with 99.99% uptime SLA and automatic failover in under 20ms
+Transparent pricing with flat 5% markup, no hidden fees, and generous free tier
+Enterprise governance with PII scrubbing, spending limits, approved model lists, and EU data residency

Cons

-Tiny team of ~5 employees — startup durability is a concern for enterprise buyers
-Limited community presence with no organic developer discussions on Reddit or Hacker News
-Recent pivot from data analytics (2025) — long-term commitment to LLM gateway space unproven
-No open-source option unlike competitor LiteLLM

Requesty Pricing

Free trial available

FreeFree

✓$6 in free credits
✓Access to all features
✓All 400+ models
✓Never auto-bills

Pro5% markup on model costsusage-based

✓All models and providers
✓Smart routing
✓Automatic failover
✓Load balancing
✓Cost tracking
✓Email support

EnterpriseCustom

✓Volume discounts
✓EU data residency
✓SSO/SAML
✓SOC 2 compliance
✓Custom SLAs
✓On-premise option
✓Dedicated support

View official pricing page

Common Use Cases

Enterprise AI teams needing governed LLM access

•Multi-provider LLM access
•Cost optimization
•Enterprise governance
•Failover & reliability
•Data privacy compliance

Using Requesty with Respan

Requesty and Respan operate at adjacent layers of the AI infrastructure stack. Requesty handles model routing and gateway management, while Respan provides deep observability, evaluation, and optimization of the LLM calls flowing through gateways like Requesty.

✓Layer Respan observability on top of Requesty-routed LLM calls for comprehensive monitoring
✓Use Respan evaluations to validate output quality across models that Requesty routes between
✓Compare Respan cost analytics with Requesty routing decisions to optimize spend
✓Monitor Requesty failover events and routing patterns through Respan dashboards

Add deep observability to your LLM gateway with Respan

Best Requesty Alternatives & Competitors

Top companies in LLM Gateways you can use instead of Requesty.

RespanLLM Gateways

OpenRouterLLM Gateways

Cloudflare AI GatewayLLM Gateways

Vercel AI GatewayLLM Gateways

StainlessLLM Gateways

UnifyLLM Gateways

MartianLLM Gateways

Kong AI GatewayLLM Gateways

Apigee AI GatewayLLM Gateways

The Token CompanyLLM Gateways

View all Requesty alternatives →

Compare Requesty

Requesty vs Respan Requesty vs OpenRouter Requesty vs Cloudflare AI Gateway Requesty vs Vercel AI Gateway Requesty vs LiteLLM

Best Integrations for Requesty

Companies from adjacent layers in the AI stack that work well with Requesty.

MilvusVector Databases

PineconeVector Databases

QdrantVector Databases

ChromaVector Databases

RAGFlowRAG Frameworks

UnstructuredRAG Frameworks

LlamaIndexRAG Frameworks

SupabaseVector Databases

ApifyWeb Scraping

WeaviateVector Databases

Bright DataWeb Scraping

Mem0Memory Layer

Last verified: March 27, 2026