Fireworks AI

Route Fireworks AI model calls through Respan Gateway using your own Fireworks credits.
Set up Respan
  1. Sign up — Create an account at platform.respan.ai
  2. Create an API key — Generate one on the API keys page
  3. Add credits or a provider key — Add credits on the Credits page or connect your own provider key on the Integrations page
Use AI

Add the Docs MCP to your AI coding tool to get help building with Respan. No API key needed.

{
  "mcpServers": {
    "respan-docs": {
      "url": "https://mcp.respan.ai/mcp/docs"
    }
  }
}
This section is for Respan LLM gateway users.

Use Respan Gateway to call Fireworks AI models while keeping unified observability (logs, cost, latency, and reliability metrics) in Respan — and optionally charge usage to your own Fireworks credits.

Prerequisites

  • A Respan API key
  • A Fireworks API key (BYOK credits)
Get Fireworks API key

Retrieve your API key from Fireworks to begin integration.

Supported SDKs / integrations

✅ Supported Frameworks
  • OpenAI SDK
  • LangChain SDK
  • Vercel/OpenAI
  • Vercel/Google
  • LlamaIndex SDK
  • Google GenAI
  • Respan native (Otel)
❌ Unsupported Frameworks
  • Anthropic SDK
  • Vercel/Anthropic

Configuration

There are two ways to add your Fireworks credentials to your requests:

Via UI (Global)

  1. Navigate to Providers

     Go to the Providers page, where you can manage credentials for more than 20 supported providers.

     [Image: Respan Providers Page]

  2. Add your Fireworks API key

     Select Fireworks and paste your API key.

     [Image: Add Fireworks Credentials]

  3. Configure load balancing (optional)

     You can add multiple Fireworks API keys for redundancy. Use the Load balancing weight field to control how traffic is distributed between keys.
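
To reason about how weights might split traffic between keys, the sketch below assumes each key receives a share of requests proportional to its weight. That proportional rule, the key labels, and the function name `expected_traffic_share` are assumptions for illustration; the gateway's actual balancing algorithm is not described on this page.

```python
def expected_traffic_share(weights: dict[str, float]) -> dict[str, float]:
    """Return each key's expected fraction of traffic.

    Assumes a proportional rule: share = weight / sum of all weights.
    This is an illustrative model, not the documented gateway behavior.
    """
    if any(w < 0 for w in weights.values()):
        raise ValueError("weights must not be negative")
    total = sum(weights.values())
    if total <= 0:
        raise ValueError("at least one weight must be positive")
    return {name: w / total for name, w in weights.items()}


# Hypothetical labels for two Fireworks API keys with weights 3 and 1.
shares = expected_traffic_share({"key_a": 3, "key_b": 1})
print(shares)
```

Under this assumption, a key with weight 3 next to a key with weight 1 would handle about three quarters of the requests.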

Via code (Per-Request)

You can pass credentials dynamically in the request body. This is useful if you need to use your users’ own API keys (BYOK credits).

Add the customer_credentials parameter to your Gateway request:

{
  // Rest of the request body
  "customer_credentials": {
    "fireworks": {
      "api_key": "YOUR_FIREWORKS_API_KEY"
    }
  }
}
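
As an illustration of how the parameter fits into a complete request, the sketch below merges `customer_credentials` into an existing request body before it is sent. The helper name `with_fireworks_credentials` and the OpenAI-style `messages` field are assumptions for illustration; only the `customer_credentials` structure comes from the snippet above, and the model ID is borrowed from the logging example on this page.

```python
import copy
import json


def with_fireworks_credentials(body: dict, api_key: str) -> dict:
    """Return a copy of `body` with the Fireworks key under customer_credentials.

    Hypothetical helper: only the customer_credentials structure is taken
    from the documentation; everything else is illustrative.
    """
    merged = copy.deepcopy(body)  # leave the caller's dict untouched
    credentials = merged.setdefault("customer_credentials", {})
    credentials["fireworks"] = {"api_key": api_key}
    return merged


# Assumed OpenAI-style request body; adjust it to the endpoint you call.
request_body = {
    "model": "accounts/fireworks/models/llama-v3p1-8b-instruct",
    "messages": [{"role": "user", "content": "Hello"}],
}

payload = with_fireworks_credentials(request_body, "YOUR_FIREWORKS_API_KEY")
print(json.dumps(payload, indent=2))
```

Copying the body rather than mutating it keeps one user's key from leaking into a request template that is reused for other users.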

Log Fireworks AI requests

If you are not using the Gateway to proxy requests, you can still log your Fireworks requests to Respan asynchronously. This allows you to track cost, latency, and performance metrics for external calls.

Fireworks AI Python SDK
import requests

# Respan request-logging endpoint
url = "https://api.respan.ai/api/request-logs/create/"

# Details of a Fireworks call that was made outside the Gateway
payload = {
    "model": "accounts/fireworks/models/llama-v3p1-8b-instruct",
    "prompt_messages": [
        {
            "role": "user",
            "content": "Summarize the benefits of renewable energy"
        }
    ],
    "completion_message": {
        "role": "assistant",
        "content": "Renewable energy offers benefits like lower emissions, improved air quality, and long-term cost stability..."
    },
    "cost": 0.0003,
    "generation_time": 1.5,
    "customer_params": {
        "customer_identifier": "user_202"
    }
}

# Authenticate with your Respan API key (not your Fireworks key)
headers = {
    "Authorization": "Bearer YOUR_RESPAN_API_KEY",
    "Content-Type": "application/json"
}

response = requests.post(url, headers=headers, json=payload)
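
In practice the logged values come from a real call rather than literals. The sketch below times a model call and assembles a payload with the same field names as the example above. The helper `build_log_payload`, the stand-in `fake_model`, and the choice to record `generation_time` in seconds are assumptions for illustration; `cost` is left out because it depends on your own pricing data.

```python
import time
from typing import Callable


def build_log_payload(
    model: str,
    prompt: str,
    call_model: Callable[[str], str],
    customer_identifier: str,
) -> dict:
    """Run `call_model`, time it, and return a log payload dict.

    Hypothetical helper: field names mirror the example payload; the
    elapsed time is measured in seconds (an assumption about the unit).
    """
    start = time.perf_counter()
    completion = call_model(prompt)
    elapsed = time.perf_counter() - start
    return {
        "model": model,
        "prompt_messages": [{"role": "user", "content": prompt}],
        "completion_message": {"role": "assistant", "content": completion},
        "generation_time": elapsed,
        "customer_params": {"customer_identifier": customer_identifier},
    }


def fake_model(prompt: str) -> str:
    """Stand-in for a real Fireworks call so the sketch runs offline."""
    return "stub completion"


log_payload = build_log_payload(
    "accounts/fireworks/models/llama-v3p1-8b-instruct",
    "Summarize the benefits of renewable energy",
    fake_model,
    "user_202",
)
print(log_payload["generation_time"])
```

The resulting dict can be passed as the `json` argument of the `requests.post` call shown above, after swapping `fake_model` for your actual Fireworks client call.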
Get Started with Logging
Learn how to set up comprehensive logging for all your LLM requests