Qwen2.5 32B is available on the Respan AI Gateway for production LLM workloads. Supports tool use. Up to 128K context window.
from openai import OpenAI client = OpenAI( base_url="https://api.respan.ai/api/", api_key="YOUR_RESPAN_API_KEY",) response = client.chat.completions.create( model="nebius/Qwen/Qwen2.5-32B-Instruct", messages=[{"role": "user", "content": "Hello!"}], extra_body={ "fallback_models": ["Qwen/Qwen2.5-32B-Instruct"], },)print(response.choices[0].message.content)from openai import OpenAI client = OpenAI( base_url="https://api.respan.ai/api/", api_key="YOUR_RESPAN_API_KEY",) response = client.chat.completions.create( model="nebius/Qwen/Qwen2.5-32B-Instruct", messages=[{"role": "user", "content": "Hello!"}], extra_body={ "fallback_models": ["Qwen/Qwen2.5-32B-Instruct"], },)print(response.choices[0].message.content)This model can be reached through several gateway routes. Copy a provider prefix from the table to target a specific host in your app. Read the gateway docs for routing and failover.
Other models from the same provider available through the gateway.
ISO 27001
Respan is fully compliant with ISO 27001, the internationally recognized standard for information security management.
SOC 2
We meet SOC 2 requirements to ensure secure and compliant management of data across all our systems.
GDPR
With operations designed for global compliance, we operate under GDPR - the world's strictest standard for data privacy.
HIPAA
Respan is HIPAA compliant with a Business Associate Agreement available for healthcare organizations.