Compare Cohere and Microsoft side by side. Both are tools in the Foundation Models category.
Updated March 10, 2026
Choose Cohere if enterprise-grade security and privacy features.
Choose Microsoft if exceptional performance-to-size ratio—2.7B Phi-2 outperforms 13B models.
Want to compare Cohere and Microsoft on your own traffic?
Respan lets you trace LLM and agent calls across any model or framework, A/B test prompts on production traffic, and route requests across 250+ models through one gateway. Free tier covers 10K traces per month. Setup in 5 minutes, no credit card.
| Category | Foundation Models | Foundation Models |
| Pricing | Usage-based | open-source |
| Best For | Enterprises building RAG-powered search and knowledge applications | Developers needing efficient local AI models |
| Website | cohere.com | azure.microsoft.com |
| Key Features |
|
|
| Use Cases |
| — |
Cohere is an enterprise AI company founded in 2019 in Toronto by Aidan Gomez, Ivan Zhang, and Nick Frosst, all University of Toronto alumni. Headquartered in Toronto and San Francisco with offices in Montreal, New York, London, Paris, and Seoul, Cohere develops secure and private AI technology for real-world business challenges. The company offers multiple model types including Command for text generation (Command R+ at USD 2.50/USD 10 per 1M tokens), Embed v3 for embeddings at USD 0.10 per 1M tokens, and Rerank v3 at USD 2 per 1,000 searches. Cohere also provides multilingual Aya Expanse models. The platform offers a Trial API key for free testing and production keys charged on pay-as-you-go basis, with billing issued monthly or upon reaching USD 250 in outstanding balances. Known for enterprise-grade security and strong multilingual capabilities, Cohere serves businesses requiring private, scalable AI solutions.
Microsoft Phi is a family of small language models designed for resource efficiency without compromising performance. Starting with Phi-2 (2.7B parameters) that surpassed Mistral and Llama-2 models at 7B-13B parameters, the Phi family now includes Phi-4, Phi-4-multimodal (text, audio, vision), and Phi-4-mini. Phi-4 costs USD 0.13 per 1M input tokens and USD 0.50 per 1M output tokens on Azure, with a blended rate of USD 0.22 per 1M tokens. The models excel at math and reasoning tasks, with Phi-4 outperforming comparable and larger models through high-quality synthetic datasets and post-training innovations. Phi models are particularly effective for resource-constrained environments, on-device inference, latency-sensitive scenarios, and cost-constrained use cases. Available through Azure AI Foundry with pay-as-you-go and provisioned throughput options, Phi models provide a 200,000-word vocabulary in 20+ languages. While impressive for their size, limitations include primary English design, reduced factual knowledge capacity, code generation primarily in Python, and tendency for textbook-like verbose responses.
Companies that train and release their own large language models and foundation models. These organizations invest in large-scale model training, publish research, and offer API access to their proprietary models.
Browse all Foundation Modelstools →One platform for routing, observability, tracing, and evals across every LLM provider.