Companies that train and release their own large language models and foundation models. These organizations invest in large-scale model training, publish research, and offer API access to their proprietary models.
22 tools compared · Layer 4 · Updated March 27, 2026
Ranked by community traction, recent activity, and breadth of capabilities. Tap any tool for full pros, cons, pricing, and alternatives.
OpenAI is the creator of ChatGPT and the GPT series of large language models, having pioneered the commercial LLM API market. Founded in December 2015 by Sam Altman, Elon Musk, Greg Brockman, Ilya Sutskever, and others as a non-profit, it transitioned to a capped-profit model in 2019 and is currently restructuring as a for-profit public benefit corporation.
+Broadest range of AI models under one API (text, reasoning, image, audio)
Anthropic is an AI safety and research company that builds the Claude family of large language models. Founded in 2021 by Dario Amodei (CEO) and Daniela Amodei (President), along with five other former OpenAI employees, the company is structured as a Public Benefit Corporation with a Long-Term Benefit Trust to prioritize societal benefit.
+Industry-leading instruction-following and reasoning capabilities
Google AI develops the Gemini family of multimodal models, capable of processing text, images, audio, and video in a single model. The division traces its roots to Google Brain (founded 2011), which merged with DeepMind (acquired by Google in 2014) in April 2023 to form Google DeepMind under CEO Demis Hassabis.
+Most generous free tier with unlimited access to Flash models
Meta AI develops the Llama series of open-weight large language models, which have become the foundation for a large portion of the open-source AI ecosystem. The AI division, formerly known as Facebook AI Research (FAIR), was founded in 2013 by Mark Zuckerberg and Yann LeCun.
+Completely free open-weight models for commercial use
Mistral AI is a French artificial intelligence company founded in April 2023 by Arthur Mensch, Guillaume Lample, and Timothée Lacroix, three leading AI researchers. Based in Paris at 15 rue des Halles, Mistral builds high-performance open-weight language models known for exceptional efficiency and strong multilingual capabilities. The company offers models ranging from small (7B parameters) to large (Mixtral 8x22B), featuring innovative mixture-of-experts architecture that enables faster inference at lower cost compared to traditional models. Mistral serves developers through its API platform La Plateforme and supports self-hosting, making it particularly popular with European enterprises seeking data sovereignty and teams optimizing for inference costs. With 251-500 employees, Mistral raised $2 billion in September 2025 at a $14 billion valuation, with ASML holding an 11% stake. The company's focus on open-weight models and competitive performance-to-cost ratios has positioned it as a leading alternative to closed-source models from OpenAI and Anthropic.
+Exceptional performance-to-cost ratio with mixture-of-experts architecture
State-of-the-art embedding models and rerankers for search, retrieval, and RAG. Acquired by MongoDB for $220M.
Cohere provides enterprise-focused language models optimized for business applications including search, classification, and retrieval-augmented generation (RAG). Their Command, Embed, and Rerank models are designed for production workloads with features like fine-tuning, private deployments, and multi-language support across 100+ languages. Cohere differentiates with its focus on enterprise security, compliance, and the ability to deploy models on any cloud or on-premises infrastructure.
xAI, founded by Elon Musk, builds the Grok series of large language models. Grok is integrated into the X (formerly Twitter) platform and is available through an API for developers. Known for its real-time information access and conversational style, Grok competes with frontier models on reasoning and coding benchmarks. xAI operates one of the world's largest GPU training clusters.
DeepSeek is a Chinese AI research lab that develops high-performance open-source language models. Their DeepSeek-V2 and DeepSeek-Coder models have achieved state-of-the-art results on coding and reasoning benchmarks while using innovative architectures for cost-efficient training and inference. DeepSeek models are freely available for commercial use and are popular with developers seeking powerful open alternatives to proprietary models.
Moonshot AI is a Chinese AI company that builds the Kimi series of large language models. Kimi supports ultra-long context windows (up to 2M tokens), multimodal capabilities, and advanced reasoning. With over 20M monthly active users, it is one of the most widely used AI assistants in China.
Alibaba's Qwen (Tongyi Qianwen) is a family of open-source large language models that excel in multilingual tasks, particularly Chinese and English. Qwen models range from 0.5B to 72B+ parameters and include specialized variants for coding (Qwen-Coder), math, and multimodal understanding. Available through Alibaba Cloud and open-source channels, Qwen is one of the most capable model families for Asian language applications.
Stability AI is the company behind Stable Diffusion, the most widely used open-source image generation model. Beyond images, Stability develops language models (StableLM), video generation (Stable Video), audio models, and 3D generation tools. Their open-source approach has spawned a massive ecosystem of fine-tuned models, tools, and applications built by the community.
Guide Labs is building the first inherently interpretable LLMs. Their open-source Steerling-8B model features a novel concept layer inserted into the transformer architecture that makes every generated token traceable back to its training data. Unlike post-hoc explainability tools, Guide Labs bakes interpretability directly into the model, achieving 90% of standard model capability with less training data. YC-backed with $9M seed.
Distills proprietary large models into smaller, deployable models — making proprietary intelligence portable and cost-effective.