Based on 4 capability dimensions
6 versions
Anthropic introduced Claude Sonnet 4.6, described as delivering frontier performance across coding, agents, and professional work at scale. This represents a new version in the Sonnet model line.
Extended thinking mode introduced. Biggest leap in coding and reasoning. Computer use improved.
Outperforms Claude 3 Opus at twice the speed. Sets new benchmark for coding tasks.
Most capable Claude model at launch. Beats GPT-4 on most benchmarks. Three tier family introduced.
Major upgrade. Context window expanded to 100K tokens. Improved reasoning and safer responses.
5 versions
Mistral introduced Mistral 3, described as a family of frontier open-source multimodal models. This represents a new generation of models with multimodal capabilities. It is positioned as a significant step forward in both openness and frontier-level performance.
Significantly improved coding and reasoning. 128K context window. Better instruction following and reduced hallucinations.
Flagship commercial model. Top tier reasoning capabilities. Native function calling. Available via API and Azure.
Mixture of Experts architecture. Matches GPT-3.5 performance at fraction of the cost. Multilingual support across 5 languages.
First Mistral model released. Outperformed Llama 2 13B on all benchmarks despite being smaller. Apache 2.0 license for commercial use.