Based on 6 capability dimensions
4 versions
Reasoning model that thinks before answering. Excels at complex math, science and coding problems.
Omni model. Faster and cheaper than GPT-4. Real-time audio and vision. Emotional voice responses.
Massive capability jump. Multimodal inputs introduced. Passed bar exam in top 10 percentile.
Initial ChatGPT launch. Shocked the world with conversational AI. Reached 1 million users in 5 days.
6 versions
Gemini 3.1 Flash-Lite is introduced as the fastest and most cost-efficient model in the Gemini 3 series. It is positioned as an optimized variant prioritizing speed and cost efficiency over other capabilities.
The Gemini app now features Lyria 3, described as Google DeepMind's most advanced music generation model. Users can create 30-second music tracks using text or image prompts directly within the Gemini app.
Next generation model. Faster, more efficient. Native tool use and agentic capabilities built in.
Breakthrough 1 million token context window. Can process entire codebases, books and long videos.
Rebranded from Bard to Gemini. Three sizes: Ultra, Pro and Nano. First natively multimodal model.