Core Judgment
With Google I/O just days away, leaks about Gemini’s product line are pouring in. The core signal: Google is no longer satisfied with being “a better chatbot”; it wants to build full-scenario AI infrastructure covering text, video, and vision.
The leaks involve three product lines, each targeting a different market position. This is not a single model upgrade but a strategic restructuring of the product matrix.
Three Leaked Product Lines
1. Gemini “Omni” Multimodal Model
| Attribute | Information |
|---|---|
| Positioning | Ultra-deep multimodal understanding and generation |
| Key Capability | Video generation quality reportedly beyond the current Veo, with bidirectional video understanding + generation |
| Current Status | Internal testing, UI already shows “powered by Omni” label |
| Release Window | During or shortly after Google I/O |
Omni’s core value lies in unifying understanding and generation. Many current AI models are one-directional: they can understand video but not generate it, or generate it but not understand it. If Omni achieves bidirectional capability, it could be among the first truly “omni-modal” models.
Leaked UI screenshots show a “powered by Omni” label, indicating Google plans to integrate it as an underlying engine across multiple products, rather than as a standalone chat interface.
2. Gemini 3.5 Flash
| Attribute | Information |
|---|---|
| Positioning | High-speed, low-cost everyday reasoning model |
| Current Status | Already in internal testing |
| Expected Release | Google I/O |
| Competitor Target | GPT-4o mini, Claude Haiku |
Gemini 3.5 Flash continues Google’s “Flash” series positioning: it aims for the fastest response and the lowest cost rather than the strongest intelligence. For enterprises deploying AI at scale, this is likely the most pragmatic choice.
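To make the cost argument concrete, here is a minimal sketch of how a team might estimate monthly spend for a batch workload once official pricing is published. The function name and all parameters are illustrative assumptions; no Gemini 3.5 Flash prices have been announced, so the prices must be supplied by the caller.

```python
def monthly_cost_usd(
    requests_per_day: int,
    input_tokens: int,
    output_tokens: int,
    price_in_per_million: float,
    price_out_per_million: float,
    days: int = 30,
) -> float:
    """Estimate monthly API spend for a fixed-shape workload.

    Prices are USD per one million tokens and are caller-supplied
    placeholders, not published Gemini prices.
    """
    # Cost of one request: input and output tokens are billed separately.
    per_request = (
        input_tokens * price_in_per_million
        + output_tokens * price_out_per_million
    ) / 1_000_000
    return requests_per_day * days * per_request


if __name__ == "__main__":
    # Hypothetical prices, chosen only to show the arithmetic.
    estimate = monthly_cost_usd(100_000, 1_000, 500, 0.10, 0.40)
    print(f"Estimated monthly cost: ${estimate:,.2f}")
```

With the placeholder prices above, 100,000 requests per day at 1,000 input and 500 output tokens each come to roughly 900 USD per month. Swapping in real per-token prices for each candidate model gives a quick side-by-side comparison.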
3. “spark Robin” Vision Model
| Attribute | Information |
|---|---|
| Positioning | New model focused on image/vision understanding |
| Current Status | Leaked only; details are limited |
| Potential Use | Google Lens upgrade, Photos smart search, Android system-level vision |
The “spark Robin” naming suggests it may belong to a “Spark” model series (Muse Spark is reportedly the flagship of this series). If this is a standalone vision model, Google may integrate it into Android to provide system-level AI vision capability.
Google I/O 2026 Potential Full Picture
| Product | Positioning | Target Users |
|---|---|---|
| Gemini 3.5 Flash | High-speed low-cost reasoning | Developers, enterprise batch deployment |
| Gemini Omni | Full-modal understanding + generation | Premium users, creative industries |
| spark Robin | Vision-specific model | Mobile devices, system integration |
| Daily Brief | AI daily briefing | Individual users |
| Cosmo AI (Nano) | On-device AI application | Mobile devices |
| AI Avatars | Digital humans | Social, customer service scenarios |
Competitor Landscape Comparison
| Company | Multimodal Strategy | Current Strongest |
|---|---|---|
| Google | Omni unifies understanding + generation | Gemini 2.5 Pro |
| OpenAI | GPT-4o multimodal + Sora video | GPT-5.5 |
| Anthropic | Claude native multimodal | Claude 5 (Mythos) |
| Alibaba | Qwen-VL + Tongyi Wanxiang | Qwen3.6-Max |
Omni itself most resembles OpenAI’s GPT-4o: both pursue a single model that handles all modalities. At the product-line level, however, Google has chosen a multi-model matrix rather than OpenAI’s “one super-large model” approach: Flash for volume, Omni for heavy lifting, Robin for specialization. The advantage of this strategy is flexibility and cost control; the disadvantage is that ecosystem integration becomes harder.
Action Recommendations
For Developers
- Watch Gemini 3.5 Flash API pricing: If it continues the Flash series’ low-price strategy, it could be the best choice for batch deployment
- Evaluate Omni’s video capabilities: If Veo-level video generation is available via API, it will dramatically lower the barrier for video content production
- Prepare multi-model routing: Google’s multi-model matrix means you’ll need smart routing strategies to choose the right model
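The routing idea in the last point can be sketched as a small rule-based dispatcher. This is a minimal sketch under stated assumptions: the tier names mirror the leaked product lines, the `Request` fields and the `route` rules are hypothetical, and no real API model identifiers are used because none have been published.

```python
from dataclasses import dataclass
from enum import Enum


class Tier(Enum):
    """Hypothetical tiers named after the leaked product lines."""
    FLASH = "flash"    # high-volume, low-cost text requests
    OMNI = "omni"      # video understanding or generation
    VISION = "vision"  # image-only understanding


@dataclass(frozen=True)
class Request:
    prompt: str
    has_image: bool = False
    has_video: bool = False
    wants_video_output: bool = False


def route(req: Request) -> Tier:
    """Pick the cheapest tier that covers the request's modalities."""
    # Anything involving video goes to the heaviest tier.
    if req.has_video or req.wants_video_output:
        return Tier.OMNI
    # Image-only input goes to the specialized vision tier.
    if req.has_image:
        return Tier.VISION
    # Plain text defaults to the low-cost tier.
    return Tier.FLASH


if __name__ == "__main__":
    print(route(Request("Summarize this report")))
    print(route(Request("What is in this photo?", has_image=True)))
    print(route(Request("Make a 10-second clip", wants_video_output=True)))
```

Keeping the rules in one pure function makes it easy to swap in real model identifiers, or add cost and latency thresholds, once the actual lineup and pricing are confirmed.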
For Enterprises
- Google ecosystem users get priority: Companies already using Google Workspace will likely be the first to experience Gemini’s deep integration
- Video content producers: Omni’s video generation capability may change video content production workflows
- Mobile developers: The on-device capabilities of Cosmo AI (Nano) are worth watching, especially for privacy-sensitive scenarios
Risk Reminder
All current information comes from leaks and is not officially confirmed. What Google actually releases at I/O may differ from the leaked information. Historically, several pre-I/O leaks about Google products never materialized. Await official announcements.