Google I/O Preview Leaks: Gemini "Omni" Multimodal Model + 3.5 Flash + New Vision Model, Triple Release Warmup

Core Judgment

With Google I/O just days away, leak information about Gemini’s product line is pouring in. The core signal: Google is no longer satisfied with being “a better chatbot” — it wants to build full-scenario AI infrastructure covering text, video, and vision.

The leaks involve three product lines, each targeting a different market position — this is not a single model upgrade, but a strategic-level product matrix restructuring.

Three Leaked Product Lines

1. Gemini “Omni” Multimodal Model

Attribute	Information
Positioning	Ultra-deep multimodal understanding and generation
Key Capability	Beyond current Veo’s video generation quality, supporting bidirectional video understanding + generation
Current Status	Internal testing, UI already shows “powered by Omni” label
Release Window	During or shortly after Google I/O

Omni’s core value lies in unifying understanding and generation. Current AI models are typically one-way — they can understand video but not generate it, or generate but not understand. If Omni achieves bidirectional capability, it will become the first truly “omni-modal” model.

Leaked UI screenshots show a “powered by Omni” label, indicating Google plans to integrate it as an underlying engine across multiple products, rather than as a standalone chat interface.

2. Gemini 3.5 Flash

Attribute	Information
Positioning	High-speed, low-cost everyday reasoning model
Current Status	Already in internal testing
Expected Release	Google I/O
Competitor Target	GPT-4o mini, Claude Haiku

3.5 Flash continues Google’s “Flash” series positioning — not pursuing the strongest intelligence, but pursuing the fastest response speed and lowest cost. For enterprise users needing large-scale AI deployment, this is the most pragmatic choice.

3. “spark Robin” Vision Model

Attribute	Information
Positioning	New model focused on image/vision understanding
Current Status	Leaked stage, limited details
Potential Use	Google Lens upgrade, Photos smart search, Android system-level vision

The “spark Robin” naming suggests it belongs to Google’s “Spark” model series (Muse Spark is the flagship of this series). If this is a standalone vision model, Google may integrate it into the Android system, achieving system-level AI vision capability.

Google I/O 2026 Potential Full Picture

Product	Positioning	Target Users
Gemini 3.5 Flash	High-speed low-cost reasoning	Developers, enterprise batch deployment
Gemini Omni	Full-modal understanding + generation	Premium users, creative industries
spark Robin	Vision-specific model	Mobile devices, system integration
Daily Brief	AI daily briefing	Individual users
Cosmo AI (Nano)	On-device AI application	Mobile devices
AI Avatars	Digital humans	Social, customer service scenarios

Competitor Landscape Comparison

Company	Multimodal Strategy	Current Strongest
Google	Omni unifies understanding + generation	Gemini 2.5 Pro
OpenAI	GPT-4o multimodal + Veo video	GPT-5.5
Anthropic	Claude native multimodal	Claude 5 (Mythos)
Alibaba	Qwen-VL + Tongyi Wanxiang	Qwen3.6-Max

Google’s Omni strategy is most similar to OpenAI’s GPT-4o — both pursuing a single model that handles all modalities. But unlike OpenAI’s “one super-large model” approach, Google has chosen a multi-model matrix strategy: Flash for volume, Omni for heavy lifting, Robin for specialization. This strategy’s advantage is flexibility and cost control; the disadvantage is higher ecosystem integration difficulty.

Action Recommendations

For Developers

Watch Flash 3.5 API pricing: If it continues the Flash series’ low-price strategy, it could be the best choice for batch deployment
Evaluate Omni’s video capabilities: If Veo-level video generation is available via API, it will dramatically lower the barrier for video content production
Prepare multi-model routing: Google’s multi-model matrix means you’ll need smart routing strategies to choose the right model

For Enterprises

Google ecosystem users get priority: Companies already using Google Workspace will experience Gemini’s deep integration first
Video content producers: Omni’s video generation capability may change video content production workflows
Mobile developers: Cosmo AI (Nano)‘s on-device capabilities are worth watching, especially for privacy-sensitive scenarios

Risk Reminder

All current information comes from leaks and is not officially confirmed. Google I/O’s actual release content may differ from leaked information. Historically, Google has also leaked multiple pieces of information before I/O that never materialized. Await official announcements.