OpenClaw 4.24 Update: DeepSeek V4 Flash as Default Model, Real-Time Voice Fully Enabled

OpenClaw 4.24 Update: DeepSeek V4 Flash as Default Model, Real-Time Voice Fully Enabled

In late April, OpenClaw (NousResearch’s open-source AI Agent platform, ~105k GitHub Stars) released its 2026.4.24 version update. Three changes combined signal a redefinition of the Agent platform competitive landscape.

What Changed

1. DeepSeek V4 Flash as Default Model

New OpenClaw installations now default to DeepSeek V4 Flash instead of Claude Sonnet. V4 Pro is also available in the built-in model directory. This means new users can start using Agent capabilities at near-zero cost — community testing confirms V4 Flash’s cost-effectiveness makes it “fully sufficient” for daily Agent tasks.

2. Real-Time Voice Fully打通

The largest upgrade in 4.24: Talk, Voice Call, and Google Meet now support realtime voice loops with full OpenClaw Agent invocation during calls. AI no longer just “listens then speaks” — it can think and operate tools while listening.

Voice calls now include full Agent mid-call access: the Agent can query data, browse the web, or call external APIs without hanging up or switching interfaces.

3. Browser Automation: Pixel-Level Clicking

OpenClaw now supports viewport-coordinate-based clicking, enabling precise coordinate interaction on web pages. This covers scenarios where DOM element targeting falls short (Canvas, custom UI components).

4. WhatsApp Voice Message Transcription

The new version supports WhatsApp voice message transcription, expanding OpenClaw’s messaging integration capabilities.

Comparison

CapabilityBefore 4.24After 4.24
Default modelClaude SonnetDeepSeek V4 Flash
Real-time voiceLimited supportAll channels (Talk/Voice Call/Meet)
Agent in voice callsNot supportedReal-time mid-call access
Browser clickingDOM element targetingNew viewport pixel-level clicking
WhatsApp voiceNot supportedTranscription supported

Quick Start

If you already have OpenClaw installed:

# Update to latest version
openclaw update

# Check current version
openclaw --version  # Should show 2026.4.24

New users can install directly — DeepSeek V4 Flash is pre-configured as the default:

# Fresh install
openclaw install

Observations

Setting DeepSeek V4 Flash as the default model is a notable strategy: it dramatically lowers entry costs while reducing dependency on a single model provider. However, V4 Flash’s stability in complex Agent workflows still needs more real-world testing.

The integration of real-time voice with full Agent capabilities moves “natural language dialogue to control Agent workflows” from concept to usable. Whether this becomes mainstream in Agent interaction paradigms depends on further optimization of response latency and accuracy.

Primary Sources