April 24, 2026

V4 Preview: million-token context, dancing with hardware

DeepSeek released V4 Preview in two variants: Pro, with 1.6T total parameters and 49B active, and Flash, with 284B total and 13B active. The context window grows to 1 million tokens, built on a Hybrid Attention architecture. V4-Pro is priced at $3.48 per million output tokens, against OpenAI's $30, which widens the value gap further. Another signal: the models are natively compatible with Huawei's Ascend 950, and all weights are open. This is not just growth in model size; it is an ecosystem move. While the world still debates decoupling software from hardware, DeepSeek has quietly opened a parallel pipeline.
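For a rough sense of scale, the figures quoted above can be turned into two ratios: the share of parameters active per token, and the output-price multiple. The sketch below is only arithmetic on the numbers as stated in this note; the helper names are illustrative, and the figures themselves are not independently verified here.

```python
# Figures copied from the text above (billions of parameters, USD per
# million output tokens). They are the article's numbers, not verified data.
MODELS = {
    "V4-Pro": {"total_b": 1600.0, "active_b": 49.0},
    "V4-Flash": {"total_b": 284.0, "active_b": 13.0},
}
V4_PRO_OUTPUT_PRICE = 3.48   # USD per million output tokens, as quoted
OPENAI_OUTPUT_PRICE = 30.0   # USD per million output tokens, as quoted


def active_fraction(total_b: float, active_b: float) -> float:
    """Share of total parameters that are active per token."""
    return active_b / total_b


def price_ratio(reference: float, candidate: float) -> float:
    """How many times more expensive the reference price is."""
    return reference / candidate


for name, size in MODELS.items():
    share = active_fraction(size["total_b"], size["active_b"])
    print(f"{name}: {share:.1%} of parameters active per token")

multiple = price_ratio(OPENAI_OUTPUT_PRICE, V4_PRO_OUTPUT_PRICE)
print(f"Quoted OpenAI output price is {multiple:.1f}x the V4-Pro price")
```

On these numbers, both variants activate only a few percent of their weights per token, and the quoted price gap is a bit under an order of magnitude.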
