Track DeepSeek V4's pricing against incumbent frontier models: at $0.14 per million input tokens for Flash and $1.74 per million for Pro, it sets a new low-cost reference point that could pressure pricing across the API market.
Developers running local LLMs can now access a model that claims flagship-level agentic coding performance in a 16.8GB quantized package, runnable on consumer hardware via `llama.cpp`.
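
To make the pricing comparison concrete, here is a minimal sketch that turns the quoted per-million rates into a cost estimate for a given token volume. It assumes the $1.74 Pro figure is also an input-token price. The tier keys, the `input_cost_usd` helper, and the 500M-token workload are hypothetical names and numbers chosen for illustration. Output-token pricing is not covered.

```python
from decimal import Decimal

# Input prices in USD per million tokens, as quoted in the text.
# Assumption: the Pro figure is an input price, like the Flash figure.
INPUT_PRICE_PER_M = {
    "flash": Decimal("0.14"),
    "pro": Decimal("1.74"),
}


def input_cost_usd(tier: str, input_tokens: int) -> Decimal:
    """Return the input-token cost in USD for a tier and a token count."""
    return INPUT_PRICE_PER_M[tier] * Decimal(input_tokens) / Decimal(1_000_000)


if __name__ == "__main__":
    monthly_tokens = 500_000_000  # hypothetical monthly input volume
    for tier in INPUT_PRICE_PER_M:
        print(f"{tier}: ${input_cost_usd(tier, monthly_tokens):.2f}")
```

Swapping an incumbent model's published rate into the same table gives a like-for-like figure for the same workload.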