DeepSeek-V4: Towards Highly Efficient Million-Token Context Intelligence
DeepSeek introduces its V4 series with two mixture-of-experts models supporting one million token context. New architectures include hybrid attention and optimizations that significantly improve efficiency in long-context scenarios.