November 29, 2023Breakthrough1 min read
A bilingual base model, plainly delivered
Just 27 days later, DeepSeek-LLM 7B and 67B arrived. This was the team's first complete reveal of a base model, covering both Chinese and English. Coder handled programming; LLM took on general understanding. Behind both: the Firefly cluster's nonstop hum. The world started noticing this “a-model-every-other-week” rhythm — no launch event, just an arXiv paper and downloadable weights on HuggingFace. That release style would become DeepSeek's signature: put the engineering facts on the table, let developers judge for themselves.
Related product
DeepSeek-LLM
Bilingual base model, minimalist release →
Sources