First shot in code: who said open source couldn't?
November 2, 2023: DeepSeek-Coder shipped in four sizes (1.3B–33B), fully open from day one. Trained on 2 trillion tokens, 87% code and 13% natural language, covering 80+ programming languages. The 33B variant beat CodeLlama-34B on multiple benchmarks — less than four months after the company spun out.
DeepSeek-Coder shipped — four sizes from 1.3B to 33B, all open-sourced from day one. The models were trained on 2 trillion tokens, mixing 87% code and 13% natural language, covering more than 80 programming languages. The 33B variant beat the then-prominent CodeLlama-34B on multiple benchmarks. What shook developers more: this wasn't the side-project of a tech giant. It came from a team that had spun out four months earlier, built entirely for writing code. GitHub stars poured in overnight, HuggingFace rankings climbed, and programmers truly felt for the first time that the labor of love behind open-source code models could come from a company with no PR roadshow.
Related product
DeepSeek-Coder
Built for code, open from day one →
Sources