DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence

Authors
Daya Guo, Qihao Zhu, Dejian Yang, Zhenda Xie, Kai Dong, Wentao Zhang, Guanting Chen, Xiao Bi, Y. Wu, Y. K. Li, Fuli Luo, Yingfei Xiong, Wenfeng Liang

ABSTRACT

The rapid development of large language models has revolutionized code intelligence in software development. However, the predominance of closed-source models has restricted extensive research and development. To address this, we introduce the DeepSeek-Coder series, a range of open-source code models with sizes from 1.3B to 33B, trained from scratch on 2 trillion tokens. These models are pre-trained on a high-quality project-level code corpus and employ a fill-in-the-blank task with a 16K window to enhance code generation and infilling. Our extensive evaluations demonstrate that DeepSeek-Coder not only achieves state-of-the-art performance among open-source code models across multiple benchmarks but also surpasses existing closed-source models like Codex and GPT-3.5. Furthermore, DeepSeek-Coder models are under a permissive license that allows for both research and unrestricted commercial use.
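
To make the fill-in-the-blank objective mentioned in the abstract more concrete, the following is a minimal sketch of how one such training example could be built from a source file. It is an illustration under stated assumptions, not the authors' pipeline: the sentinel strings, the prefix-suffix-middle ordering, the character-level random split, and the helper names `make_fim_example` and `reassemble` are all hypothetical choices, since the abstract does not specify them.

```python
import random

# Hypothetical sentinel strings, chosen for illustration only. The abstract
# does not state which special tokens DeepSeek-Coder actually uses.
FIM_PREFIX = "<fim_prefix>"
FIM_SUFFIX = "<fim_suffix>"
FIM_MIDDLE = "<fim_middle>"


def make_fim_example(code: str, rng: random.Random) -> tuple[str, str]:
    """Cut `code` into prefix/middle/suffix and return (prompt, target).

    The prompt shows the prefix and suffix; the target is the hidden middle
    span that the model would be trained to generate.
    """
    if not code:
        raise ValueError("code must be non-empty")
    # Two distinct cut points, sorted so that start < end.
    start, end = sorted(rng.sample(range(len(code) + 1), 2))
    prefix, middle, suffix = code[:start], code[start:end], code[end:]
    # Assumed ordering: prefix, then suffix, then the slot for the middle.
    prompt = f"{FIM_PREFIX}{prefix}{FIM_SUFFIX}{suffix}{FIM_MIDDLE}"
    return prompt, middle


def reassemble(prompt: str, target: str) -> str:
    """Rebuild the original file from a prompt and its target span.

    Assumes the source code itself does not contain the sentinel strings.
    """
    body = prompt[len(FIM_PREFIX):-len(FIM_MIDDLE)]
    prefix, suffix = body.split(FIM_SUFFIX, 1)
    return prefix + target + suffix


if __name__ == "__main__":
    rng = random.Random(0)
    source = "def add(a, b):\n    return a + b\n"
    prompt, target = make_fim_example(source, rng)
    print(repr(prompt))
    print(repr(target))
```

A real pipeline would more likely split on token boundaries and cap each example at the context window (16K per the abstract); this sketch works on characters only to keep the idea visible.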
