What is StarCoder?

StarCoder and StarCoderBase are sophisticated Large Language Models crafted for coding tasks, built from freely available data sourced from GitHub, which includes an extensive array of over 80 programming languages, along with Git commits, GitHub issues, and Jupyter notebooks. Similarly to LLaMA, these models were developed with around 15 billion parameters trained on an astonishing 1 trillion tokens. Additionally, StarCoderBase was specifically optimized with 35 billion Python tokens, culminating in the evolution of what we now recognize as StarCoder.

Our assessments revealed that StarCoderBase outperforms other open-source Code LLMs when evaluated against well-known programming benchmarks, matching or even exceeding the performance of proprietary models like OpenAI's code-cushman-001 and the original Codex, which was instrumental in the early development of GitHub Copilot. With a remarkable context length surpassing 8,000 tokens, the StarCoder models can manage more data than any other open LLM available, thus unlocking a plethora of possibilities for innovative applications. This adaptability is further showcased by our ability to engage with the StarCoder models through a series of interactive dialogues, effectively transforming them into versatile technical aides capable of assisting with a wide range of programming challenges. Furthermore, this interactive capability enhances user experience, making it easier for developers to obtain immediate support and insights on complex coding issues.

Pricing

Price Starts At:
Free
Free Version:
Free Version available.

Screenshots and Video

StarCoder Screenshot 1

Company Facts

Company Name:
BigCode
Date Founded:
2023
Company Website:
huggingface.co/blog/starcoder

Product Details

Deployment
Windows
Mac
Linux
On-Prem
Training Options
Documentation Hub

Product Details

Target Company Sizes
Individual
1-10
11-50
51-200
201-500
501-1000
1001-5000
5001-10000
10001+
Target Organization Types
Mid Size Business
Small Business
Enterprise
Freelance
Nonprofit
Government
Startup
Supported Languages
English

StarCoder Categories and Features