Bookmark: StarCoder: A State-of-the-Art LLM for Code

lqdev👽06/01/2023

StarCoder and StarCoderBase are Large Language Models for Code (Code LLMs) trained on permissively licensed data from GitHub, including from 80+ programming languages, Git commits, GitHub issues, and Jupyter notebooks. Similar to LLaMA, we trained a ~15B parameter model for 1 trillion tokens. We fine-tuned StarCoderBase model for 35B Python tokens, resulting in a new model that we call StarCoder.

Permalink: /feed/hf-starcoder-llm/

Tags: #ai #code

Back to feed

Send me a message or webmention