Introducing: StarCoder2 and The Stack v2 ⭐️
StarCoder2 is trained with a 16k token context and repo-level information for 4T+ tokens. All built on The Stack v2 - the largest code dataset with 900B+ tokens.
All code, data and models are fully open!
hf.co/bigcode/starco…