Post by @leading • Hey

Language distribution of the Llama 2 pretraining data, in percent. About 90% is English, so performance is naturally best with English prompts.
