chatbotEnglish
Meta AI has unveiled Llama 3, the next generation of its open-source large language models (LLMs), setting a new benchmark for performance and capabilities in the field of artificial intelligence.
Meta's internal evaluations have shown that Llama 3 outperforms competing models across various benchmarks:
MMLU (undergraduate-level knowledge):
GPQA (graduate-level questions):
HumanEval (coding):
GSM-8K (grade-school math):
Real-world scenarios (human evaluation):
One of the standout features of Llama 3 is its enhanced reasoning capabilities and improved ability to follow instructions. Meta attributes these improvements to advancements in pretraining and post-training procedures, which have:
Additionally, Meta claims that Llama 3 has demonstrated significant improvements in tasks such as reasoning and coding, thanks to the incorporation of preference rankings during the training process.
Llama 3 was trained on a massive dataset of over 15 trillion tokens, seven times larger than the one used for Llama 2. This dataset includes:
To ensure data quality, Meta developed a series of data-filtering pipelines, including:
Interestingly, Meta leveraged Llama 2's capabilities to identify high-quality data, using it to generate the training data for the text-quality classifiers that power Llama 3.
While the 8B and 70B models represent the initial release, Meta is currently training even larger models, exceeding 400 billion parameters. These models are expected to offer enhanced capabilities, including:
To train these massive models, Meta employed advanced techniques such as:
Meta also developed an advanced training stack that automates error detection, handling, and maintenance, ensuring efficient and reliable training processes.
Meta has adopted a system-level approach to ensure the safe and ethical use of Llama 3, introducing new trust and safety tools:
Meta has also updated its Responsible Use Guide, providing a comprehensive framework for developers to follow when working with Llama 3 models.
Llama 3 models will soon be available on various platforms, including:
Additionally, Meta has integrated Llama 3 technology into its virtual assistant, Meta AI, now available across Meta's platforms, including Facebook, Instagram, WhatsApp, Messenger, and the web.
The release of Llama 3 represents a significant milestone in the field of artificial intelligence, showcasing Meta's commitment to pushing the boundaries of open-source LLMs. With its impressive performance, enhanced reasoning capabilities, and emphasis on responsible development, Llama 3 is poised to shape the future of AI applications and drive innovation across various industries.