Meta AI: Understanding Llama 3 and Its Significance Indul Hassan, April 20, 2024June 30, 2024 Meta Llama 3 Llama 3 is Meta’s latest series of open-source large language models (LLMs), representing a significant step in AI technology. This model family is Meta’s answer to OpenAI’s GPT and Google’s Gemini, but with a unique twist: it’s available for almost anyone to use for research and commercial purposes without cost. This accessibility is a major advancement. Over the past year, Llama 2, the previous model, has become integral to open-source AI projects. Llama 3 builds on this foundation. Here’s an in-depth look at what Llama 3 is and why it matters. What is Llama 3? Llama 3, similar to GPT-4 and Google Gemini, is a series of LLMs. It follows Llama 2, Meta’s earlier AI model generation. While there are technical distinctions between Llama and other LLMs, they are essentially built using the same transformer architecture and development principles such as pretraining and fine-tuning. When given a text prompt, Llama 3 predicts the most likely subsequent text using its neural network—a complex algorithm with billions of parameters, designed to mimic the human brain. By adjusting the weights of these parameters and incorporating some randomness, Llama 3 generates highly realistic responses. Meta has introduced four versions of Llama 3: Llama 3 8B Llama 3 8B-Instruct Llama 3 70B Llama 3 70B-Instruct The 8B models have 8 billion parameters, while the 70B models have 70 billion parameters. The instruct models are fine-tuned to better follow human instructions, making them more suitable for chatbot applications. A 400 billion parameter version of Llama 3 is also in development and expected to be available later this year. Additionally, Meta is working on a multimodal version to handle various inputs like images, handwritten text, video, and audio, as well as multilingual versions. How to Try Llama 3 Llama 3 powers Meta AI, the AI assistant integrated into Facebook, Messenger, Instagram, and WhatsApp. It can also be accessed via a new dedicated web app, Meta AI. For those outside the initial launch countries, the 70B-Instruct model can be demoed using HuggingChat on HuggingSpace. How Does Llama 3 Work? Llama 3’s neural network was trained on over 15 trillion tokens—seven times more data than its predecessor, Llama 2. Sources include Common Crawl, Wikipedia, and Project Gutenberg, among others. No Meta user data was used. The model’s tokenizer has a larger vocabulary, making it more efficient. To mitigate issues like bias and harmful content, Llama 3’s development included strategies like reinforcement learning with human feedback (RLHF). This involved human testers ranking responses to guide the model toward appropriate outputs. The instruct versions were fine-tuned with specific data to enhance their response to human instructions. Meta has also developed safety models, Llama Guard and Llama Code Shield, to prevent the generation of harmful content or insecure code. Llama vs. GPT, Gemini, and Other AI Models In a blog post, Meta’s researchers compared Llama 3’s performance on various benchmarks with equivalent open-source and closed-source models. In selected examples, Llama 3 models performed well, particularly in a head-to-head evaluation with models like GPT-3.5 and Mistral Medium. While Meta hasn’t yet compared Llama 3 to state-of-the-art models like GPT-4, it is anticipated that the forthcoming 400B model will offer strong competition. Why Llama Matters Most well-known LLMs, like OpenAI’s GPT-4 and Google’s Gemini, are proprietary and closed-source. Researchers and businesses can use their APIs but can’t access or modify the underlying models. Llama 3, however, is available for download, allowing for extensive customization and use. Meta plans to release a detailed research paper on Llama 3’s training, adding to its transparency. The model can also be deployed on cloud platforms like Microsoft Azure and Amazon Web Services, facilitating custom AI application development. By making Llama 3 open-source, Meta is enabling other companies to develop AI applications with greater control, adhering to acceptable use policies. This openness ensures a credible alternative to closed-source AI models, fostering innovation and competition in the AI landscape. Meta’s approach ensures that AI advancements remain accessible, encouraging widespread experimentation and development. With Llama 3, the future of AI looks more collaborative and open than ever. LLM aillm