UPDATED 13:05 EDT / JULY 23 2024

A ghostly, semi-transparent face limned in blue, overlaying a background of golden-orange and blue cyberpunk circuitry, generated by a generative AI

Meta introduces Llama 3.1, its biggest and best open-source AI model to date

Meta Platforms Inc. today unveiled its largest open-source artificial intelligence model to date, Llama 3.1 405B, which the company claims can rival even the most powerful closed models on the market, including those from OpenAI and Anthropic PBC.

According to Meta, Llama 3.1 excels at state-of-the-art capabilities such as general knowledge, math, tool use and multilingual translation. On the language front, the company added support for eight new languages, including French, German, Hindi, Italian, Portuguese and Spanish, with more on the way.

“Our experimental evaluation suggests that our flagship model is competitive with leading foundation models across a range of tasks, including GPT-4, GPT-4o, and Claude 3.5 Sonnet,” Meta’s research team said in a blog post. “Additionally, our smaller models are competitive with closed and open models that have a similar number of parameters.”

Llama 3.1 is an upgrade to the Llama 3 large language model the company released in April 2024, and its flagship version comes in a staggering 405 billion-parameter size. Llama 3, by contrast, is available only in 8 billion- and 70 billion-parameter versions. With this new ultra-large release, those two smaller models are also getting upgrades.

The new Llama 3.1 models have a 128,000-token context window, which is the size of the input users can feed the model before earlier text gets cut off. That is enough for the model to read most extremely large reports, medium-sized books, long transcripts and other large documents. It amounts to about 96,000 words, or roughly the length of a standalone novel of about 400 pages.
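For a rough sense of that scale, here is a back-of-the-envelope estimate in Python; the words-per-token and words-per-page figures are common heuristics assumed for illustration, not numbers from Meta.

# Back-of-the-envelope estimate of how much text fits in a 128,000-token window.
CONTEXT_TOKENS = 128_000
WORDS_PER_TOKEN = 0.75        # assumed heuristic for English text
WORDS_PER_NOVEL_PAGE = 250    # assumed typical paperback page

words = CONTEXT_TOKENS * WORDS_PER_TOKEN
pages = words / WORDS_PER_NOVEL_PAGE
print(f"~{words:,.0f} words, roughly {pages:.0f} novel pages")
# Prints: ~96,000 words, roughly 384 novel pages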

Meta said that the new context window and multilingual support will also come to the 8B and 70B models. That will allow them to remain easy to run in smaller footprints while still supporting advanced use cases such as long-form summarization, multilingual conversation, advanced reasoning and coding.

The company also said it’s changing its licensing so that developers may now use the outputs from Llama models, including its new 405B model, to “teach” smaller models. That will allow developers to use larger, smarter models to improve other models through training and fine-tuning.
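In outline, that workflow amounts to generating responses from a large “teacher” model and saving them as training data for a smaller “student.” The Python sketch below is a hypothetical illustration, not Meta’s tooling: the prompts, the query_teacher stub and the JSONL output format are all assumptions, and in practice the 405B teacher would typically be served through an inference provider rather than run locally.

import json

# Hypothetical prompts; in practice these would come from your own task data.
PROMPTS = [
    "Explain what a context window is in one short paragraph.",
    "Summarize the benefits of open-source AI models in two sentences.",
]


def query_teacher(prompt: str) -> str:
    # Placeholder for a call to a hosted Llama 3.1 405B endpoint. Swap this
    # stub for your inference provider's client; the function name and its
    # return value here are assumptions for illustration only.
    return f"[teacher response to: {prompt}]"


def build_distillation_set(prompts: list[str], path: str = "distill.jsonl") -> None:
    # Save (prompt, teacher response) pairs as JSONL, a common format for
    # supervised fine-tuning of a smaller model such as Llama 3.1 8B.
    with open(path, "w", encoding="utf-8") as f:
        for prompt in prompts:
            record = {"prompt": prompt, "response": query_teacher(prompt)}
            f.write(json.dumps(record) + "\n")


if __name__ == "__main__":
    build_distillation_set(PROMPTS)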

Meta Chief Executive Mark Zuckerberg said the release of Llama 3.1 represents the company’s commitment to open-source innovation.

“Today, Linux is the industry standard foundation for both cloud computing and the operating systems that run most mobile devices — and we all benefit from superior products because of it,” Zuckerberg said in a blog post. “I believe that AI will develop similarly. Today, several tech companies are developing leading closed models. But open source is quickly closing the gap.”

According to Zuckerberg, keeping the Llama models open source preserves each individual’s ability to use and train their own models without worrying that some organization could pull the rug out from under them. It allows them to “control their own destiny.” It also makes models more affordable and efficient in the long run.

Zuckerberg said that inference on Llama 3.1 405B can be run on developers’ own infrastructure at roughly 50% of the cost of large closed-source models such as OpenAI’s flagship model GPT-4o.

“The open-source nature of Llama 3.1 405B represents a significant step forward in democratizing access to AI technology,” Victor Botev, co-founder and chief technology officer of AI research assistant tool developer Iris.ai, told SiliconANGLE. “Meta is enabling researchers and developers worldwide to explore, innovate, and build upon state-of-the-art language AI without the barriers of proprietary APIs or expensive licensing fees. This approach emphasizes transparent development, fosters collaboration and accelerates progress in the field, potentially leading to breakthroughs that benefit society as a whole.”

Botev warned, however, that the model’s extremely large size could work against it. Prioritizing colossal model sizes in AI development comes with pitfalls, such as heavy computational resource and energy consumption needs, that can lead to both cost and environmental sustainability issues down the road.

“Innovations in model efficiency might benefit the AI community more than simply scaling up to larger sizes,” Botev said.

Everyday users can try out Meta’s new Llama 3.1 405B model right now using the company’s Meta AI app. However, it must be selected manually, and it’s currently in preview, which means users get only a certain number of queries each week before the app drops down to a lower-quality model (Llama 3.1 70B).

Image: Pixabay
