UPDATED 18:04 EST / MARCH 17 2024


Elon Musk’s xAI releases Grok-1 architecture, while Apple advances multimodal AI research

The Elon Musk-run artificial intelligence startup xAI Corp. today released the weights and architecture of its Grok-1 large language model as open-source code, shortly after Apple Inc. published a paper describing its own work on multimodal LLMs.

Musk first said on March 11 that xAI would release Grok as open source, and today’s release of the base model and weights, fundamental components of how the model works, marks the company’s first open-source release.

The release covers Grok’s network architecture, meaning the model’s structural design, including how layers and nodes are arranged and interconnected to process data. Base model weights are the parameters within a given model’s architecture that have been adjusted during training, encoding the learned information and determining how input data is transformed into output.
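To illustrate the distinction between architecture and weights described above, here is a minimal Python sketch. It is a toy two-layer network and has nothing to do with Grok-1's actual code: the function `forward` and the two weight sets `weights_a` and `weights_b` are hypothetical names chosen for illustration. The architecture (two inputs, two hidden units, one output) stays fixed, while swapping the weights changes what the same input is transformed into.

```python
def forward(x, weights):
    """Fixed toy architecture: 2 inputs -> 2 hidden units (ReLU) -> 1 output."""
    # Hidden layer: each row of weights["hidden"] feeds one hidden unit.
    hidden = [
        max(0.0, sum(w * v for w, v in zip(row, x)))
        for row in weights["hidden"]
    ]
    # Output layer: weighted sum of the hidden activations.
    return sum(w * h for w, h in zip(weights["output"], hidden))


# Two hypothetical sets of "trained" weights for the same architecture.
weights_a = {"hidden": [[1.0, 0.0], [0.0, 1.0]], "output": [1.0, 1.0]}
weights_b = {"hidden": [[1.0, -1.0], [-1.0, 1.0]], "output": [0.5, 2.0]}

x = [3.0, 1.0]
print(forward(x, weights_a))  # 4.0
print(forward(x, weights_b))  # 1.0
```

The code in `forward` plays the role of the architecture, and the dictionaries play the role of a checkpoint. Publishing both is what lets others reproduce a model's behavior.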

Grok-1 is a 314-billion-parameter “Mixture-of-Experts” model trained from scratch by xAI. A Mixture-of-Experts model is a machine learning approach that combines the outputs of multiple specialized sub-models, also known as experts, to make a final prediction, optimizing for diverse tasks or data subsets by leveraging the expertise of each individual model.
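The Mixture-of-Experts idea can be sketched in a few lines of Python. This is a toy illustration under stated assumptions, not xAI's implementation: the class name `ToyMoE`, the random linear experts, the top-k routing rule and all sizes are hypothetical, and nothing in the release details above says how Grok-1 routes inputs. A small "router" scores each expert for a given input, the highest-scoring experts run, and their outputs are blended using softmax weights.

```python
import math
import random


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def softmax(scores):
    # Subtract the max for numerical stability.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


class ToyMoE:
    """Toy Mixture-of-Experts layer with top-k routing (illustrative only)."""

    def __init__(self, dim, num_experts, top_k, seed=0):
        rng = random.Random(seed)
        self.top_k = top_k
        # Router: one scoring vector per expert.
        self.router = [
            [rng.uniform(-1, 1) for _ in range(dim)] for _ in range(num_experts)
        ]
        # Each expert is a random dim x dim linear map.
        self.experts = [
            [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(dim)]
            for _ in range(num_experts)
        ]

    def forward(self, x):
        # 1. Score every expert for this input.
        scores = [dot(w, x) for w in self.router]
        # 2. Keep only the top_k highest-scoring experts.
        chosen = sorted(
            range(len(scores)), key=lambda i: scores[i], reverse=True
        )[: self.top_k]
        # 3. Turn the chosen scores into mixing weights that sum to 1.
        gates = softmax([scores[i] for i in chosen])
        # 4. Run the chosen experts and blend their outputs.
        out = [0.0] * len(x)
        for gate, idx in zip(gates, chosen):
            expert_out = [dot(row, x) for row in self.experts[idx]]
            out = [o + gate * e for o, e in zip(out, expert_out)]
        return out, chosen, gates


moe = ToyMoE(dim=4, num_experts=8, top_k=2)
output, chosen, gates = moe.forward([1.0, 0.5, -0.5, 2.0])
print("experts used:", chosen)
print("mixing weights:", gates)
print("output:", output)
```

Because only the selected experts run for each input, a model built this way can hold a very large total parameter count while using a fraction of those parameters per prediction.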

The release is the raw base model checkpoint from the Grok-1 pre-training phase, which concluded in October 2023. According to the company, “this means that the model is not fine-tuned for any specific application, such as dialogue.” No further information was provided in what was only a brief blog post.

Musk revealed in July that he had founded xAI and that it would compete against AI services from companies such as Google LLC and OpenAI. xAI said its first model, Grok, was modeled after Douglas Adams’ classic book “The Hitchhiker’s Guide to the Galaxy” and is “intended to answer almost anything and, far harder, even suggest what questions to ask!”

Meanwhile, Apple, the company Steve Jobs built, quietly published a paper Thursday describing its work on MM1, a set of multimodal LLMs for captioning images, answering visual questions and natural language inference.

Thurrott reported today that the paper describes MM1 as a family of multimodal models with up to 30 billion parameters that “achieve competitive performance after supervised fine-tuning on a range of established multimodal benchmarks.” The researchers also claim that multimodal large language models have emerged as “the next frontier in foundation models” after traditional LLMs and that they “achieve superior capabilities.”

A multimodal LLM is an AI system capable of understanding and generating responses across multiple types of data, such as text, images and audio, integrating diverse forms of information to perform complex tasks. The Apple researchers believe that their model delivers a breakthrough that will help others scale these models to larger datasets with better performance and reliability.

Apple’s previous work on multimodal LLMs includes Ferret, a model that was quietly open-sourced in October before being noticed in December.

Image: DALL-E 3
