UPDATED 15:21 EDT / AUGUST 14 2024

Elon Musk’s xAI debuts new Grok-2 and Grok-2 mini language models

Elon Musk’s xAI Corp. has debuted two new language models, Grok-2 and Grok-2 mini, that it claims can perform some tasks with similar accuracy to OpenAI’s GPT-4o.

The models rolled out to X on Tuesday. Later this month, they will also become available to developers through an application programming interface. The API will make it possible to integrate Grok-2 and Grok-2 mini into third-party services.

The debut didn’t go swimmingly, as an apparent lack of guardrails allowed users to generate some distasteful images.

Musk launched xAI early last year to develop large language models. The company released its first LLM, Grok-1, later in 2023 and subsequently raised $6 billion from investors to finance the development of additional models. Grok-2 and Grok-2 mini, the latest fruits of the engineering effort, are rolling out about four months after the previous addition to xAI’s LLM lineup.

Grok-2, the more advanced of the two models, can generate text, troubleshoot code and perform related tasks. It’s also capable of analyzing user-provided images. Grok-2 mini is a scaled-down version of the LLM that trades off some output quality for faster response times and lower inference costs.

In an internal test, xAI compared Grok-2 against several competing models to assess the quality of its output. The evaluation comprised eight benchmark datasets that researchers commonly use to measure LLMs’ accuracy. According to xAI, Grok-2 achieved “performance levels competitive” with the most advanced LLMs on the market.

One of the benchmark datasets that xAI used, GPQA, comprises 448 multiple-choice questions spanning several scientific fields. LLMs that complete the test receive a score reflective of how many questions they answered correctly. Grok-2 achieved a score of 56, which put it ahead of both GPT-4o and Meta’s newly released Llama 3 405B model.
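As a rough illustration of how a multiple-choice benchmark score of this kind can be computed, the short Python sketch below counts correct answers and reports them as a percentage. The function name `accuracy_score` and the four-question sample are hypothetical and are not drawn from GPQA or from xAI’s evaluation setup, whose exact scoring details may differ.

```python
def accuracy_score(answers, answer_key):
    """Return the percentage of questions answered correctly."""
    if len(answers) != len(answer_key):
        raise ValueError("answers and answer_key must have the same length")
    if not answer_key:
        raise ValueError("answer_key must not be empty")
    # Count positions where the model's choice matches the key.
    correct = sum(
        1 for given, expected in zip(answers, answer_key) if given == expected
    )
    return 100.0 * correct / len(answer_key)


# Hypothetical four-question quiz, not real benchmark data.
answer_key = ["A", "C", "B", "D"]
model_answers = ["A", "C", "D", "D"]
print(accuracy_score(model_answers, answer_key))  # 75.0
```

Under this simple scheme, a score of 56 would mean that roughly 56 of every 100 questions were answered correctly.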

The only LLM that outperformed Grok-2 in the GPQA test was Anthropic PBC’s Claude 3.5 Sonnet. The latter model achieved higher scores across most of the benchmark datasets that xAI used in the evaluation, except for two that consisted of math questions. Grok-2 mini, in turn, achieved lower scores than the other LLMs across nearly all the benchmark datasets.

Both of xAI’s new models became available on X on Tuesday for users with paid Premium and Premium+ subscriptions. The LLMs are accessible through a ChatGPT-like chatbot interface.

X’s implementation of Grok-2 is integrated with a third-party AI model called FLUX.1. The latter model, which was developed by a startup called Black Forest Labs Inc., allows users to generate images with natural language prompts. The Verge reported that Grok-2’s image generation features currently appear to have few guardrails against harmful output.

Later this month, xAI plans to make Grok-2 and Grok-2 mini available through an API. The offering will enable developers to integrate the models into their own applications. The API includes cybersecurity controls, a traffic analytics tool and the option to deploy the models in data centers near end-users to reduce latency.

Image: xAI

