

Microsoft Corp. today released the code for Phi-4, a small language model that can generate text and solve math problems.
The company first detailed the model last month. Initially, Phi-4 was only accessible through Microsoft's Azure AI Foundry artificial intelligence development service. The model is now downloadable on Hugging Face, a popular website for hosting open-source AI projects.
Phi-4 is the fourth iteration of a small language model series that Microsoft introduced in 2023. It features 14 billion parameters, the internal values that a neural network learns during training and that determine how it processes data. Microsoft researchers trained it on a cluster of 1,920 H100 graphics processing units from Nvidia Corp. over the course of 21 days.
The model is based on the industry-standard Transformer architecture that underpins most large language models. When they receive a user prompt, Transformer models break down the input into individual words and determine the meaning of each word by analyzing the surrounding text. Moreover, they prioritize the parts of the surrounding text that are deemed to be most relevant.
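That relevance-weighting step is known as attention. As an illustrative sketch only, not Phi-4's actual implementation, the core computation can be written in a few lines of numpy: each token's representation is rebuilt as a weighted average of the other tokens' representations, with the weights reflecting how relevant each surrounding token is.

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Minimal attention sketch: weight each token's value vector (rows of V)
    by its relevance to the querying token (rows of Q)."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)  # pairwise relevance scores between tokens
    # Softmax turns scores into weights that sum to 1 for each query token
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V, weights  # weighted mix of value vectors, plus weights
```

Production models add learned projections, multiple attention heads and heavy hardware optimization, but the weighted-average idea is the same.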
Phi-4 implements a so-called decoder-only variant of the Transformer architecture. The original encoder-decoder Transformer design analyzes the text both before and after a word to determine its meaning. Decoder-only models focus solely on the text that precedes the word, which reduces the amount of data they have to process and thereby lowers inference costs.
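The "text that precedes the word" restriction is typically enforced with a causal mask. As a hedged sketch, assuming plain numpy and no claim about Phi-4's internals, the mask simply blocks each position from attending to later positions:

```python
import numpy as np

def causal_attention(Q, K, V):
    """Decoder-style attention sketch: position i may only attend to
    positions j <= i, i.e. tokens that precede it."""
    n, d_k = Q.shape
    scores = Q @ K.T / np.sqrt(d_k)
    mask = np.tril(np.ones((n, n), dtype=bool))  # lower triangle = allowed
    scores = np.where(mask, scores, -np.inf)     # block attention to future tokens
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V
```

Because the first token can attend only to itself, its output is exactly its own value vector, which is an easy way to sanity-check the mask.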
In a research paper, Microsoft detailed that it honed Phi-4’s output quality using two post-training optimization techniques. Those methods are known as direct preference optimization and supervised fine-tuning. Both involve supplying a language model with examples explaining how it should generate prompt responses.
In an internal evaluation, Microsoft compared Phi-4 against Llama 3.3 70B, an LLM with five times as many parameters. The company says that Phi-4 delivered better performance across the popular GPQA and MATH benchmarks. The two test datasets contain science questions and math problems, respectively.
Phi-4 joins the growing list of small language models that have been open-sourced by major tech firms over the past year.
Last February, Google LLC introduced a series of small language models called Gemma. The algorithms in the series have between 2 billion and 27 billion parameters. According to Google, the version with 27 billion parameters can outperform models more than twice its size.
More recently, Meta Platforms Inc. released two Llama 3.2 models with under five billion parameters. The company followed up the release by open-sourcing even more efficient versions of those models that implement a machine learning technique called quantization. The technique stores a neural network's weights at lower numerical precision, which reduces the amount of hardware necessary to run it.
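As a minimal sketch of the idea, not of Meta's specific scheme, symmetric 8-bit quantization maps floating-point weights onto 256 integer levels and keeps a single scale factor to recover approximate values later:

```python
import numpy as np

def quantize_int8(weights):
    """Map float weights to int8 plus a scale factor (symmetric quantization)."""
    scale = np.abs(weights).max() / 127.0  # largest weight maps to +/-127
    q = np.round(weights / scale).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    """Recover approximate float weights from the int8 representation."""
    return q.astype(np.float32) * scale
```

Each weight shrinks from 32 (or 16) bits to 8, at the cost of a small rounding error bounded by half the scale factor.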