UPDATED 12:30 EST / AUGUST 26 2024

INFRA

Startup FuriosaAI debuts RNGD chip for LLM and multimodal AI inference

FuriosaAI Inc., a semiconductor startup that’s laser-focused on artificial intelligence, has unveiled a new accelerator chip it says is geared for large language models and multimodal AI.

Its new chip is called RNGD, pronounced “Renegade,” and it was unveiled at the Hot Chips 2024 conference in Stanford University today. It’s sampling to early access customers now, with broader availability slated for next year.

According to Furiosa, the RNGD chip is an extremely efficient data center accelerator that’s designed to support high-performance LLMs and multimodal model inference. The company is positioning it as an alternative to Nvidia Corp.’s graphics processing units.

RNGD is based on a Tensor Contraction Processor or TCP architecture, which the company says provides the perfect balance between efficiency, programmability and performance. It boasts some formidable specifications, with a Thermal Design Power of 150-watts, compared to more than 1,000 watts for some of the leading GPUs on the market today.

Furiosa also claims extremely high performance, with the chip packing 48 gigabytes of high-bandwidth memory. That makes it possible to run open-source LLMs such as Meta Platforms Inc.’s Llama 3.1 8B efficiently on a single card.

The RNGD chip was built on Taiwan Semiconductor Manufacturing Co.’s five-nanometer process and boasts a frequency of 1 gigahertz and 1.5 megabytes of memory bandwidth, with 256 megabytes of on-chip standard random-access memory and a PCIe Gen5 x16 interconnect that supports up to 64-gigabits-per-second throughput.

Programmability is enabled by a “robust compiler” that’s co-designed to be optimized for TCP-based chips, treating entire AI models as a single-fused operation. This means that the RNGD chips can be customized to run almost any LLM or multimodal AI workload, the company said.

What all of these numbers mean is that the Furiosa RNGD chip (pictured, adjacent) is extremely capable when it comes to running some of the best-known LLMs. Indeed, the startup claims some impressive results on industry standard benchmarks with models such as OpenAI’s GPT-J 6B, where it was able to process 15.13 queries per second.

Furiosa has a decent pedigree. It was founded in 2017 by three hardware and software engineers who previously worked for chipmaking giants such as Advanced Micro Devices Inc., Qualcomm Inc. and Samsung Electronics Co. Ltd.

Since its founding, the company has focused on a strategy of rapid iteration and product delivery. Its first-generation chip, known as Warboy, is a high-performance data center accelerator specifically designed for computer vision workloads that compares well with some of Nvidia’s older GPU designs in in the ResNet-50 image classification and SSD – MobileNetV1 object detection benchmarks.

Furiosa co-founder and Chief Executive June Paik revealed RNGD is the result of years of innovation by the startup. “RNGD is a sustainable and accessible AI computing solution that meets the industry’s real-world needs for inference,” he said. “With our hardware now running LLMs at full speed, we’re entering an exciting phase of continuous advancement.”

Featured image: SiliconANGLE/Microsoft Designer

A message from John Furrier, co-founder of SiliconANGLE:

Support our mission to keep content open and free by engaging with theCUBE community. Join theCUBE’s Alumni Trust Network, where technology leaders connect, share intelligence and create opportunities.

  • 15M+ viewers of theCUBE videos, powering conversations across AI, cloud, cybersecurity and more
  • 11.4k+ theCUBE alumni — Connect with more than 11,400 tech and business leaders shaping the future through a unique trusted-based network.
About SiliconANGLE Media
SiliconANGLE Media is a recognized leader in digital media innovation, uniting breakthrough technology, strategic insights and real-time audience engagement. As the parent company of SiliconANGLE, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — with flagship locations in Silicon Valley and the New York Stock Exchange — SiliconANGLE Media operates at the intersection of media, technology and AI.

Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a dynamic ecosystem of industry-leading digital media brands that reach 15+ million elite tech professionals. Our new proprietary theCUBE AI Video Cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.