UPDATED 15:14 EDT / AUGUST 21 2023

AI

Generative AI startup Contextual AI names Google Cloud its preferred cloud provider

Contextual AI Inc., a startup developing large language models for the enterprise, today named Google Cloud as its preferred cloud provider.

The company will use Google Cloud services to power several parts of its business. Most notably, it plans to leverage the search giant’s infrastructure to train its language models.

Palo Alto, California-based Contextual AI launched from stealth mode earlier this year with $20 million in funding. It’s led by co-founder and Chief Executive Officer Douwe Kiela, an adjunct professor at Stanford University. The language models the startup is building are based on a technology called retrieval augmented generation, or RAG, that Kiela helped pioneer.

Artificial intelligence models usually draw upon the dataset on which they were trained to answer user questions. According to Contextual AI, its RAG technology allows a neural network to draw on information from external sources as well. Moreover, a RAG-powered neural network can do so with no need for retraining, which reduces infrastructure costs.

Contextual AI says its technology provides several benefits. The startup’s language models can cite their sources when answering a user question. Moreover, Contextual AI claims its models are less prone to AI hallucinations than a traditional neural network.

“Building a large language model to solve some of the most challenging enterprise use cases requires advanced performance and global infrastructure,” said Kiela.

The startup plans to train its neural networks using Google Cloud’s A3 and A2 instances. The former instance offers access to eight of Nvidia Corp.’s flagship H100 graphics processing units. The A2 instance includes 16 processors from the A100 chip family, which was Nvidia’s flagship GPU series before the launch of the H100.

Contextual AI also plans to use Google’s internally developed TPU machine learning chips. The latest addition to the chip series, the TPU v4, debuted last year. Google says the processor is 2.1 times faster than its previous-generation silicon and nearly three times more power-efficient.

The manner in which Google has deployed TPU v4 chips within its data centers is also a part of the processor series’ value proposition.

According to Google, each TPU v4 cluster comprises 4,096 chips linked together by a custom optical interconnect. This interconnect automatically reconfigures itself based on AI models’ requirements. More specifically, it adjusts the TPU v4 cluster’s network settings in a way that speeds up the neural network running on top.

Contextual AI says the language models it’s training on Google Cloud will lend themselves to a variety of use cases. According to the startup, customer support is one area where its technology could be applied. Additionally, it sees opportunities to deploy its language models in the financial sector.

Image: Unsplash

A message from John Furrier, co-founder of SiliconANGLE:

Support our mission to keep content open and free by engaging with theCUBE community. Join theCUBE’s Alumni Trust Network, where technology leaders connect, share intelligence and create opportunities.

  • 15M+ viewers of theCUBE videos, powering conversations across AI, cloud, cybersecurity and more
  • 11.4k+ theCUBE alumni — Connect with more than 11,400 tech and business leaders shaping the future through a unique trusted-based network.
About SiliconANGLE Media
SiliconANGLE Media is a recognized leader in digital media innovation, uniting breakthrough technology, strategic insights and real-time audience engagement. As the parent company of SiliconANGLE, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — with flagship locations in Silicon Valley and the New York Stock Exchange — SiliconANGLE Media operates at the intersection of media, technology and AI.

Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a dynamic ecosystem of industry-leading digital media brands that reach 15+ million elite tech professionals. Our new proprietary theCUBE AI Video Cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.