UPDATED 17:46 EST / NOVEMBER 25 2024

Shimon Ben-David, chief technology officer of WekaIO, talks to theCUBE about AI infrastructure at SC24.

Weka SC24 highlights from theCUBE: Tackling AI infrastructure challenges

Artificial intelligence is driving transformative advancements across industries, and AI infrastructure innovations showcased at SC24 highlight the potential to reshape modern computing.

From overcoming data center inefficiencies to accelerating cancer research breakthroughs, Weka is collaborating with leaders such as Nvidia Corp., Super Micro Computer Inc. and Dell Technologies Inc. to develop solutions that enable scalable enterprise AI deployments. With groundbreaking tools such as the Weka AI RAG Reference Platform, these partnerships are setting the stage for seamless AI integration across diverse sectors, solving complex challenges and unlocking new possibilities.

“We created this reference architecture called Weka AI RAG Reference Platform,” said Shimon Ben-David (pictured), CTO of WekaIO Inc. “Weka is a high-performance data platform … we are seeing customers still struggling with how to implement RAG inferencing. It has a lot of moving components. Honestly, there’s no real blueprint or protocols defined yet for that.”

Ben-David; Nilesh Patel, chief product officer of WekaIO; and Jonathan Martin, president of WekaIO, spoke with theCUBE Research’s Savannah Peterson at SC24, during an exclusive broadcast on theCUBE, SiliconANGLE Media’s livestreaming studio. They discussed how AI infrastructure innovations showcased at SC24, including Weka’s WARRP and collaborations with Nvidia, Supermicro and Dell, are addressing challenges in scalable AI deployments, data center efficiency and cancer research advancements.

Here’s a special recap of key themes discussed with Weka executives during SC24. Be sure to check out SiliconANGLE and theCUBE’s full coverage, including more articles and the on-demand broadcast. (* Disclosure below.)

Transforming AI infrastructure

TheCUBE’s live coverage from SC24 highlighted the transformative potential of cutting-edge AI infrastructure, as showcased by Weka, Nvidia and Run:ai. Ben-David talked about how these companies are addressing challenges in deploying enterprise AI at scale through collaborative solutions, such as Weka’s WARRP. Designed to simplify retrieval-augmented generation workflows, the platform integrates Nvidia GPUs and Run:ai’s orchestration tools to create a cohesive system for scalable AI deployment.

“What we found when we went through that journey of describing WARRP, creating WARRP, building it, we saw that obviously, as I mentioned, there’s a lot of moving parts, a lot of frameworks, orchestration, data challenges, whether you are scaling or not,” Ben-David said. “Not all of them are actually the GPUs. We are hitting some, we measure our efficiency by times to token, cost per token, token throughput.”

Read More: https://siliconangle.com/2024/11/20/ai-infrastructure-expanding-arena-modern-supercomputing-sc24/

Tackling AI data center challenges

In addition, at SC24, Nvidia, Supermicro and Weka unveiled a collaborative approach to addressing power efficiency, scalability and cost concerns in AI data centers. Patel discussed how their combined innovations aim to balance system design while meeting the growing demands of AI infrastructure.

“As we continue to see the build-out [of AI data centers], two challenges are happening,” Patel said. “One is the power consumption and the power requirement in data centers is growing like crazy. The second thing is now we are getting into [inferencing] space where it’s becoming a token economy. The cost token for dollars, tokens per wattage use and so on … have become our important KPIs.”
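For readers who want to see how such token-economy KPIs might be computed, here is a minimal Python sketch. It is an illustration only, not Weka's or Nvidia's methodology: the `InferenceRun` fields, the `token_kpis` helper and the sample numbers are all hypothetical, and "tokens per wattage" is interpreted here as tokens per joule (tokens per second divided by average watts), which is an assumption.

```python
from dataclasses import dataclass


@dataclass
class InferenceRun:
    """Hypothetical measurements from one inference benchmark run."""
    tokens_generated: int        # total output tokens produced
    wall_seconds: float          # total elapsed time for the run
    first_token_seconds: float   # delay before the first token appeared
    cost_dollars: float          # infrastructure cost attributed to the run
    avg_power_watts: float       # average power draw during the run


def token_kpis(run: InferenceRun) -> dict[str, float]:
    """Derive simple token-economy KPIs from raw measurements."""
    # Energy used over the run: watts multiplied by seconds gives joules.
    energy_joules = run.avg_power_watts * run.wall_seconds
    return {
        "time_to_first_token_s": run.first_token_seconds,
        "tokens_per_second": run.tokens_generated / run.wall_seconds,
        "cost_per_token_usd": run.cost_dollars / run.tokens_generated,
        "tokens_per_dollar": run.tokens_generated / run.cost_dollars,
        "tokens_per_joule": run.tokens_generated / energy_joules,
    }


if __name__ == "__main__":
    # Made-up numbers purely for illustration, not measured results.
    run = InferenceRun(
        tokens_generated=120_000,
        wall_seconds=60.0,
        first_token_seconds=0.25,
        cost_dollars=0.30,
        avg_power_watts=4_000.0,
    )
    for name, value in token_kpis(run).items():
        print(f"{name}: {value:.6g}")
```

Cost per token and tokens per dollar are reciprocals of each other, so a team would typically track whichever reads more naturally for its budget, while the power-based figure depends on how consistently power draw is measured across systems.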

Read More: https://siliconangle.com/2024/11/21/future-ai-data-centers-sc24/

Advancing cancer research with AI supercomputing

Also, at SC24, experts from Memorial Sloan Kettering Cancer Center, Dell and Weka discussed how their collaboration is driving breakthroughs in cancer research through advanced AI infrastructure. MSK’s innovative supercluster has dramatically reduced research timelines, enabling faster discoveries and improved patient care. Progressive companies prioritize GPUs, fast networking and advanced infrastructure, explained Martin.

“It’s very hard to kind of leap forward 30 years and think that you can walk around with a plastic rectangle in your pocket with some of the world’s knowledge on it,” Martin said. “That’s kind of where AI is right now. We are very early in the journey … but it is going to transform every walk of life.”

Read More: https://siliconangle.com/2024/11/20/ai-in-cancer-research-transforming-breakthroughs-msk-sc24/

Find all of our reporting here, and watch the full playlist from our Nov. 19-21 broadcast below:

(* Disclosure: WekaIO Inc. sponsored this segment of theCUBE. Neither WekaIO nor other sponsors have editorial control over content on theCUBE or SiliconANGLE.)

Photo: SiliconANGLE
