UPDATED 13:45 EDT / MAY 19 2026

Google targets AI agents and video generation with Gemini 3.5 Flash and Omni

Google LLC today introduced two new generative artificial intelligence models that push its Gemini family further into AI agents and multimodal creation: Gemini 3.5 Flash, a fast reasoning model designed to power agentic workflows, and Gemini Omni, a creative model that can generate and edit video from nearly any input.

Gemini 3.5 is the newest generation of Google’s flagship model family, combining frontier intelligence with tool use. This version provides the scaffolding for building reasoning agents and begins with the release of Flash, the smallest and most nimble model in the series, which balances high speed with high performance at low cost.

According to Google, Flash 3.5 is designed to outperform Gemini 3.1 Pro on challenging benchmarks such as Terminal-Bench 2.1, GDPval-AA and MCP Atlas. The company added that it also exceeds other frontier models on the market in speed, running four times faster than the fastest in the industry.

Flash 3.5’s speed and performance enable it to handle the long-horizon tasks required for AI agent work. When coupled with the new update to Antigravity, the company’s agentic coding editor, the new large language model becomes a powerful AI engine capable of orchestrating multiple agents that collaborate at scale to solve complex problems.

The company also released a new personal assistant named Spark. Google said it built 3.5 Flash to act as the “brain” that can help people navigate their lives and take actions on their behalf. It is rolling out to trusted testers today.

The same model has also become the default for the Gemini app and AI Mode in Search globally.

Gemini Omni: Generate with true multimodal reasoning

Today, Google introduced Gemini Omni, bringing the company’s flagship large language model reasoning to the ability to create anything from any input, starting with video.

The company said that with Omni, users can combine images, audio, video and text as input, and it will generate videos using Gemini’s real-world knowledge to produce high-fidelity output. Users can then use conversation to iterate on and edit those videos.

The first model in the new family, Omni Flash, will be available starting today in the Gemini app, Google Flow and YouTube Shorts.

Google said that using Gemini Omni Flash, users can start with whatever formats they like to produce wild but lifelike videos. That means they can take an image or a video and insert themselves into it. They could also take a short video and change the style from realistic to cartoon or anime, or make it look as though they were walking through a Renaissance painting.

Every conversation with the model layers changes and transformations according to the last request. This allows users to change specific details or broader visual elements. The model also takes into account the physics and consequences of requests, allowing users to change the environment, angle, style and action, as well as add new characters, objects, details and more.

The company stressed that it’s dedicated to developing AI responsibly and is designing policies to protect users from harm involving the use of its AI tools. In line with this, it’s incorporating SynthID, an imperceptible watermark that identifies videos generated by Omni and other AI sources.

Image: SiliconANGLE/DALL-E

A message from John Furrier, co-founder of SiliconANGLE:

Support our mission to keep content open and free by engaging with theCUBE community. Join theCUBE’s Alumni Trust Network, where technology leaders connect, share intelligence and create opportunities.

15M+ viewers of theCUBE videos, powering conversations across AI, cloud, cybersecurity and more
11.4k+ theCUBE alumni — Connect with more than 11,400 tech and business leaders shaping the future through a unique trusted-based network.

About SiliconANGLE Media

SiliconANGLE Media is a recognized leader in digital media innovation, uniting breakthrough technology, strategic insights and real-time audience engagement. As the parent company of SiliconANGLE, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — with flagship locations in Silicon Valley and the New York Stock Exchange — SiliconANGLE Media operates at the intersection of media, technology and AI.

Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a dynamic ecosystem of industry-leading digital media brands that reach 15+ million elite tech professionals. Our new proprietary theCUBE AI Video Cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.

Google targets AI agents and video generation with Gemini 3.5 Flash and Omni

Gemini Omni: Generate with true multimodal reasoning

Image: SiliconANGLE/DALL-E

A message from John Furrier, co-founder of SiliconANGLE:

LATEST FROM THECUBE

UPCOMING CUBE EVENTS

RECENT CUBE EVENTS

KB4-CON 2026

VeeamON 2026

Boomi World 2026

Red Hat Summit 2026

Securing the AI Factory with Dell Technologies and Intel 2026

Google targets AI agents and video generation with Gemini 3.5 Flash and Omni

Gemini Omni: Generate with true multimodal reasoning

Image: SiliconANGLE/DALL-E

A message from John Furrier, co-founder of SiliconANGLE:

LATEST STORIES

LATEST STORIES

KB4-CON 2026

VeeamON 2026

Boomi World 2026

Red Hat Summit 2026

Securing the AI Factory with Dell Technologies and Intel 2026

Cookies