UPDATED 18:29 EDT / DECEMBER 12 2024

AI

OpenAI finally launches screen and live video observation for paying ChatGPT users

OpenAI today announced that it’s finally giving ChatGPT the ability to observe screens and live video to provide comments and feedback, a day after Google LLC launched similar features in Gemini 2.0.

The features, which essentially give ChatGPT eyes, were first teased in May at the launch of the multimodal GPT-4o model, with fans waiting for it since that time as OpenAI has rolled out other features.

With the new features, unveiled in a live stream today, ChatGPT can recognize objects via a camera on a smartphone or webcam, and react to what is on a user’s computer screen, as well as anything open on a user’s computer or phone.

In a demo from OpenAI, researchers turn on the screen-sharing feature to allow ChatGPT to provide real-time feedback during a graphic design project. ChatGPT identified on-screen elements such as colors, layouts and text, offering actionable suggestions to enhance the design.

The live video feature works similarly. When users give access to the camera on their device, ChatGPT processes the video feeds in real-time to identify and analyze objects, movements and contexts. The information is then used to provide relevant insights or instructions, such as identifying a piece of hardware in a tech setup or giving step-by-step guidance during a repair.

Additionally, OpenAI introduced a Santa voice feature, which allows users to interact with ChatGPT in a festive tone, complete with holiday-themed responses. The seasonal addition has been designed to add a layer of engagement and fun, particularly for families and children.

During the demonstration, ChatGPT’s Santa voice delivered personalized holiday stories and answered questions about Christmas traditions.

Though the features sound great, anything that uses live video or has live access to screens is going to raise security concerns.

To address those concerns, OpenAI noted that the new functions use guardrails to protect user data. For live video and screen-sharing, all data is processed locally whenever possible to ensure that sensitive information remains private. Additionally, OpenAI added, any data shared with the cloud is encrypted and not stored beyond the session’s duration.

The new features are being rolled out to ChatGPT Plus and Enterprise users starting from today, with wider availability planned for early next year. However, free users will not be getting access to the live video and screen-sharing capabilities since OpenAI wants to focus on making money from its premium offerings.

Image: SiliconANGLE/Ideogram

A message from John Furrier, co-founder of SiliconANGLE:

Support our mission to keep content open and free by engaging with theCUBE community. Join theCUBE’s Alumni Trust Network, where technology leaders connect, share intelligence and create opportunities.

  • 15M+ viewers of theCUBE videos, powering conversations across AI, cloud, cybersecurity and more
  • 11.4k+ theCUBE alumni — Connect with more than 11,400 tech and business leaders shaping the future through a unique trusted-based network.
About SiliconANGLE Media
SiliconANGLE Media is a recognized leader in digital media innovation, uniting breakthrough technology, strategic insights and real-time audience engagement. As the parent company of SiliconANGLE, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — with flagship locations in Silicon Valley and the New York Stock Exchange — SiliconANGLE Media operates at the intersection of media, technology and AI.

Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a dynamic ecosystem of industry-leading digital media brands that reach 15+ million elite tech professionals. Our new proprietary theCUBE AI Video Cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.