UPDATED 10:10 EST / FEBRUARY 21 2024

AI

Google releases Gemma family of open-source AI models inspired by Gemini

Google LLC released Gemma today, a family of lightweight open-source artificial intelligence large language models built using the same research as Gemini, the company’s largest and most powerful AI technology.

According to Google, the model’s name comes from the Latin word “gemma,” which is the root for “gem,” meaning precious stone.

Gemma shares inspiration and technical components with Google’s Gemini model, which is the most powerful model that the company has produced to date. Gemini underlies the company’s Gemini AI chatbot, recently renamed from Bard, which is available on the web and mobile devices.

Gemma is available in two sizes, one with two billion adjustable parameters, and the other with seven billion parameters. Both have pretrained and instruction-tuned variants for developers and researchers to use. The company said that the models are capable of achieving best-in-class performance compared to other models of similar sizes and can run directly on AI-enabled laptops and desktop computers.

Although Gemini is a fully robust multimodal model capable of receiving audio, video and images and outputting text and images, Gemma is only capable of text inputs and text outputs. Gemini is also a multilingual model and capable of speaking in multiple languages, whereas at release Google said Gemma will only use English.

By releasing these models as open source, Google is making certain that developers and researchers have access to AI models that have the same technical infrastructure as Gemini to experiment on even if they cannot afford access to Gemini. By providing these models as open, researchers and developers will have direct access to the parameters and the underlying technical architecture, which will allow them to easily tune and adjust the models to fit their needs.

The Google team that created Gemma added that the models were designed with the company’s AI safety principles. “As part of making Gemma pre-trained models safe and reliable, we used automated techniques to filter out certain personal information and other sensitive data from training sets,” Google said. The models were also produced using reinforcement learning from human feedback to create responsible behaviors and security teams did extensive security evaluations.

Additionally, Google said that the company is releasing a responsible AI tool kit together with the models “to help developers and researchers prioritize building safe and responsible AI.” The toolkit will assist developers provide safety classification, debugging Gemma’s behavior and accessing best practices based on Google’s own experience.

Developers interested in working with Gemma will find it ready to use on Colab and Kaggle notebooks starting today, and can also quickly grab it from repositories such as Hugging Face and Nvidia NeMo.

Image: geralt/Pixabay

A message from John Furrier, co-founder of SiliconANGLE:

Support our mission to keep content open and free by engaging with theCUBE community. Join theCUBE’s Alumni Trust Network, where technology leaders connect, share intelligence and create opportunities.

  • 15M+ viewers of theCUBE videos, powering conversations across AI, cloud, cybersecurity and more
  • 11.4k+ theCUBE alumni — Connect with more than 11,400 tech and business leaders shaping the future through a unique trusted-based network.
About SiliconANGLE Media
SiliconANGLE Media is a recognized leader in digital media innovation, uniting breakthrough technology, strategic insights and real-time audience engagement. As the parent company of SiliconANGLE, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — with flagship locations in Silicon Valley and the New York Stock Exchange — SiliconANGLE Media operates at the intersection of media, technology and AI.

Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a dynamic ecosystem of industry-leading digital media brands that reach 15+ million elite tech professionals. Our new proprietary theCUBE AI Video Cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.