

Databricks Inc. today introduced a new serverless database called Lakebase that can process more than 10,000 queries per second.
The service is based on PostgreSQL, a popular open-source relational database. It’s powered by technology that Databricks obtained through its recent acquisition of Neon Inc., a startup with an eponymous serverless PostgreSQL platform. The deal was reportedly worth about $1 billion.
PostgreSQL was originally designed to store relational data organized into rows and columns. By installing an open-source extension called pgvector, developers can add support for vectors, the data structures in which artificial intelligence models keep information. Lakebase is compatible with pgvector as well as another extension, PostGIS, that adds support for geospatial data such as maps.
Databricks has integrated Lakebase into its flagship data lakehouse platform. As a result, customers can manage both using a single set of controls. According to Databricks, the integration addresses one of the main challenges involved in running databases such as PostgreSQL: they historically had to be managed separately from a company’s analytics environment. That arrangement created additional work for administrators.
Databricks’ platform ships with a data governance tool called Unity Catalog. It enables administrators to regulate which user can access what record, as well as monitor how business information is used. Customers can use Unity Catalog’s controls to manage their Lakebase environments.
Databricks’ platform stores information in structures called lakehouse tables. According to the company, customers can automatically sync data between lakehouse tables managed with Unity Catalog and Lakebase records. There are three ways of going about the task. Databricks’ platform can be configured to sync data on a one-off basis, automatically when certain conditions are met or continuously.
The integration also extends to other areas. Customers may use the Databricks SQL Editor to run queries on Lakebase data. Meanwhile, custom applications deployed on the company’s platform using the Databricks Apps tool can likewise access the new PostgreSQL service.
Databricks manages the infrastructure on which Lakebase runs. According to the company, each customer database is spread across multiple cloud zones to mitigate the impact of outages. If one of the zones experiences downtime, the rest can still process queries. A recovery feature allows customers to restore records created up to 35 days ago in case user error or technical issues cause data loss.
“It’s now possible to launch thousands or even millions of agents with their own databases cost-effectively, without requiring specialized engineers (e.g. DBAs) to maintain/support staging environments; this reduces TCO,” a group of Databricks executives wrote in a blog post.
Lakebase is currently in public preview. Databricks plans to follow up today’s launch by releasing “significant improvements” for the database in the coming months.
Support our mission to keep content open and free by engaging with theCUBE community. Join theCUBE’s Alumni Trust Network, where technology leaders connect, share intelligence and create opportunities.
Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a dynamic ecosystem of industry-leading digital media brands that reach 15+ million elite tech professionals. Our new proprietary theCUBE AI Video Cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.