UPDATED 15:37 EST / MARCH 14 2019

BIG DATA

Microsoft open-sources technology behind Azure’s powerful data compression

International Data Corp. estimates that the total volume of digital information in the world will balloon from 33 zettabytes, or trillion gigabytes, today to 175 zettabytes in 2025. This rapid growth is being felt particularly strongly by cloud providers such as Microsoft Corp., which host not just their own information but also that of countless other organizations.

To reduce the strain on its infrastructure, the company has developed a cutting-edge system for compressing data. Microsoft this morning released the specifications for the system under an open-source project dubbed Zipline.

The company touts its technology as considerably more powerful than the compression software commonly used in the industry today. Kushagra Vaid, the general manager of the Azure Hardware Infrastructure team, used the popular Zlib tool as a reference point in the blog post announcing Zipline.

Zlib is an industry-standard compression library that can be found in the Linux kernel, iOS and other foundational software platforms. Vaid wrote that Zipline provides data compression rates up to twice as high as high those offered by Zlib. Moreover, the system is described as capable of doing so while providing better throughput and lower latency than several other popular compression tools.

In practice, this means that Zipline can shrink workloads to just a fraction of their size. Microsoft claims that the system compresses the storage footprint of application data stored on Azure by as much as 92 percent. Zipline provides even greater reductions for other types of data such as machine-generated logs from connected devices.

7362c425-5a6d-41e2-8d75-80c475551269

Microsoft is open-sourcing the algorithm that the system uses to perform compression, as well as the specifications for the custom hardware on which the algorithm is designed to run. These specifications include the low-level register transfer language in which Zipline expresses data operations.

“Over time, we anticipate Project Zipline compression technology will make its way into several market segments and usage models such as network data processing, smart SSDs, archival systems, cloud appliances, general purpose microprocessor, IoT and edge devices,” Microsoft’s Vaid wrote.

Zipline is not the first component of Azure that the company has contributed to the open-source community. Previously, Microsoft Corp. released the code for an artificial intelligence engine that supports some of the cloud platform’s services. It has also shared the schematics for a homegrown chip called Cerberus that can protect a server’s firmware from tampering attempts.

Photo: Microsoft

A message from John Furrier, co-founder of SiliconANGLE:

Support our mission to keep content open and free by engaging with theCUBE community. Join theCUBE’s Alumni Trust Network, where technology leaders connect, share intelligence and create opportunities.

  • 15M+ viewers of theCUBE videos, powering conversations across AI, cloud, cybersecurity and more
  • 11.4k+ theCUBE alumni — Connect with more than 11,400 tech and business leaders shaping the future through a unique trusted-based network.
About SiliconANGLE Media
SiliconANGLE Media is a recognized leader in digital media innovation, uniting breakthrough technology, strategic insights and real-time audience engagement. As the parent company of SiliconANGLE, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — with flagship locations in Silicon Valley and the New York Stock Exchange — SiliconANGLE Media operates at the intersection of media, technology and AI.

Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a dynamic ecosystem of industry-leading digital media brands that reach 15+ million elite tech professionals. Our new proprietary theCUBE AI Video Cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.