UPDATED 02:40 EDT / NOVEMBER 01 2017

INFRA

Microsoft has built a secret network emulator it says can prevent most cloud outages

Microsoft Corp.’s researchers have lifted the lid on a new “cloud-scale” emulator they’ve built that’s designed to completely eliminate outages in its public cloud.

The emulator is called CrystalNet, and it was one of the main subjects of discussion at the ACM Annual Symposium on Operating Systems Principles 2017 event that’s taking place this week in Shanghai, China.

Victor Bahl, a distinguished scientist director with Microsoft’s Mobile & Networking Research team, said CrystalNet was the fruit of more than two years’ research and development efforts. He explained in a blog post that CrystalNet was designed in the belief that it should be possible to spot cloud network problems before they occur by taking a more proactive approach. Bahl and his colleagues quickly decided that the best way to do this was to emulate the network in order to test it.

“The idea of testing before deploying is age old, but following a two-year study by Microsoft Research looking at all documented outages across all major cloud providers, we believed that we could find most potential problems if we first validated a production network on an identical copy of the network,” Bahl said.

CrystalNet is identical because it uses the same network topology, hardware, software and configurations as Microsoft’s production network. It can be run on emulated devices inside virtual machines and can support the software images of most common networking devices.

Microsoft offered the following diagram that lays out CrystalNet’s architecture:

architecture-of-crystalnet

The whole point of having an emulated network such as CrystalNet is that Microsoft’s Azure engineers can use it to test planned changes and updates for any issues, rather than just implementing them and hoping that nothing breaks, something that happens with alarming regularity among the major public cloud providers. Bahl said that by using CrystalNet, Microsoft’s engineers can “reduce the significance of network issues” that sometimes crop up after updates have been made.

Early results look good. Bahl said that Microsoft’s Azure team has already used CrystalNet to validate a number of new network designs, architectural changes, hardware and firmware updates and network configuration changes. The company has also used CrystalNet to successfully build and test its Software for Open Networking in the Cloud, which is an operating system for network switches.

Despite the success of these tests, one expert said CrystalNet may not be able to prevent every kind of outage. Holger Mueller, vice president and principal analyst at Constellation Research Inc., said that although it’s nice to see Microsoft trying to address the network issues that have been responsible for several previous Azure cloud outages, there are many other aspects to consider when trying to prevent downtime.

“Being able to simulate and test before deploying is the right way to ensure uptime, but a test remains a test,” Mueller said. “Being able to test locally and roll back instantly are the key aspects to ensuring uptime. But rollback capabilities remain something that most infrastructure as a service providers need to work on, especially when it comes to network infrastructure updates.”

Bahl didn’t say anything about when this promising new technology might be made commercially available. But he did acknowledge that some Azure customers have already expressed an interest in using CrystalNet to prevent outages in their own networks. Most likely CrystalNet will be integrated with the Azure Cloud first, before the company starts thinking about selling it to others.

Image: noupload/pixabay

A message from John Furrier, co-founder of SiliconANGLE:

Support our mission to keep content open and free by engaging with theCUBE community. Join theCUBE’s Alumni Trust Network, where technology leaders connect, share intelligence and create opportunities.

  • 15M+ viewers of theCUBE videos, powering conversations across AI, cloud, cybersecurity and more
  • 11.4k+ theCUBE alumni — Connect with more than 11,400 tech and business leaders shaping the future through a unique trusted-based network.
About SiliconANGLE Media
SiliconANGLE Media is a recognized leader in digital media innovation, uniting breakthrough technology, strategic insights and real-time audience engagement. As the parent company of SiliconANGLE, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — with flagship locations in Silicon Valley and the New York Stock Exchange — SiliconANGLE Media operates at the intersection of media, technology and AI.

Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a dynamic ecosystem of industry-leading digital media brands that reach 15+ million elite tech professionals. Our new proprietary theCUBE AI Video Cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.