UPDATED 13:06 EST / DECEMBER 13 2020

CLOUD

As AWS storage revenue nears $10B, legacy players feel the heat

The storage business as we know it has changed forever.

The storage business as we know it has changed forever. On premises, external storage was once a virtually unlimited and untapped bastion of innovation, VC funding and lucrative exits. Today it is a shadow of its former self and the glory days of storage in the data center will not return. We believe AWS’ storage business is around $6.5-$7B today and will approach $10B annually on a run rate basis by the end of next year. While several trends are affecting the traditional storage business, but by far, AWS’ moves are the most notable. 

In this Breaking Analysis, we’ll lay out our premise for what’s happening in the storage industry and the impact of AWS, and we’ll share some fresh insights from ETR data that support our current scenario.

Three decades of tectonic shifts

A simplified history of storage shows us there have been five major waves of innovation over five decades. The dominant industry model has evolved from:

  • mainframe-centric vertically integrated business model led by IBM;
  • To a dis-integrated business that saw 70 to 80 Winchester disk drive companies rise and then fall, serving a booming PC industry, a wave led by the likes of Seagate;
  • That supplied the emergence of an intelligent, controller-based, external disk array business that drove huge margins for function that while lucrative, was far cheaper than captive storage from system vendors, an era led by EMC and NetApp;
  • And then this business was disrupted by a flash and software-defined, led by Pure Storage and VMware.

The future of storage is being defined by cloud and intelligent data management, and it’s being led by AWS and a three-letter company we’ll call TBD.

The impact of AWS cannot be overstated

Although legacy storage players are tired of talking about cloud’s impact on their business, the reality cannot be ignored. Cloud has been the most disruptive force in storage over the past 10 years and we’ve reported on the spending impact extensively. But cloud is not the only factor pressuring the on-prem storage business. Flash has killed what we call “performance by spindles” — the practice of adding more disk drives to keep performance from tanking.

But drilling down into the cloud, AWS has been the most significant factor. Lots of people talked about object storage before AWS, but there sure wasn’t much spending going on. AWS’ Simple Storage Service or S3 changed that. AWS is getting much more aggressive about expanding its portfolio of storage offerings. S3 came out in 2006 as the very first AWS service.

Then Elastic Block Storage or EBS came two years later. But nobody really paid too much attention. Last fall at storage day, we saw AWS announce a number of services, many file-related, and this year we saw four new announcements from Amazon at re:Invent.

We think AWS’ storage revenue will surpass $6 billion this year and could be as high as $10 billion by 2022. There’s not much data out there but this would mean AWS’ storage biz is larger than that of NetApp, which means AWS is larger than every traditional storage player with the exception of Dell. 

Here’s a little glimpse of what’s coming at the legacy storage business: It’s a clip of Mai-Lan Tomsen Bukovecthe vice president of AWS block and object storage.

OK, now you may say, “What does that have to do with anything?” Well, we just think it’s badass that this woman boxes and runs a business that we think is approaching $10 billion; and the business she looks after is taking spending from traditional storage in a fairly major way.

AWS storage announcements at re:Invent 2020

AWS made four storage announcements this year at re:Invent.

io2 Block Express Volumes

The first is EBS io2 Block Express volumes. AWS claims this is the first storage area network or SAN for the cloud and it offers up to 256,000 IOPS and 4,000 MBPS throughput and 64 TB of capacity. First of all, it’s not really the first SAN in the cloud in our view, and there may be others, but Pure Storage announced Cloud Block Store in 2019 at its annual Accelerate customer conference.

And you may also be saying, “What’s the big deal?” The performance is a joke compared with what many on-premises products can demonstrate. As a point of comparison, there’s a startup called VAST Data. Its 4U enclosure with bundled storage and compute can do 400,000 IOPS and 40,000 MB/Sec. And that can be scaled. So yeah, we hear you.

But the point is that this feature will be great for a lot of workloads. And it’s fully integrated with AWS services. That means, for example, it will be very convenient for AWS customers to invoke this capability for Aurora and other AWS databases through its RDS service. It’s another easy button for developers to push. This is especially important as we see AWS rapidly expanding its machine learning and AI capabilities with SageMaker and embedding ML into RedShift and driving analytics. Integration is key for its customers.

Now is Amazon retail going to run its business on io2 volumes? I doubt it. I believe it’s running on Oracle and needs much better performance – but this is a mainstream service for the EBS masses to tap.

EBS Gp3 Volumes

The other notable announcement from AWS was EBS Gp3 volumes. This is essentially a service that lets you programmatically set service level agreements for IOPS and throughput, independently, without needing to add additional storage. Again, you may be saying, “Solidfire let me do this years ago and gave me more than 3,000 IOPS and 125 MBPS performance.” But this is great for mainstream customers that want more consistent performance. And it’s integrated into the AWS stack.

And AWS announced that Gp3 volumes is priced at 20% less than previous generation volumes, which you might say is also no big deal – and we would agree – 20% is probably not as aggressive as the average price decline per GB of any storage technology.

Automatic tiering and replication

AWS made two other announcements at re:Invent this year: one that automatically moves data to colder storage tiers, as well as a replication service. On the former, data migrates to tier 2 after 90 days of access and tier 3 after 180 days. AWS hired a bunch of folks out of EMC years ago and put them up in the Boston Seaport, so it has acquired lots of expertise in a lot of different areas and, although this is not rocket science, it saves customers money.

And for sure we’d like to see more policy options than just a fixed 90- and 180-day policy. More importantly we’d like to see intelligent tiering where the machine is smart enough to promote certain data sets when needed – like at quarter end for comparisons and the like. But as NFL Hall of Fame coach Hank Stram would say: AWS is matriculating the ball down the field.

Storage spending in context: not a CIO priority

The chart below shows spending across the Enterprise Technology Research taxonomy. It depicts the Net Score, or spending velocity for different sectors. We’ve highlighted storage. Don’t put too much weight on the January data because the survey was just launched. But you can see storage continues to be on the back burner relative to other spending priorities. As we’ve reported, CIOs are really focused on cloud, containers, automation, productivity and other key areas such as security.

 

The numbers don’t lie: Traditional storage is in decline

The chart below shows data for eight leading names in storage. We put storage in quotes because as we said, the market is shifting and for sure companies such as Cohesity and Rubrik are not positioning as storage players. Rather, they’re category creators around intelligent data management. But they’re an adjacency and they are in the ETR storage taxonomy.

As you can see above, we’re showing the year-over-year quarterly revenue growth for the leading storage companies as reported in their financial statements. NetApp is the big winner, growing a whopping 2% (adjusted for constant currency). It beat expectations but expectations were way down. You can see in the rightmost column we’ve added the ETR Net Score from October. A Net Score of 10% says that if you ask customers are you spending more or less with a company, 10% customers are spending more than spending less.

For comparison, Snowflake has a Net Score around 75%. So 10% is in the red zone. And yet NetApp is the big winner this quarter.

Nutanix isn’t really a storage company, but again it’s adjacent to storage. Nutanix essentially created and still leads the hyper-converged infrastructure market. And it, like many of these companies, is transitioning to a subscription pricing model. So that puts pressure on the income statement. Every company on this chart is moving to an annual recurring revenue model and that as-a-service approach, such as HPE GreenLake and Dell APEX, will be the norm by the end of the decade.

HPE’s Nimble business has momentum, but other parts of the company’s storage portfolio are softer. Dell continues to see pressure on its storage business, although VxRail is a bright spot. And IBM’s mainframe storage cycle seems to have run its course.

Data protection and data management vendors are different

And you can see the data protection, data management companies are showing spending momentum. They’re not public, so we don’t have revenue data. But one has to wonder: With all the money these guys have raised and the red-hot IPO and tech markets, why haven’t these guys gone public?

The answer has to be that they’re not ready. Most likely, their numbers aren’t where they want them to be, they’re not predictable enough, they don’t have their operational act together or some combination of those factors. They’ll give other answers, but if they had their stuff together they’d be going out right now.

Not shown are other backup, data protection and data management players such as Veeam, Commvault and Veritas that are all vying for a piece of the pie.

A deeper look at spending momentum

The chart below shows the Net Score breakdown for key players in the ETR storage taxonomy. Now again, ETR here is measuring the percentage of respondents that are adopting new, spending more, spending flat, spending less or retiring the platform. Net Score is calculated as follows: (Adoptions, lime green + Spending More, forest green) minus (Spending Less, pink + Leaving, bright red).

Let’s look at the picture here. Cohesity leads all players in the ETR storage category with a 55% Net Score – you can see the raw data in the upper right corner, followed by Nutanix. Now they’re really not in the scope of a pure storage play; they’re an adjacency – but they do partner with many of the major storage players.

Speaking of Pure, its Net Score has come down from its high of 73% in January of 2016. It won’t likely climb up that high again, but it will be interesting to see if Pure’s Net Score can rebound in a post-COVID world. We’re watching what Pure does in terms of unifying file and object, how it’s faring in cloud and what it does with the Portworx acquisition.

Dell is doing fine with VxRail, but VMware’s vSAN is well off its Net Score highs, which were in the 60%-plus range a couple of years ago. But vSAN has definitely been a factor. HPE with Nimble we think still has room to improve and will continue to be strong.

Dell and NetApp are the big leaders in terms of presence in the data other than VMware, which has a lot of instances in the data set because of its volume. Because Dell and NetApp are so large and mature,  you’d expect them to have Net Scores that are tepid and you can see a similar pattern with IBM.

Commvault, like Cohesity and Rubrik, is really a data management play – going beyond backup into business recovery and data protection, and bringing that to the cloud. We didn’t put Veeam in here and should have; it had pre-COVID Net Scores well into the 30s and has a steadily increasing share of the market, so we expect good things from Veeam going forward.

But it’s all relative. This is a large and mature market that is moving to the cloud and other adjacencies.

On-prem storage players are pivoting

The chart below gives you another view of the competitive landscape. It’s our classic XY graph plotting Net Score on the vertical axis and Market Share on the horizontal. Market Share remember is presence in the data set.

Now think about this from the chief information officer’s perspective. They have their on-prem estate and they’re putting a brick wall around their core systems. What do they want out of storage for that class of workload? They want it to perform consistently. They want it to be efficient and cost-effective.

So what are they going to do? They’re going to consolidate vendors, consolidate storage, minimize complexity. On-prem storage arrays are not giving them the innovation and strategic advantage back when virtualization, space efficient snapshots, data deduplication and other storage services were worth taking a flyer on with a “feature product” such as 3PAR, for example.

Flash gave them headroom and consistent performance and so as more and more work gets pushed to the cloud, you’re seeing the bunkering in on large-scale mission-critical workloads. As you saw earlier, the legacy storage market is consolidating and has been for a while. It’s becoming a managed-decline business where R&D is going to get increasingly squeezed.

We mentioned VAST Data earlier, and it’s growing. Another company that’s growing is Infinidat. It’s focused on petabyte-scale storage, and it’s having success consolidating storage. These are two companies with Israeli founders that are having success. But it’s all a share shift. These two companies are tiny in the grand scheme of things and their end game is to get acquired – so they can steal share, but they’re not going to reverse the trends.

It’s worth noting another point that underscores the consolidation that’s occurring in the market. Just look at the latest Gartner Magic Quadrant for primary storage. Sixty-two percent of the competitors are in the leaders quadrant!

The point is that differentiation is getting more difficult. And everyone on the ETR chart above – every on-prem player – has to have a cloud strategy to connect to the cloud, take advantage of native cloud services and help extend their respective installed bases into the cloud — including having a capability that is physically proximate to the cloud, with a co-lo or other approach. In addition, almost all players are going after multicloud and building subscription businesses.

AWS is redefining hybrid and the edge

AWS wants to bring AWS to the edge – and it looks at the data center as just another edge node. This is a different view than that of the traditional on-prem players. For example, at re:Invent we saw AWS’ hybrid strategy evolve. We saw smaller versions of Outposts, added Local Zones, software development kits for edge computing and other related announcements– all part of bringing AWS’ strategy to expand its presence.

We also saw a few companies – Pure, Infinidat and Veeam come to mind –that are connecting to Outposts. Also Qumulo, Clumio, Commvault and Weka made similar announcements. No doubt there are some we may have missed and more coming.

Prognosis for the future

Where have all the good times gone? Look, we would never bet against data, but the days of selling storage controllers that mask the deficiencies of spinning disk or add embedded hardware functions, or easily picking off a legacy installed bases with flash — those days are gone.

The narrative we hear from many on-prem vendors about repatriation isn’t happening in any meaningful way. CIOs are going to rationalize their on-prem portfolios so they can invest in the cloud, AI, automation and re-skill their teams. Low latency, high bandwidth workloads with minimal jitter are the sweet spot for on-prem. It’s becoming the mainframe of storage.

CIOs are also developing a cloud-first strategy. Yes the world is hybrid. What does that mean to CIOs? It means you’re going to have some work in the cloud and some on-prem. Everything that can go to the cloud will. Everything that can’t won’t. And they’ll build an abstraction layer to connect workloads from an observability standpoint so they can maintain control and manage lock-in risk.

Everything that doesn’t go to the cloud will likely have some type of “hybrid-icity” to it. The reverse won’t likely be the case because lots of workloads that are running in the cloud don’t need on-prem linkages.

For vendors, cloud strategies involve supporting customer migrations to the cloud. Enabling low latency hybrids, accommodating subscription models and managing the decline of a large estate, profitably. Traditional vendors are rethinking what business they’re in – for example, data management and developing an edge strategy that recognizes that edge workloads will require a new, more efficient architecture than we’ve seen built around general-purpose systems.

The future is cloud, the cloud is expanding, the market is large. It’s just a lot different than what we’ve seen over the last 20 years.

Remember these episodes are all available as podcasts wherever you listen. Check out ETR’s Web site. We also publish a full report every week on wikibon.com and on SiliconANGLE. Ways to get in touch: Email david.vellante@siliconangle.com, DM @dvellante on Twitter and comment on our LinkedIn posts.

Here’s the full video analysis:

Image: geralt/Pixabay

A message from John Furrier, co-founder of SiliconANGLE:

Your vote of support is important to us and it helps us keep the content FREE.

One click below supports our mission to provide free, deep, and relevant content.  

Join our community on YouTube

Join the community that includes more than 15,000 #CubeAlumni experts, including Amazon.com CEO Andy Jassy, Dell Technologies founder and CEO Michael Dell, Intel CEO Pat Gelsinger, and many more luminaries and experts.

“TheCUBE is an important partner to the industry. You guys really are a part of our events and we really appreciate you coming and I know people appreciate the content you create as well” – Andy Jassy

THANK YOU