UPDATED 19:19 EDT / FEBRUARY 12 2014

NEWS

WANdisco defends HBase as complaints, standards emerge | #BigDataSV

konstantin-boudnikWebcasting from #BigDataSV in Santa Clara, California, theCUBE veterans John Furrier and Dave Vellante welcomed Konstantin Boudnik for a quick interview regarding the big data trends and the involvement of big companies in the open source process.

The former Yahoo and Cloudera architect and current Director of Advanced Technologies at WANdisco was asked for his view on Silicon Valley, Big Data and Strata.

“I think Big Data is being caught on more big enterprises’ radars. As always when it comes to new technologies, there’s a lot of noise, but also a lot of real deal. Some of the real-deal companies are the ones standing behind the open source movement, essentially the Hadoop and HBase technologies, as well as the real-time processes such as Spark and Shark,” said Boudnik. “We see huge momentum gain in these companies. I believe this segment of the market will grow up and keep growing.”

In Boudnik’s opinion, NonStop HBase is growing at a fast pace and all the analytical tool-sets are still struggling to reach HBase’s level of responsibility.

“In terms of interest from the companies, we see that the WAN edition is significantly higher in demand. People need to have a way to solve the disaster and recovery problem between the data centers, considering that the traditional solutions are very costly and have limited capabilities,” explained Boudnik. “NonStop WAN is being received very well.”

 

Addressing issues with emerging standards in Hadoop

 .

“HBase is pretty good,” agreed Vellante, “it is catching a lot of momentum in the marketplace and it is the de-facto standard in Hadoop, but there are also a lot of complaints about it. There’s a lot of heavy-lifting involved, it’s actually not Non-Stop, do you think those complaints are valid?” asked Vellante.

Boudnik replied: “HBase has redundant fail-over capabilities built in. The administrative component manages the regional servers that track the data placements. There is a way to fail-over regional server if the ActiVision server failed, but the problem in distributive systems is the speed. If you are lucky, fail-over could happen very quickly. If you are not so lucky it could take a few hours. That essentially means server interruptions.

“Another problem is that the HBase client is not HA capable; the HDFS clients can fail-over quickly because the proxy-providers help them bounce to a standby node. But HBase doesn’t have such capabilities,”Boudnik goes on. “You have a bunch of clients writing the data at high speed into the regions servers. If the region server goes down, the fail-over capability that HBase has right now can help you to re-register the region server on a different host, but client won’t fail over.”

In the offering that was announced yesterday, WANdisco tried to address this issue, making multiple region servers available.

“You are saying that the problem is the fact that recovery is very variable,” summarized Vellante.

“We are now offering the clients the possibility to run multiple region servers,” said Boudnik.

“What kind of apps, use-cases you see your solution to be utilized in?” asked Vellante.

“Most of the real-time applications will benefit from it, because even at 10-20 seconds of downtime it might mean huge loss of data, as well as anybody using HBase in production, search, e-commerce, ad serving and fraud detection,” Boudnik answered.

And every new day there’s a new technology coming out. “I see the community being focused on bringing the stable API, complete version of HBase into the game,” declared Boudnik.

Contributing to a revolution

 .

About the big companies who have recently embraced open source, Vellante asked: “You have been in the technical circle of Hadoop since the beginning, what’s the contribution to the core of the technical community? Are there companies that are stepping up and bringing their capabilities to open source, or is it pure marketing?”

“The open source development is similar to the evolutionary process,” said Boudnik. “We try to approach thousands of different ways and see what works best. The community is thrilled by the arrivals of big names because the more people will contribute to the product, the better the product will be.”

Some companies are riding this wave for marketing reasons though, admits Konstantin Boudnik. In terms of contributions the regional core teams that focused on Hadoop and HBase are still around.  “You have to innovate and collaborate, bouncing ideas very quickly.”

Asked by  Furrier on the revelations of the current show from his standpoint,  Boudnik declared: “I am thrilled to see a lot of big companies getting on the  bandwagon. I like the momentum and the speed. We will gain from all that high speed craze,” concluded Boudnik.

 


A message from John Furrier, co-founder of SiliconANGLE:

Your vote of support is important to us and it helps us keep the content FREE.

One click below supports our mission to provide free, deep, and relevant content.  

Join our community on YouTube

Join the community that includes more than 15,000 #CubeAlumni experts, including Amazon.com CEO Andy Jassy, Dell Technologies founder and CEO Michael Dell, Intel CEO Pat Gelsinger, and many more luminaries and experts.

“TheCUBE is an important partner to the industry. You guys really are a part of our events and we really appreciate you coming and I know people appreciate the content you create as well” – Andy Jassy

THANK YOU