Microsoft’s Big Data Approach – SQL and Hadoop – Ok I Guess

Microsoft is putting out their big data roadmap and approach. I’ve captured it for you here (see below). SiliconANGLE.com covered the Hortonworks Microsoft Hadoop announcement here.

Microsoft has had great success with platforms and developers (as pointed out by the disgruntled Google engineer which we reported on – link). The Google engineer goes on to say about Micrsoft with respect to their understanding of platforms. He says ” Microsoft gets it. And you know as well as I do how surprising that is, because they don’t “get” much of anything, really. But they understand platforms as a purely accidental outgrowth of having started life in the business of providing platforms. So they have thirty-plus years of learning in this space. And if you go to msdn.com, and spend some time browsing, and you’ve never seen it before, prepare to be amazed. Because it’s staggeringly huge. They have thousands, and thousands, and THOUSANDS of API calls. They have a HUGE platform.

Microsoft has an opportunity with big data and hadoop. I hope that they don’t get to religious on the database side and see the opportunities in the software side around supporting huge clusters and more importantly security, multi-tenancy, and provisioning of big data infrastructure.

Good luck Microsoft and welcome to the game.

Here is the post from Microsoft.

From Microsoft SQL Team

A few months ago, we announced our commitment to Apache Hadoop™ providing details on interoperability between SQL Server and Hadoop. As we have noted in the past, in the data deluge faced by businesses, there is an increasing need to store and analyze vast amounts of unstructured data including data from sensors, devices, bots and crawlers and this volume is predicted to grow exponentially over the next decade. Our customers have been asking us to help store, manage, and analyze these new types of data – in particular, data stored in Hadoop environments.

During the Ted Kummert’s Day 1 keynote of SQL Server PASS Summit 2011, we disclosed an end to end roadmap for Big Data that embraces Apache Hadoop™.

To deliver on this roadmap, we announced:

– The general availability (GA) the release to manufacturing of the Hadoop connector for SQL Server and Hadoop connector for SQL Server Parallel Data Warehouse free to licensed SQL Server & PDW customers. These connectors will enable bi-directional data movement across SQL Server and Hadoop enabling customers work effectively with both structured and unstructured data

– Plans to deliver a Hadoop based distribution for Windows Server and Hadoop based service for Windows Azure. By enabling organizations to deploy Hadoop based big data analytic solutions in Hybrid IT scenarios either on premises, in the cloud or both, customers have the flexibility to process data wherever it is born and wherever it lives. Both distributions will offer simplified acquisition, installation and configuration experience of several Hadoop based technologies i.e. HDFS, Hive, Pig etc., enhanced security through integration with Active Directory, unified management through integration with System Center and a familiar and productive development platform through integration with Visual Studio and .NET – all of this optimized to provide the best in class performance in Windows environments.

– Plans to integrate Hadoop with Microsoft’s industry leading Business Intelligence Platform that will enable users to use the familiar productivity tools such as Microsoft Excel and award winning BI clients such as PowerPivot for Excel and Power View to perform analysis on Hadoop datasets in an immersive and interactive way. Our first set of deliverables here will include a Hive ODBC Driver and Hive Add-in for Excel.

– A strategic partnership with Hortonworks that enables us to build on the experience and expertise from the Hadoop ecosystem to help us enable Hadoop to run great on Windows Server and Windows Azure. Hortonworks was formed by the key architects and core Hadoop committers from the Yahoo! Hadoop software engineering team in June 2011 and the team is a major driving force behind the next generation of Apache Hadoop.

– Our commitment to working closely with the Hadoop community and proposing contributions back to the Apache Software Foundation and the Hadoop project which is very much in line with our goal of broadening the adoption of Hadoop. E.g. making JavaScript a first class language for Big Data by enabling the millions of JavaScript developers to directly write high performance Map/Reduce jobs is the sort of innovation that Microsoft hopes to contribute back as proposals to the community.

The CTP of our Hadoop based service for Windows Azure will be available by the end of this calendar year. This CTP will include the Hive ODBC Driver, Hive Add-in for Excel and JavaScript support. The Hadoop based distribution for Windows Server will be available in CY 2012.

Building on our leading Business Intelligence and Data Warehousing platform, we are extending our mission to ‘provide business insight to all users from not only the structured and unstructured data that exists in databases and data warehouses today, but from non-traditional data sources e.g. file systems that include large volumes of data that has not previously been activated to provide new business value.’

We hope to deliver on this mission by making Hadoop accessible to a broader class of developers, IT professionals and end users, by providing enterprise class Hadoop based distributions on Windows and by enabling all users to derive breakthrough insights from any data.

For more info their resource page is at http:// microsoft.com/bigdata.

About John Furrier

Founder and CEO of SiliconAngle.com.

Leave a Reply

Your email address will not be published. Required fields are marked *

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>