
Microsoft Big Data

Solution Sheet

This solution sheet is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS SUMMARY.
2011 Microsoft Corporation

CONTENTS
Introduction
Microsoft Big Data Solution
Broadening Access to Hadoop
Enterprise-ready Hadoop
Breakthrough Insights


Key customer challenges:

Data explosion, driven by declining hardware cost and new
data sources

Greater variety of data: customers need to analyze both
relational and non-relational data

Over 80% of data captured is unstructured

Increased velocity of data, requiring organizations to
respond quickly to rapidly changing data

The need to explore data interactively with few preconceived
questions

Introduction
Today's organizations face growing challenges extracting
business value from their data. First, the relentless growth of
data continues, driven by the proliferation of new devices and
sensors and by rapidly declining hardware costs. More
organizations now store terabytes and even petabytes of
data. Second, data complexity is increasing as customers
store both structured data in relational format and
unstructured data such as Word or PDF files, images, videos
and geospatial data. Indeed, industry analysts confirm that
over 80% of data captured is in unstructured format. Finally,
customers are challenged by the velocity of data:
organizations that process streaming data, such as click
streams from web sites, need to update data in real time to
serve the right advert or present the right offers to their
customers.
Microsoft has been doing Big Data since long before it became a
megatrend in the market: at Bing, we analyze over 100 petabytes
of data to deliver high-quality search results. More broadly,
Microsoft provides a range of solutions to help customers
address big data challenges. Our family of data warehouse
solutions (Microsoft SQL Server 2008 R2, SQL Server Fast
Track Data Warehouse, Business Data Warehouse and SQL
Server 2008 R2 Parallel Data Warehouse) offers a robust and
scalable platform for storing and analyzing data in a
traditional data warehouse. Parallel Data Warehouse (PDW)
offers customers enterprise-class performance that handles
massive data volumes, scaling to over 600 TB.
We also provide LINQ to HPC (High Performance Computing),
a distributed runtime and programming model for technical
computing.
In addition to the traditional capabilities mentioned above,
Microsoft is embracing Apache Hadoop™ as part of an end-to-end
roadmap to deliver on our vision of providing business
insights to all users by activating new types of data of any
size.


Microsoft Big Data Solution

Broadening Access to Hadoop

Microsoft's vision is to provide business insights to all
users from any data, including insights previously hidden in
unstructured data. To achieve this goal, Microsoft will ship an
Apache Hadoop™-based distribution for Windows Server and
Windows Azure to help accelerate Hadoop adoption in the
Enterprise.

Microsoft is committed to broadening the accessibility and
usage of Hadoop for users, developers and IT professionals.

This new Hadoop-based distribution from Microsoft enables
customers to derive business insights from structured and
unstructured data of any size and to activate new types of
data. Rich insights from Hadoop can be combined seamlessly
with the Microsoft Business Intelligence Platform.

Key Benefits

Broader access to Hadoop for end users, IT professionals and
developers, through easy installation and configuration and
simplified programming with JavaScript

Enterprise-ready Hadoop distribution with greater security,
performance and ease of management

Breakthrough insights through the use of familiar tools such
as PowerPivot for Excel, SQL Server Analysis Services and
Reporting Services

Our Big Data solution also offers interoperability with other
Hadoop distributions, enabling customers to derive insights
from several sources.

Two Hadoop Connectors: First, we offer two Hadoop connectors
that enable customers to move data seamlessly between Hadoop
and SQL Server or SQL Server Parallel Data Warehouse. These
two Hadoop connectors are now available to existing
customers.

Hive ODBC Driver, plus Excel Hive Add-in: Second, we offer a
new Hive ODBC Driver and an Excel Hive Add-in that enable
customers to move data from Hive directly into Excel, or into
Microsoft BI tools such as PowerPivot, for analysis.

The new Hadoop-based distribution for Windows offers IT
professionals ease of use by simplifying the acquisition,
installation and configuration experience. Thanks to smart
packaging of Hadoop and its toolset, customers can install
and deploy Hadoop in hours instead of days.

End users can use the Hive ODBC Driver or the Hive Add-in for
Excel to analyze data from Hadoop using familiar tools such
as Microsoft Excel and award-winning BI clients such as
PowerPivot for Excel.

Outline of Microsoft's Big Data Solution


For developers, Microsoft is investing to make JavaScript a
first-class language within Big Data by making it possible to
write high-performance Map/Reduce jobs using JavaScript. In
addition, our JavaScript console will allow users to write
JavaScript Map/Reduce jobs, Pig Latin scripts and Hive
queries from the browser to execute their Hadoop jobs. This
is the sort of innovation that Microsoft hopes to contribute
back as proposals to the community.
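
For readers new to the Map/Reduce programming model mentioned above, the following is a minimal in-memory sketch of a word-count job. It is written in Python purely for illustration: it does not use Hadoop or the JavaScript console described in this sheet, and the function names (map_words, reduce_counts, run_job) are hypothetical.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_words(line: str) -> Iterator[Tuple[str, int]]:
    """Map step: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def reduce_counts(word: str, counts: Iterable[int]) -> Tuple[str, int]:
    """Reduce step: sum all the counts emitted for a single word."""
    return word, sum(counts)


def run_job(lines: Iterable[str]) -> Dict[str, int]:
    """Run map, shuffle (group values by key) and reduce in memory.

    A real Hadoop job distributes these phases across many nodes;
    this hypothetical helper only shows the shape of the model.
    """
    grouped: Dict[str, List[int]] = defaultdict(list)
    for line in lines:
        for word, count in map_words(line):
            grouped[word].append(count)
    return dict(reduce_counts(word, counts) for word, counts in grouped.items())


if __name__ == "__main__":
    print(run_job(["big data", "big insights"]))
```

The map and reduce functions are independent of how the data is partitioned, which is what allows a framework such as Hadoop to run them in parallel over very large inputs.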


Enterprise-ready Hadoop

To accelerate its adoption in the Enterprise, Microsoft will
make Hadoop Enterprise-ready by:

Active Directory Integration: Providing Enterprise-class
security through integration of Hadoop with Active Directory

High Performance: Boosting Hadoop performance to offer
consistently high data throughput

System Center Integration: Simplifying management of the
Hadoop infrastructure through integration with Microsoft's
management tools such as System Center

BI Integration: Enabling integration of relational and Hadoop
data into Enterprise BI solutions with Hadoop connectors

Flexibility and Choice: Offering deployment options for
Windows Server and Windows Azure, which give customers:

o Freedom to choose: More control, as they can choose which
data to keep in-house instead of in the cloud.

o Lower TCO: Cost savings, as fewer resources are required to
run their Hadoop deployment in the cloud.

o Elasticity to meet demand: Elasticity reduces costs, since
more nodes can be added to the Windows Azure deployment for
more demanding workloads. In addition, the Azure deployment
of Hadoop can be used to extend the on-premises solution in
periods of high demand.

o Increased Performance: Bringing computing closer to the
data. Our solution enables customers to process data closer
to where the data is born, whether on premises or in the
cloud.

We do this while maintaining compatibility with existing
Hadoop tools and languages such as Pig, Hive, and Java. Our
goal is to ensure that applications built on Apache Hadoop
can be easily migrated to our distribution to run on Windows
Azure or Windows Server.

Breakthrough Insights

Microsoft's Big Data solution offers breakthrough insights by
enabling customers to combine the richness of relational data
from databases with unstructured data from Hadoop. Our
Hadoop-based distribution for Windows Server and Windows
Azure enables customers to:

Analyze Hadoop data with familiar tools such as Excel, thanks
to a Hive Add-in for Excel

Reduce time to solution through integration of Hive and
Microsoft BI tools such as PowerPivot and Power View

Build corporate BI solutions that include Hadoop data,
through integration of Hive and leading BI tools such as SQL
Server Analysis Services and Reporting Services

The Hive ODBC driver allows customers to move data from Hive
directly into either Microsoft Excel or SQL Server BI tools
such as SQL Server Analysis Services, Reporting Services,
PowerPivot and Power View for rich data visualization. These
insights can be incorporated into dashboards for consumption
by decision makers and stakeholders.

As mentioned earlier, our overall goal is to make Hadoop
accessible to a wider class of developers, IT professionals
and end users, by providing enterprise-class Hadoop-based
distributions on Windows and by enabling all users to derive
breakthrough insights from any data.

Additional Information
For more information on Microsoft's Big Data solution, go to
www.microsoft.com/bigdata
Download the Hadoop connector for SQL Server from
www.microsoft.com/download/en/details.aspx?id=27194
