Published by: debashisdas on Mar 13, 2012
Filing Information: February 2012, IDC #233348, Volume: 1, Tab: Vendors
Database Management and Data Integration Software: Insight
INSIGHT
Oracle's All-Out Assault on the Big Data Market: Offering Hadoop, R, Cubes, and Scalable IMDB in Familiar Packages
Carl W. Olofson
The Big Data space is rapidly evolving. The first wave of adoption involved Web-based companies such as online retailers, service providers, and social media firms. These companies adopted open source technologies such as Apache Hadoop and used considerable in-house technical expertise to build business solutions on top of these open source foundations. The second wave will involve businesses that both lack technical teams of the same size and depth as the Web-based companies and are averse to the risk and cost associated with large investments in original software development. These businesses will be attracted to finished products from established companies that offer short paths to business analytic solutions using Big Data technologies. Oracle is seeking to appeal to such firms with:
Well-packaged sets of preinstalled, integrated, and optimized software on select hardware in the form of engineered systems and appliances
Products offered in a way that enables users to integrate them into their existing Oracle Database and Fusion Middleware environment
Technologies that include the Big Data capabilities in highest demand, including Hadoop, support for the R language, and scalable in-memory database (IMDB) functionality
This IDC Insight considers a number of key product announcements made by Oracle in January and February 2012 as well as their role in the company's strategy with respect to Big Data and their likely impact on the software markets associated with Big Data technology. The most recent announcement concerns Oracle Advanced Analytics, an option of Oracle Database 11g. This announcement aligns strategically with the following three product announcements that establish comprehensive Oracle coverage of the Big Data space:
Oracle Exalytics In-Memory Machine
Oracle Big Data Appliance
Oracle TimesTen In-Memory Database 11g Release 2
Taken together, these products address three key Big Data areas: advanced and large-scale analytics, Hadoop-based data classification and extraction, and scalable in-memory database (IMDB) technology.
On February 8, 2012, Oracle announced general availability of Oracle Advanced Analytics. This option of Oracle Database 11g Enterprise Edition includes Oracle Data Mining and a new component called Oracle R Enterprise, which embeds R analytic capability in the database server. Previously, Oracle announced the Oracle Exalytics In-Memory Machine and the Oracle Big Data Appliance at Oracle OpenWorld 2011. In mid-January 2012, the company announced pricing and general availability for these two products plus a greatly enhanced version of the in-memory relational database management system (RDBMS), Oracle TimesTen. Taken together, this database option and these three products address key areas of the Big Data space and represent a significant move by Oracle to establish itself as a major Big Data player. IDC identifies three key areas of Big Data as:
Large-scale advanced analytics
Hadoop-driven Big Data processing
Scalable in-memory database management
This combination represents a comprehensive approach to the Big Data problem space. This Insight considers each area in turn, focusing on how Oracle is addressing it.
Oracle describes its approach to the Big Data space as encompassing four key stages:
Acquire: Collect, ingest, and format data for analysis
Organize: Put data into an order that supports either deep analysis or integration into a larger structured data collection, such as a data warehouse
Analyze: Perform either standard query-based/online analytical processing (OLAP) analysis or deep statistical analysis on the resulting data set
Decide: Yield results that can drive both tactical and strategic business decisions
The Oracle Big Data Appliance takes the user from the Acquire to the Organize stage, the Oracle Exadata Database Machine (or other Oracle Database 11g Enterprise Edition installation) with the Oracle Advanced Analytics option takes the user from the Organize to the Analyze stage, and the Oracle Exalytics In-Memory Machine takes the user from the Analyze to the Decide stage.
These products (note that Oracle Exalytics In-Memory Machine includes Oracle TimesTen) fall into the three functional areas described in this Insight as key elements of the Big Data space.
Large-Scale Advanced Analytics 
This functional area includes the ability to accumulate large amounts of data in a scalable space for high-performance deep analysis. Oracle is addressing this area with two product offerings:
Oracle Advanced Analytics is an Oracle Database 11g Enterprise Edition option that includes Oracle Data Mining and Oracle R Enterprise for those who wish to perform deep data mining and analytics driven by the R language, with those analytics executing in the database engine.
Oracle Exalytics In-Memory Machine is for those seeking an engineered system that is preconfigured to support classic online analytical processing using in-memory cubes powered by Oracle Essbase, or relational data held in memory by Oracle TimesTen for fast execution. (Note that Exalytics can support large data sets that extend beyond the main memory capacity of the system by sending SQL queries to a back-end database such as Oracle running on Exadata.)
Oracle Advanced Analytics
This option of Oracle Database 11g Enterprise Edition has two components: Oracle Data Mining and Oracle R Enterprise. The former is an upgraded version of the data mining option that Oracle has offered for a number of years. The latter is a capability embedded in the database engine that allows the user to build R analytics that execute in the database close to the data for better performance. The system allows R users to access table data within the database using the familiar variables and other constructs of the R language. Data retrieval, statistical and predictive analysis operations, and advanced numerical computations expressed in R are converted into SQL and executed under the covers, so the R programmer does not need to have expertise in relational database technology or the structure of the database in question. The role of this option is to allow "quants" who prefer to use R as their means of doing deep analytics to use that language in a high-performance way directly against the database data rather than as an external facility that requires considerable configuration to set up.
It should be noted that Oracle Advanced Analytics is a database option and so can be used with any installation of Oracle Database 11g Enterprise Edition. This also means that it can be used within the Oracle Exadata Database Machine. When Oracle Advanced Analytics is used with the Oracle Real Application Clusters (RAC) option of Oracle Database, or within the Oracle Exadata Database Machine (which includes RAC), the user also takes advantage of the scalability of parallel SQL execution, which IDC also considers a key Big Data characteristic for relational databases.
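The general idea described above, in which operations written in the analyst's language are converted into SQL and executed inside the database, can be illustrated with a minimal sketch. This is not Oracle R Enterprise and does not reflect its actual interface: it uses Python and the standard-library sqlite3 module purely for illustration, and the TableProxy class, its methods, and the sales table are all hypothetical names invented for this example.

```python
import sqlite3


class TableProxy:
    """Hypothetical stand-in for a database table seen as a language-level object.

    Operations build SQL and run it inside the database, so only the
    small result crosses to the client, not the raw rows.
    """

    def __init__(self, conn, table, where=None, params=()):
        self.conn = conn
        self.table = table
        self.where = where
        self.params = tuple(params)

    def filter(self, column, op, value):
        # Only a fixed set of operators is allowed; table and column
        # names are assumed to be trusted in this sketch.
        if op not in {"=", "<", ">", "<=", ">=", "!="}:
            raise ValueError(f"unsupported operator: {op}")
        clause = f"{column} {op} ?"
        where = clause if self.where is None else f"({self.where}) AND {clause}"
        return TableProxy(self.conn, self.table, where, self.params + (value,))

    def to_sql(self, aggregate, column):
        # Translate the pending operation into a single SQL statement.
        sql = f"SELECT {aggregate}({column}) FROM {self.table}"
        if self.where is not None:
            sql += f" WHERE {self.where}"
        return sql

    def mean(self, column):
        # The aggregation executes in the database engine, not in Python.
        sql = self.to_sql("AVG", column)
        return self.conn.execute(sql, self.params).fetchone()[0]


def demo():
    # Build a tiny in-memory table to stand in for warehouse data.
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE sales (region TEXT, amount REAL)")
    conn.executemany(
        "INSERT INTO sales VALUES (?, ?)",
        [("east", 10.0), ("east", 30.0), ("west", 50.0)],
    )
    sales = TableProxy(conn, "sales")
    east = sales.filter("region", "=", "east")
    return east.to_sql("AVG", "amount"), east.mean("amount")
```

In this sketch the analyst writes sales.filter(...).mean(...) without composing any SQL, while the filtering and averaging are carried out by the database engine. That is the same division of labor the option is described as providing for R users, although the real product's translation layer is presumably far more extensive.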

