Professional Documents
Culture Documents
Cloudera Enterprise Evaluating The True Value of A Modern Data Platform PDF
Cloudera Enterprise Evaluating The True Value of A Modern Data Platform PDF
Cloudera Enterprise:
Evaluating the True Value
of a Modern Data Platform
Version: 102
Fast
Data is the new normal. We have entered an age where we can measure anything and everything.
This is pervasive across industries and your competitors. The competitive advantage of data comes
from not only the insights we can gain from it, but also how quickly we can do so. You need to not
only understand what happened, but have the ability to understand why and how to change what
happens in the futureall in near real-time. Only a modern data platform such as Cloudera Enterprise
can support this new analytics paradigm, with the fastest time-to-insights.
This all starts with the ability to handle both data at-rest and data in-motion. With the rise of the
Internet of Things and sensor data, data is being generated and collected faster than ever before
and the ability to tap into the value of this data is critical. Your platform needs to support not only
the ability to ingest and process this streaming data in real-time, but also make it available for
analytics and data applications for immediate business value.
Once the data is available in the platform, insights cannot then be limited to a select few or siloed
off for different departments. From processing to serving, and all the analytics in between, a modern
data platform needs to support the full cycle of insights, all within a single enterprise data hub to
deliver the fastest time-to-value.
CLOUDERA ENTERPRISE:
EVALUATING THE TRUE VALUE
OF A MODERN DATA PLATFORM
WHITE PAPER
2
For the data engineers, brittle ETL pipelines and missed SLAs can become a thing of the past.
Cloudera Enterprise is designed to handle large-scale, batch processing workloads over flexible
data types. This means workloads will run orders of magnitude fastercutting days down to
minutesand scale to support more data sources and outputs, so data is always available for
reporting or other workload needs right when your business needs it.
CLOUDERA ENTERPRISE:
EVALUATING THE TRUE VALUE
OF A MODERN DATA PLATFORM
WHITE PAPER
3
For data scientists, they have often been separated from the rest of the business; forced to work
with small data samples as they train and test models, with no clean path to pass them off for
production scoring and serving, and a latent feedback cycle. Cloudera Enterprise opens up the
power of big data to these users, while allowing them to work with their preferred tools and libraries.
Data scientists can now have direct access to data in its entirety, and the best-of-breed processing
tool, Apache Spark, for faster model development. Integrations with popular machine learning
libraries and preferred languages such as Python and R means these users can be productive
out of the gate. Finally, as part of a single, unified platform that supports multiple applications,
these users can cleanly pass their models and applications to production for immediate results.
Data Engineering
& Science
Analytic
Database
Operational
Database
WHITE PAPER
4
UNIFIED SERVICES
STORE
DATA
MANAGEMENT
CLOUDERA ENTERPRISE:
EVALUATING THE TRUE VALUE
OF A MODERN DATA PLATFORM
OPERATIONS
INTEGRATE
Cloudera Enterprise
Easy
Especially at scale, a modern data platform must have easy administration to keep mission critical
applications up and running. Only Cloudera Enterprise provides the Operations Team with what
they need to focus on: new applications and results, not fighting fires. Supporting the largest scale
deployments and applications, Cloudera Manager is the most trusted tool for managing Hadoop in
production. Automated deployments and configurations let you get up and running quickly, and
fully customizable monitoring gives you the visibility and control you need to keep it running.
Whether you need to efficiently troubleshoot an issue, ensure optimal, multi-tenant performance,
or upgrade without downtime, Cloudera Manager is a single interface to manage it all with ease.
A direct connection to Clouderas expert support is also built in to Cloudera Manager. Using their
own modern data platform, Cloudera Support can quickly analyze your diagnostic information
against known issues, best practices, similar deployments, and more, to not only resolve issues
35% faster but also provide proactive guidance and protectionpreventing over 15% of issues
before they actually become issues.
For most enterprises, its only a matter of time before they have a footprint in the public cloud, if
not already. In fact, a recent study by Gartner found that the average enterprise is using 4.6 public
cloud providers. Its critical that a modern data platform can be deployed anywhere, so the business
can get value from all its data, whether its on-premises, in one or many cloud environment(s), or
all of the above. Cloudera Enterprise is the only hybrid platform that allows you to take advantage
of the scalability and flexibility of the cloud, while still getting the same high-performance,
enterprise-grade platform.
Using Cloudera Director, you can deploy how you want, when you want, and manage multiple
clusters across cloud providers from a single, unified interface. Additionally, Cloudera Director
makes it easy to reduce your overall operating costs, whether you want to orchestrate transient
workloads for efficient ETL and batch analytics, or support elastic demand for analytics and
reporting. Finally, by featuring native integration with cloud object stores, such as Amazon S3,
you can start getting value from your data immediately, no matter where it lives.
CLOUDERA ENTERPRISE:
EVALUATING THE TRUE VALUE
OF A MODERN DATA PLATFORM
WHITE PAPER
5
Perimeter
Access
Visibility
Data
Guarding access to
the cluster itself
Reporting on where
data came from and
how its being used
Protecting data in
the cluster from
unauthorized visibility
Technical Concepts
Technical Concepts
Technical Concepts
Auditing
Lineage
Encryption, Tokenization,
Data Masking
CLOUDERA MANAGER
CLOUDERA NAVIGATOR
Authentication
Network Isolation
Permissions
Authorization
Technical Concepts
Clouderas platform ensures you have everything you need to protect your data and your customers
optimized and automated for Hadoop scale. Even your most sensitive data can be used for analytics
with native, high-performance encryption that protects everything in your platform, without impacting
the time-to-insights. Paired with the only enterprise-grade key manager for Hadoop, you can rest
assured that your data and keys are protected.
Additionally, you can safely open up access to all users with uniformly enforced, role-based access
controls. No matter which platform tools they are using, they will get fine-grained access to the
data they need to do their job, without the manual burden on the Security Team.
Finally, no modern data platform is complete without integrated data management and governance.
Not only is governance a critical aspect for any compliance audit, but it provides necessary visibility
and controls to make sure your platform and data are actually usable to the business. From a security
perspective, you automatically get full audit and lineage information to understand who is accessing
what, and how data is changing. When paired with metadata discovery and policy management,
this also allows data stewards to curate data for the business based on usage and enable new
insights on new, trusted data.
Cloudera has led the way when it comes to security in Hadoop. In fact, Cloudera Enterprise is the
only Hadoop distribution to have passed compliance audits with our most regulated financial services,
healthcare, and retail customers.
As this solution not only dealt with sensitive data, but also had to integrate with other
regulated internal systems, MasterCard had to ensure that the platform could meet its
high security standards and comply with PCI DSS (Payment Card Industry Data Security
Standards). With Clouderas full security stack and industry expertise, they are able to
pass this compliance audit year after year.
STREAM
SQL
SEARCH
OTHER
UNIFIED SERVICES
RESOURCE MANAGEMENT
SECURITY
DATA
MANAGEMENT
OPERATIONS
FILESYSTEM RELATIONAL
NoSQL
OTHER
STORE
STRUCTURED
UNSTRUCTURED
INTEGRATE
Hybrid Development
Flexibility
Public Cloud
Private Cloud
Hybrid Environments
Conclusion
CLOUDERA ENTERPRISE:
EVALUATING THE TRUE VALUE
OF A MODERN DATA PLATFORM
WHITE PAPER
7
The right technology is key for removing the barriers to your data and turning it into business
value. You need a modern data platform built to handle any data, wherever it lives, while scaling
analytics and data science to the masses. Powered by Hadoop, Cloudera Enterprise is the fastest,
easiest, and most secure modern data platform that leading organizations trust to get the results
that drive their business. Contact us to get started.
About Cloudera
Cloudera delivers the modern platform for data management and analytics. Public sector
organizations trust Cloudera to help them apply data to the center of their missions with
Cloudera Enterprisethe fastest, easiest, and most secure platform built on Apache
Hadoop and the latest open source technologies. Agencies can efficiently capture, store,
process, and analyze vast amounts of dataempowering them to use advanced analytics
to drive business decisions quickly, flexibly, and at lower cost than has been possible
before. Focused on customer success, Cloudera offers comprehensive support, training,
and professional services. Learn more at cloudera.com.
cloudera.com
1-888-789-1488 or 1-650-362-0488
Cloudera, Inc. 1001 Page Mill Road, Palo Alto, CA 94304, USA
2016 Cloudera, Inc. All rights reserved. Cloudera and the Cloudera logo are trademarks or registered trademarks of Cloudera Inc. in the USA and
other countries. All other trademarks are the property of their respective companies. Information is subject to change without notice.