Professional Documents
Culture Documents
Unstructured Data Guide
Unstructured Data Guide
nsider how
generating copious amounts of to move forward with managing unstructured data in the simplest and most
data every second that could fill sensible way for your business.
up a good portion of the dark
matter in the universe. IDC fore- LOOK AT YOUR DATA BREAKDOWN
casts that by 2025 there will be 10 Structured data—such as that found in databases—is highly organized, and the
times the amount of data that we patterns in it make it easy to manage and search. On the other hand, unstruc-
have today, which means that our tured data is raw and unorganized: think spreadsheets, documents, emails, PDFs,
data journey is just beginning, and text files, photos, videos, and data from social media. Organizations that place
we could create higher business priority on how to tackle unstructured data will be set apart from organizations
value with strategic management that collect all data and fill up storage infrastructure. Not all data is important to
of data. Most of the data being drive increased business value. If IT teams adopt business drivers from the onset,
produced falls into 2 camps: struc- they will not only optimize storage resources, they will be able to identify data
tured or unstructured data. And which is not valuable to business and discard it. This will also free up IT teams to
unstructured data is a difficult look for appropriate technologies versus constantly putting out storage fires.
animal to tackle.
Unstructured data is growing exponentially; in fact, 80% of corporate enterprise
data is unstructured. This means that you are going to be generating and manag-
ing terabytes or even petabytes of unstructured data, which is stored as either
file or object. So it’s imperative to look for a solution that is easy to deploy,
manage, and scale file and object storage.
2. Performance: Latency and throughput are the key parameters to evaluate performance for
your storage solution. This would include sequential file or object access as well as random
file access including small files.
These types of storage are naturally found close to compute resources. However,
as data continues to grow— specifically the kind that is not ready to be migrated
to cold storage and doesn’t need to be stored with compute—there should be
options for storing that data efficiently, securely, and cost effectively.
Object storage is designed for unstructured data that is highly scalable and
resilient for the world of cloud computing. Unstructured data is not a fixed
PRO TIP: format and consists of varying size files. It’s storage that’s accessible over the
Spend time upfront researching
network via simple S3-compatible HTTP REST API calls. S3 (Simple Storage
how enterprise data is broken down
—structured or unstructured. It will
Service) was developed by Amazon Web Services, which has become the
save your organization time and de-facto standard for many of the leading object storage providers today.
money in the long run. The reason why? They were first to market, and a well documented interface.
QUESTIONS TO ASK
YOURSELF AS YOU 1 Are you constantly being asked to allocate storage for your end users
or application owners?
ASSESS YOUR DATA
Various stakeholders in an
IT organization care (for 2 Have you considered more cost-effective solutions for your unstructured
data than hosting it on your primary NAS/SAN tier?
their own reasons) about
what’s happening with data. Have you considered reducing your datacenter footprint by
There are some key questions 3 consolidating your file, block, and object storage? (And are
you worried about losing some of the benefits by doing so?)
virtualization admins, storage
admins, infrastructure ops
leaders, and DevOps 4 Have you considered how object storage solutions can help
with active archives?
engineers will want to con-
sider as you think through
how to manage your data 5 How easy is it to scale your environment if you wanted to
start small?
efficiently and effectively.
While you may have other questions to answer that pertain more specifically
to your own datacenter’s needs, this list gives you a good starting place to
contemplate your next move in data storage and accessibility.
Enterprises typically get into a cycle of solving point problems and don't always
have time to look at technology updates holistically, which leads to over-purchasing
and under-utilization as they buy based on a three to five year cycle. Further-
more, datacenters end up with a variety of products and increased complexity
between vendor compatibility.
If you are lucky, the newer technology will reduce some of the datacenter
footprint, but not like hyperconverged infrastructure (HCI) solutions.
ONE-CLICK MANAGEMENT
A storage silo not only creates an infrastructure silo, but also involves complex
management tools that require specialized skills. Administrators have to learn
proprietary tools and concepts to manage legacy storage silos. Further, managing
storage performance and troubleshooting performance issues historically steals
away nights and weekends from already busy IT teams. Nutanix offers manage-
PRO TIP: ment of all services including storage from Prism, our unified management plane.
If you need specialized skills with There is a one-click performance optimizer that makes recommendations to
a steep learning curve to manage
scale up, scale out or load balance, and once the administrator accepts those
a storage solution, then think twice.
recommendations, it implements them. This frees administrators from constant
performance monitoring and troubleshooting, so that they can focus on higher
business value tasks.
PROACTIVE DIAGNOSTICS
Most enterprise IT solutions force a reactive approach to system maintenance
and issue resolution. The process usually begins with creating a ticket with the
vendor, followed by issue recreation in the vendor environment, after which the
debug can finally begin. Instead, if a more proactive approach is adopted, where
the system could raise alerts before IT administrators were trapped into a critical
PRO TIP: issue, or begin troubleshooting as soon as the issue arose, that would save time
Think of proactive versus and resources while achieving higher service levels.
reactive troubleshooting support
in your solution of choice.
Nutanix simplifies and streamlines this process through two important support
services: Pulse and Alerts. When enabled, Pulse captures purpose-driven diag-
nostic data every hour. Alerts are event driven and help in proactive support.
HEALTHCARE
Files make up a good part of data storage in the healthcare industry outside
of databases used for electronic health and medical records, and the industry
has a particular set of needs when it comes to storage that can manage it all:
Backup/Archiving: As images and diagnostic data are stored for a very long
time, storage needs to be high capacity and low cost.
FINANCIAL SERVICES
Financial Services is one of the biggest consumers of file storage due to the
sheer volume of files created, used, and shared. These files can be anywhere,
from home directories such as spreadsheets, text documents, and pdfs to
stand-alone Microsoft Access files. When thinking about data in this industry,
top-of-mind topics include:
Imaging data: As organizations embrace IoT and AI, imaging data is growing
in use—everything from tracking number of people in line at a store to facial
recognition for access authentication to automating detection of faulty products.
Supply chain data: While supply chain data is prolific, it is more structured,
dealing with inventory, distribution, and transportation.
Nutanix makes infrastructure invisible, elevating IT to focus on the applications and services that power their business.
The Nutanix enterprise cloud platform leverages web-scale engineering and consumer-grade design to natively converge
compute, virtualization and storage into a resilient, software-defined solution with rich machine intelligence. The result
is predictable performance, cloud-like infrastructure consumption, robust security, and seamless application mobility
for a broad range of enterprise applications. Learn more at www.nutanix.com or follow us on Twitter@nutanix.
©2018 Nutanix, Inc. All rights reserved. Nutanix is a trademark of Nutanix, Inc., registered in the United States
and other countries. All other brand names mentioned hereinare for identification purposes only and may be
the trademarks of their respective holder(s).