You are on page 1of 70

IBM Business Partner

Learn and Earn:

IBM STORAGE SYSTEM &


SOFTWARE

Watana Sutathongthai (watanas@th.ibm.com) | Technical Consultant


Digital transformation relies on data
Persistent data challenges require expertise and capabilities

of data remains
behind corporate
firewalls
Speed & agility
are essential

IBM & Red Hat are


Security threats the hybrid cloud &
are omnipresent container leader
Enterprises are facing storage challenges that span in three
key areas

Data Movement Data Access Data Protection

• Manage data across a hybrid • Leverage existing investments • Backup and protect storage for
cloud environment to access data containers while ensuring high
availability and disaster
• Speed provisioning of storage recovery
• Support container app
portability and data mobility • Address limited or poorly • Protect hybrid cloud apps
documented storage APIs spanning VMs and containers

• Scale-up and/or down • Ensure comprehensive security


across container deployments
IBM Storage
makes your IT
infrastructure
more agile and valuable
today and for your future
With Comprehensive Container Storage for Red Hat OpenShift

IBM FlashSystem family IBM Spectrum Protect Plus


IBM DS8900F
Block Data
Protection

IBM Spectrum Fusion HCI


IBM Elastic
Storage System
IBM Spectrum Scale
File
IBM Spectrum Discover
IBM Spectrum Virtualize
IBM Cloud IBM Cloud Object Storage software
Red Hat OpenShift Container Storage Software
Object Storage Defined
IBM Storage Suite for IBM Cloud Paks
Object
IBM Spectrum Fusion HCI Turn-key OpenShift solution with full data and platform
management and global data acess
A single integrated solution for
faster business results and faster
path to hybrid cloud
IBM Spectrum Fusion HCI
• Integrated HCI appliance for both containers and
VMs using Red Hat OpenShift
• Highly scalable containerized file system with A single source to A single source to
erasure coding manage your container access global data
infrastructure
• Data resilience for local and remote backup and
recovery Object
• Simple installation and maintenance of hardware and
software
• Global data platform stretching from public clouds to File
any on-prem or edge locations
• IBM Cloud Satellite and Red Hat ACM native
integration
• Ready for AI applications with optional NVIDIA A100
GPUs and global data for better AI
• Starts small with 6 servers and scales up to 20 (with
HPC GPU enhanced options)
IBM FlashSystem family

Cloud Enabled Storage for IBM Spectrum Virtualize


IBM Spectrum Virtualize for Public Cloud
Containers IBM Storage Insights

● Seamlessly build your hybrid cloud with


high performance, efficiency and resiliency
FlashSystem
9200R
● Support for Red Hat OpenShift and
FlashSystem
Kubernetes container environments 9200

● Accelerate deployment of persistent


FlashSystem
volumes with the Container Storage 7200

Interface (CSI) driver


FlashSystem
5000
IBM DS8900F

Mission-Critical Workloads for


Containers
● Unify data intensive mission-critical Cloud
workloads and container-based Native
applications with up to 35% savings in
CAPEX and up to 91% in OPEX

● Support for Container Storage Interface Traditional workloads


(CSI) and IBM Cloud Pak solutions

● Next-level performance, security and


resiliency across your hybrid cloud
solutions

● Malware and ransomware protection with


IBM Safeguarded Copy
IBM Spectrum Protect Plus
Integrated Experience

Data Resilience for Containers

• Deploy IBM Spectrum Protect Plus Server as


a container using a Red Hat OpenShift
operator

• Familiar Kubernetes CLI and SLA policies to


manage and automate persistent volume
(PV) backup and recovery

• Integration with OpenShift APIs for Data


Protection (OADP) protects metadata to

Apps
Apps
ensure complete recovery and data mobility

• Quickly access IBM Spectrum Protect Plus


from IBM Cloud Pak for MCM
IBM Storage Suite for IBM
Cloud Paks

Data Services for Containers


• Enable quick deployment of data services for IBM Cloud Paks
your Cloud Pak with the choice of a
comprehensive set of software defined IBM Storage Suite for IBM Cloud Paks
storage

Data services &


• Simplify access to data wherever it exists

Management
• Consistent experience for your OpenShift IBM
Cloud
IBM
Spectrum
IBM
Spectrum
IBM
Spectrum Virtualize
containerized environment across platforms Object
Storage
Scale Discover for Public Cloud

• Dynamic scale for demanding applications

• Leverage the flexibility to mix the offerings as


needed in order to meet any of your workload
requirements
IBM Spectrum Scale
Comprehensive enterprise storage for AI and the
hybrid cloud data center
Container Native Storage
access
• Simple and optimized for hybrid cloud

• Globally accessible pool of data across the


enterprise

• Storage for containers in minutes for DevOps

• Simple management and online scalability

• Policy based data optimization and transparent


data lifecycle management

• Data availability, integrity and security


IBM Spectrum Discover
Containerized
IBM Spectrum Discover
Data Catalog for Containers

auto-cataloging
• Analyze and identify data anywhere
ingest-notification
• Easily deploy in Red Hat OpenShift environments
making it more portable and flexible across
clouds

• Collect data directly from Red Hat OpenShift


Container Storage to help organize data in
containerized environments
IBM Cloud Object Storage

Efficient and secure

• Highly scalable cloud storage solution for on-


premises and cloud-based application and
container-based solutions

• Access data from edge, data center or cloud

• Always-on availability with Geo-dispersed data

• Auto balance data and simplify protection


What IBM Storage
does for containers
and hybrid cloud
Fastest application response time Seven nines (99.99999), a
latencies as low as 18 statistical average of 3 seconds
microseconds and IOPS up to 18M down time in a year

Secure, fast and seamless data Data lakes scalable to yottabyte


movement to hybrid cloud and configurations with performance up
object environments to 2.5TB/s

Malware and ransomware protection 100% data encryption, at-rest,


with Spectrum Protect Suite and IBM in-flight and in the cloud
Safeguarded Copy
16
Why IBM Storage for Red Hat OpenShift and Cloud Paks

A complete infrastructure Hybrid Cloud Integration


foundation Create application and data agility with integration
Choose the data resources you need when you of data center to public cloud resources
need them...block, file or object, and as your
needs change deploy new data resources

Speed development and ease Quickly deploy data services


Integrate IBM Cloud Paks as IBM Storage is fully
operations tested and validated with Kubernetes and Red Hat
Simplify management with scalability for future
OpenShift
growth

Build new applications Speed up container


with modern process storage provisioning
Helps enterprises transform and move to the Supports Container Storage Interface (CSI),
cloud by enabling them to build new applications simplifying management and reducing costs
using modern processes for containerized storage deployments
“Managing huge amounts of data in a hybrid multi-
ZF and IBM create a hybrid cloud environment is very important. Transparent
cloud collaboration solution access to files in data lakes with low latency is
essential when developing autonomous vehicles,
where we have to process images and information
ZF Friedrichshafen automotive from many different data sources.”

– Harald Holder, Director of IT Infrastructure


Platforms, ZF Friedrichshafen
25,000
Healthcare | Data Governance & Optimization
engineers worldwide

Low latency transparent access across

hybrid cloud
Accelerating
ingest and data centric workflows
IBM Storage / © 2021 IBM Corporation
Business challenge Outcome
Porsche Informatik
Deploy a Red Hat Client deployed IBM
modernizes for the future with OpenShift platform for Cloud Object Storage
thousands of users
Red Hat OpenShift and in multiple data centers
• Secure: immutable serving edge and
IBM Cloud Object Storage • Stable and always-on centralized Red Hat
• Easy to scale OpenShift clusters
• World-wide access to
persistent storage
Healthcare | Data Governance & Optimization

Investment Protection
Consolidates data for container and non container workloads

Worldwide Access
30 countries in 4 continents

Resilient Environment
IBM COS immutable (WORM) storage and global access control
Key takeaways

• Hybrid cloud is happening now


• Containers are gaining momentum
• Without a solid storage foundation, projects will fail
• IBM Storage for Red Hat OpenShift provides
that foundation
Feb 04, 202

IBM Storage for


Hybrid Cloud

Flash System for Hybrid Cloud

Norawee Seneewongse (Aun+)


IBM Technical Consultant
norawee.seneeweongse@ibm.com
Applications services to accelerate delivery of data

Capacity and Performance Discovery and


Data Resilience
Management with AI Orchestration

Unified protection across traditional, Monitor performance and capacity Always-on data identification,
virtualize, cloud and container and with predictive analytics makes Identifying problems, and root cause
environments real time adjustments analysis.

HA/DR Security/Encryption Migration

Policy based and application Enterprise security that is centrally Clone entire application
aware to maintain an always on maintained for global data and environments and access remote
access to data access data from multiple clusters

IBM Storage / © 2021 IBM Corporation


IBM Award-Winning Storage Portfolio
Driving Hybrid Cloud and Container Deployments
Container Storage for AI and
Primary Storage
Native SDS Unstructured Data
IBM Storage Insights IBM IBM Spectrum Discover
Spectrum
IBM Spectrum Scale
IBM Spectrum Virtualize Fusion
IBM Cloud Object Storage
All-Flash and Hybrid Systems SVC Storage for Z IBM Spectrum Elastic Storage Cloud Object
Fusion HCI System Storage
FlashSystem
9500

FlashSystem SAN Volume ESS 3200, 5000


7300 Controller
FlashSystem
9500R DS8900F TS7700
FlashSystem 5000

Networking

IBM Spectrum Protect Suite


Hybrid Snapshots Tape VMs Containers
Cloud
Data Resilience and Modern Data Protection
Ability to simultaneously run FlashSystem FlashSystem FlashSystem
features drives selection 5200 7300 9500 Family

• NVMe Flash and NVMe-oF Host Connections


Highest Performance • FCM (enhanced NVMe Flash w/ onboard
FlashSystem and compression and encryption – no impact)
5035 External Virtualization • Storage Class Memory
• External Storage Virtualization

• Clustering
FlashSystem Data Protection • Encryption
and • HyperSwap
5015 • DRPs (Data Reduction Pools)
Data Reduction
• FlashSystem 5035 DRP - Software Only

• Hybrid Cloud enabled • Metro/Global Mirror (including to cloud)


• VMware and Container Integration • FlashCopy (local snapshots)
Uniform Feature Support
• Multi-tenancy • EasyTier
• Three-Site Data Copies • Data Migration from >500 arrays

IBM Spectrum Virtualize


Foundational Software
Storage Insights (AI Predictive Analytics and Proactive Monitoring)
IBM FlashSystem Family
FlashSystem FlashSystem FlashSystem FlashSystem FlashSystem
5015 5035 5200 7300 9500
Software IBM Spectrum Virtualize Software
2x Broadwell @ 2.2GHz 2x Broadwell @ 2.2GHz 2x Skylake @ 2.3GHz 4x Cascade Lake 4x Ice Lake
CPU (2 cores) (6 cores) (8 cores) @ 2.4GHz (10 cores) @ 2.4GHz (24 cores)
16/32Gb FC / NVMeoF 32/64Gb FC / NVMeoF
16Gb FC 16Gb FC 16/32Gb FC / NVMeoF
100GbE iSCSI/RDMA NVMe 100GbE iSCSI/RDMA NVMe
25Gb iSCSI 25Gb iSCSI 25Gb iSER / iSCSI
Connectivity 10Gb iSCSI 10Gb iSCSI 10Gb iSCSI
25Gb iSCSI/RDMA NVMe 25Gb iSCSI/RDMA NVMe
10Gb iSCSI 10Gb iSCSI
12Gb SAS 12Gb SAS 12Gb SAS
12Gb SAS 12Gb SAS
Cache 32GB or 64GB 32GB or 64GB 64GB to 512GB 512GB to 1.5TB 1TB to 3TB

Advanced SW
Optional Optional Included Included Included
Features
Encryption -- Yes Yes Yes Yes
Compression -- SW HW-assist HW-assist HW-assist x2

Deduplication -- Yes Yes Yes Yes


Clustering -- Yes (2-way) Yes (4-way) Yes (4-way) Yes (2-way/4-way by RPQ)
Ext. Virtualization -- -- Yes Yes Yes
3-year
1-year
1-5 year 3-year warranty warranty
3-year warranty 3-year warranty warranty
Service & Support 9x5, CSU Install 9x5, CSU Install
IBM Storage Expert Care 9x5, CSU Install
24x7, IBM
24x7, IBM
Basic or Advanced (ECS optional) Install
Install
(ECS)
Power AC or DC (800W PSU) AC or DC (800W PSU) AC (1200W PSU) AC (2000W PSU) AC (2000W PSU)

Max Performance 400k IOPS 1.2M IOPS 1.5M IOPS 3.5M IOPS 8M IOPS 16M IOPS
8.3.1* 8 GB/s 12 GB/s 21 GB/s 45 GB/s 100 GB/s 200 GB/s
IBM FlashSystem makes it simple to… Choose your capacity

Choose your support


1U 12 drives, 1PBe
Up to 300k real-world
FlashSystem 5200 IOPS, 21GB/s
Expert Care
Basic or Advanced

2U 24 drives, 2.2PBe
FlashSystem 7300 Up to 580k real-world IOPS, 50GB/s

Expert Care Basic, Advanced or Premium

4U 48 drives, 4.5PBe
FlashSystem 9500 Up to 1.6M real-world IOPS, 100GB/s
Expert Care Advanced or Premium
IBM FlashSystem Family
Four key messages

Resilience Cloud
Enabled

Performance Built for


and Efficiency the future
IBM FlashSystem Data Resilience
Protection from failures, disasters, data theft and cyber attacks

Disaster
Failure Protection Data Theft Cyber Attack
Protection

Enhanced High Availability Enterprise Encryption Immutable


High-availability Disaster Recovery FCM FIPS 140-2 Level 2
copies
Backup Recover
[ ]
DR site

Multi-Platform Support 100% data On-prem, on cloud No performance impact Logical Air Gap
availability guarantee or hybrid
Immutable Built with
IBM primary storage IBM Spectrum
copies
solutions for When you need a
Virtualize
immutable copy copy that bad actors
can’t reach

Cross-vendor support Safeguarded Copy


Backup Recover
Immutable point-in-time copies of production data
[ ]
Enterprise storage from entry to high-end
IBM FlashSystem Isolated logical air-gap offline by design
Heterogeneous storage with
Separate privileges for admins of production data
SAN Volume Controller
Pure HPE IBM and Safeguarded Copies
Dell/EMC NetApp 500+ others…
… Consistent automation with other copy services
What FlashSystem Cyber Vault is simple

Immutable Proactive
Copies of Data Monitoring
Created with IBM Safeguarded Copy Early warning signs of attack with
Can not be changed once created IBM Storage Insights
Recommend integration with SIEM
Cyber Vault such as IBM QRadar

Methodology &
Rapid Automation
Test / Validation
Recovery of Data Copies
Restore production from validated data Recover data copies to isolated environment
copies on primary storage to check they are corruption free
Recovery from point-in-time copy Test recovery procedures
Forensics & Diagnostics Services
IBM Spectrum Protect for your business

• Scalable data protection for


physical file servers, applications,
and virtual environments

• Scale to manage billions


of objects per backup server

• Built-in data efficiency capabilities

• Migrate data to tape, public


cloud services, and on-premises
object storage

• Leverage existing investments


for long-term data retention and
disaster recovery
IBM Spectrum Protect in the IBM Cloud Marketplace

IBM Spectrum Protect 8.1.12 can be deployed


rapidly from the IBM Cloud Marketplace catalog

• Available item in the IBM Cloud


Marketplace catalog
• Auto-deploys the Spectrum Protect
server and Operations Center
• Follows IBM Spectrum Protect
blueprints
• Licensing: BYOL

Demo video: https://youtu.be/alSkLnPN4xo


Presentation: https://youtu.be/s59AgHA3S3I
IBM Storage Insights SI

AI driven insights to improve the health and performance of your infrastructure

Deploy

23 million telemetry points 66% of System issues 40% faster action plan Monitoring to help 3 Exabytes of capacity
per SVC System resolved automatically after case is opened prevent problems monitored in our data lake
collected every day

Source: IBM Cloud service metrics and IBM Storage installed base sample, Jan 2020
Spectrum Control - Enterprise Storage Infrastructure Dashboard
SC
Advanced Storage Area Network Management
Bringing consistent hybrid cloud to over 500 different systems

VMware turns one Spectrum Virtualize turns


physical server into many arrays (over 500) into
many virtual servers one consistent environment
A new FlashCore Module 3
Building on IBM’s FlashCore technology, the latest FCM 3 drive
delivers increased performance and greater storage density

2022: FCM 3
Same SLC/QLC technology
2020: FCM 2
4.8, 9.6, 19.2 and 38.4TB
SLC/QLC technology for physical capacities
performance and cost

2018: FCM 1 Expanded metadata management


4.8, 9.6, 19.2 and 38.4TB delivers up to 3:1 effective to usable
3DTLC technology physical capacities ratio improvement with no performance
penalty
4.8, 9.6 and 19.2TB
physical capacities 2:1 inline native hardware 22, 29, 58, 115TB effective capacities
compression
2:1 inline native hardware
compression Significantly increased throughput

FlashSystem 5200, 7300 and 9500


IBM Spectrum Virtualize - On-prem and in the cloud
On IBM Cloud, IBM On AWS, IBM On Microsoft Azure, IBM
Spectrum Virtualize for Spectrum Virtualize Spectrum Virtualize for
Public Cloud is for Public Cloud is Public Cloud is deployed
deployed on bare deployed on Elastic on Azure VMs, with
metal servers, with Compute Cloud shared Azure Managed
Endurance or (EC2) instances, Disks. It is also enabled
iSCSI/FC Performance block with EBS block with Safeguarded Copy
storage. storage. for cyber resiliency.

FC or iSCSI storage FC or iSCSI

IBM Cloud

On-premises IBM Cloud AWS AZURE


Spectrum Virtualize for Public Cloud
IBM Storage for Red Hat OpenShift – full stack architecture

IBM Cloud Pak IBM Cloud Pak IBM Cloud Pak IBM Cloud Pak IBM Cloud Pak for
for Applications for Data for Integration for Automation Multicloud Management

IBM containerized IBM containerized IBM containerized IBM containerized IBM containerized
software software software software software

Operational services Operational services Operational services Operational services Operational services

Container platform Container platform Container platform Container platform Container platform

IBM Power

Virtualize Scale COS Protect Plus


IBM extends data protection for containers
in hybrid cloud environments

IBM has been demonstrating


Red Hat OpenShift container backup
support. This enables developers to test
the ability to easily back up, recover and
retain containers in a Red Hat
OpenShift environment.
A Global Data Platform: Data Lake
AI Media
Anyone. Anywhere. Everywhere.
Cloud Paks
Collaboration HPC

NVIDIA

Backup OpenShift
Archive

Analytics

Connect, maximize, and optimize with a Global Data Platform


Others Isolate while IBM connects your infrastructure

IBM Storage Other options

OpenShift Enterprise
Data Lake Apps Containers
AI Analytics
Silo
Media
Silo Backup
Silo Archive
Collaboration HPC
NVIDIA
AI Silo
HPC
Backup Silo
Archive
Analytics Edge Cloud
Silo Core Silo
Silo
Anyone – can access the same global data

• Lowers costs
with fewer storage resources
to manage

• Increase productivity Extreme High High


Big Data
as developers choose the interface Performance Performance Network Performance
that they need to provide File Object attached Containers
the results they require
HDFS GDS POSIX S3 NFS. SMB CSI CNSA

• Faster results
with multiple high performance data
access options that can cache data
locally with the interface you choose

Same data
Anywhere – your data is stored

• Investment protection
with an open ecosystem of storage options leveraging
Spectrum Scale Spectrum Fusion
multi-vendor and multi-cloud resources

• Increase application agility


accessing data from edge to core to cloud by bringing Remote

more data to applications wherever they are deployed Spectrum Scale IBM COS Spectrum Scale

• Quickly scale your data


from resources you choose with performance you
require S3 S3
PowerScale

• Faster access to remote data


by transparently caching remote data locally
when needed
Connect any file/object storage
Everywhere – you need access

Align ITOps with business requirements


One Global Data Platform to simplify
and connect your data infrastructure
• Speeds AI results and accuracy
• Support multiple concurrent projects
• React in real time to business needs

HPC AI / ML Analytics Enterprise Containers Backup/Archive


Lower Costs
• Eliminates duplicate data
• Lower costs as data grows with cloud and tape
• Investment protection with current
data sources
• Single Global data Platform to manage
(fewer IT resources)
Access your global data
Improve agility/Hybrid cloud integration
• Access data with multiple interfaces
• Supports concurrent edge and cloud access
• High performance access to OpenShift data
Edge. Core. Cloud.
Resilient storage that is protected and secured

Protected Secured

2-site and 3-site High-availability Enhanced Immutable data Encryption


replication high-availability and snapshots
Short RTO Zero RTO Zero RTO
and audit logs

DR site [ ]

On-prem, on cloud Always online Multi-platform support Air-gap FIPS 140-2


or hybrid
Optimized to lower cost
High performance

Transparent data lifecycle management ESS 3200

• Incorporates file and object into IBM Cloud | AWS | Azure


active capacity
High capacity
• Individual files in the file set can
be compressed (policy based)
IBM Spectrum
IBM Cloud
• Incorporate tape or cloud (S3) Object Storage
IBM Cloud | AWS | Azure
Scale capacity
or ESS 5000
into archive tier
• Globally manage with data Archive capacity
orchestrator (Spectrum Discover)
Object
storage

IBM Cloud | AWS | Google Tape


Easy to start
Grow performance, capacity, and efficiency
You choose how to start

All flash
Low-cost capacity
X86 or Power servers
Container storage
Public cloud storage
External vendor storage

Edge Core Cloud

As your business requirements grow your storage expands with ease


Anyone, anywhere, everywhere
Take the easy route with a single protected node

ESS 3200
“I don’t have a full-time
person who looks after
Spectrum Scale on my
team...For the most part, it
looks after itself.”
Up to 80 GB/s per node | 48TB to 912TB per node
Up to 1.5M IOPs per node | Scale 1 to 1000s of nodes – IT Manager, Univ. of Birmingham
Global data access | Container-native OpenShift access

IBM Spectrum Scale


Easy to start and grow into multiple use cases
NVIDIA DGX AI / Analytics
AI / Kubernetes
SuperPod
Spark
/Hadoop
IBM Cloud Paks

One or more
NVMe Flash
One or more NVMe
One or more NVMe Spectrum Flash ESS3200
ESS3200
Flash ESS3200
Fusion

Backup/Archive HPC and Performance Collaboration Data Lake / Consolidation


Applications Hybrid Cloud
Video/Images/Genomes
Policy Archive
Global Data Platform

Edge Multi-cloud
One or more NVMe One or more NVMe
Flash ESS3200 Tape Flash ESS3200 Enterprise
Cloud
storage One or more ESS 5000
Challenge Results
Financial services Multiple use cases Easier to manage
including large regulatory
case study Cloudera deployment requirements and
and growing AI access data globally
requirements needed for multiple
to consolidate for applications to speed
A multi-national bank needed to scale capacity and regulatory modernization and
performance with a solution that also would help them requirements and new AI workloads
modernize and expand for growing AI and data requirements global access

25+ million Solution


File High Performance
digital customers AI / Analytics
Collaboration
Containers
Backup

Resiliency
was required with agile business growth

Accelerating growth IBM Spectrum Scale


with AI and data centric high-performance requirements Global
Global datadata fabric
Platform
Build your foundation tuned to your industry

Financial Research/Public sector Telecommunication Automotive/Industrial Oil & gas


Fraud Detection w/ Collaborate for better results Connect distributed data Faster results with more data Analysis w/ more data
more data faster

Healthcare Retail Banking Media/Entertainment Service Providers


Connect and access faster Custom results w/ more data Analysis w/ secure data Right data to right place - fast Ease of data connection
Trusted by customers across the globe
Over 6000 customers
9 of the top 10 auto manufactures
9 of the top 10 investment banks
18 of the top 25 banks
8 of the top 10 global retailers
4 of the top 5 insurance companies

Cloud Storage and AI and Analytics HPC Archive/Backup


Hybrid Cloud

80 PB Government 126 PB Financial 500 PB Research 60 PB Healthcare


62 PB Automotive 64 PB Life Sciences 75 PB Government 14 PB Retail
And market leadership for 6 consecutive years
Gartner: MQ for Distributed File Systems
and Object Storage, September 2021

● IBM’s acquisition of Red Hat expands its reach into


enterprises with cloud-native workloads, including
enhanced support for containerized applications
and OpenShift in IBM Spectrum Scale
● IBM COS is the only object storage offering on the
market that is also the underlying storage of a major
public cloud. This provides the assurance of
running a large-scale environment for object
native workloads
● IBM Spectrum Discover analyzes data stored in IBM
Spectrum Scale and COS, providing data
visibility, classification and labeling with custom
metadata to enable AI-powered applications

Gartner: Critical Capabilities for Scale-out File


Storage, October 2021
#1 Ranked Storage for Analytics and HPC Use Cases
A Global Data Platform: Anyone. Anywhere. Everywhere.
Data Sources Data and AI
and Locations Outcomes
“We did testing with AI and
machine learning workloads
with Spectrum Scale POSIX /
and [it] outperformed all GDS
File and Object High Performance
other file systems that we
AI
tested against.”1
Cloud
Senior system administrator Kubernetes Native S3

Backup /
Global Data Archive
Platform
Return on investment Edge
NFS
/
380%1 IBM Spectrum Protect Plus SMB
Enterprise Apps
IBM Spectrum Protect
Core Data
Center
IBM
Spectrum HDFS
“We’ve managed to unify the
entirety of the data storage Scale High Performance
where we’re putting IBM Cloud Paks Analytics
Public Cloud
everything into one holistic
platform.”1 CSI

Tape/Cloud Containers
CTO of research and advanced computing
Hybrid Cloud
Tape
Get started today!
Spectrum Spectrum Cloud Object
Learn more about IBM Storage for AI: Discover Scale Storage

• IBM Storage Data and AI web page

Learn more about IBM Storage for NVIDIA:


• IBM Storage and NVIDIA web page
IBM Spectrum Scale
Learn more about IBM Spectrum Scale and
Elastic Storage System (ESS):
• IBM ESS web pages
• IBM Spectrum Scale web page

Learn more about IBM Cloud Object Storage (IBM COS):


• Learn about the IBM COS story (interactive experience)
• IBM COS web page
IBM ESS
Learn more about IBM Spectrum Discover:
• IBM Spectrum Discover web page

Learn more about IBM Spectrum Fusion:


• IBM Spectrum Fusion web page
Faster results Global access
Large service provider
Performance is Global access
case study maintained even means more access
as data grows with for more users and
multiple paths to flexibility to access
A global service provider used IBM Spectrum Scale and the data enabling high analysis tools
global data platform to bring faster analytics and AI workloads throughput and regardless of
connecting edge to core to cloud
low latency location and
connectivity options

Anywhere Solution
Active data analysis Active data analysis
access to single data fabric from multiple concurrent locations Ingest Data
data Center
100PB+
data growth in 3 years as more customer data analyzed Global Data Platform

backup
Remote sites
Public cloud integration Reporting

to all data leveraging resources in Azure Tape IBM Spectrum Scale


Challenge Results
Medical research
Now able to identify
case study Catalog large data
and classify data
images, monitoring
faster while
and reporting on
optimizing data
data location,
IBM Spectrum Scale and IBM Spectrum Discover provides a location based on
PHI/PII data, cost
secure environment for faster research to more data to help bring need and
better patient care efficiencies and
performance
performance of data
requirements

30 PB cataloged data Medical


for optimized workflows and data governance Images and IBM Spectrum
patient data Discover

60 PB accessible data
optimized on multiple tiers of storage (lower costs)

Secure and accessible IBM Spectrum Scale


archive

data across multiple locations


Global data platform Tape
Faster results Lower costs
Large research hospital
Performance is By eliminating data
creates a Global data Platform maintained even silos and manual
as data grows with data movement the
multiple paths to entire infrastructure
A research hospital used IBM Spectrum Scale and IBM Cloud data enabling high requires less
Object Storage to create a interconnected data platform that throughput and resources and is
optimized both file and object workloads
low latency simpler to manage

100% Solution
growth expected in next four to five years with
growing capital investment
Medical image images High performance
and other object data access requirements
Concurrent
access to data from both a high-performance file and
Red Hat OpenShift interface and an S3 object interface

IBM Cloud IBM Spectrum


Interconnected Object Storage Scale
data provides access efficiencies and optimized capacity Transparent
interface
Faster results Lower cost
Multi-site industrial example
Data is accessed Accessing data
directly from one without movement
source with multiple meant customer did
IBM Spectrum Scale and a global data
concurrent parallel not need 2 copies
platform with S3 and file access helps store
paths for multiple of data from
once and access anywhere for better AI
concurrent solutions multiple sites and
collaboration
multiple applications

Multiple Applications
Big Data, HPC, ADAS, CAE, Kubernetes applications Ingest File Data Big Data

Access S3
200PB+ Global Data Platform
shared data across 2 remote design centers
Tape and cloud backup
Site C
access
High Performance S3 IBM Spectrum Scale and ESS
required but data ingesting to a file source
Storage for one of the
world’s smartest
supercomputers with
IBM ESS storage

2.5 TB/sec
throughput to storage architecture

500 PB
of storage capacity

“Summit is providing scientists


with incredible computing
power to solve challenges in
energy, artificial intelligence,
human health, and other
research areas, that were
simply out of reach until now

64
“Managing huge amounts of data in a hybrid
multi-cloud environment is very important.
Automotive Transparent access to files in data lakes with
case study low latency is essential when developing
autonomous vehicles, where we have to
process images and information from many
different data sources.”
ZF Friedrichshafen automotive leverages a global data fabric
to help process images and information faster for better – Harald Holder, Director of IT Infrastructure
collaboration Platforms, ZF Friedrichshafen

25,000
engineers worldwide 100s
TB/Day
Video data

Multi-cloud Homegrown AI/ML


containers for
Container storage
Archive
Low latency transparent access across ESS hardware to tape
autonomic driving

Global IBM
Data Spectrum
Platform Scale
Accelerating AI
ingest and data centric workflows 65
“[Spectrum Scale] gives us a single data management
Public sector plane across our many different storage systems,
allowing us to make price-performance decisions when
Case Study matching workloads to platforms, without complexity
spiraling out of control."

University of Birmingham is driving innovative research - Simon Thompson,


Research Computing Infrastructure Architect
forward by taking control of data

Global data Platform


provides a single data management plane across
many different storage systems

Lower Costs
by enabling price-performance decisions matching
workloads to platforms, without complexity spiraling
out of control

Agility
by deploying applications where it makes sense and
immediately have data available to them
Faster results Lower cost

Telecom Provider Data is accessed Accessing data


Case Study directly from one without manual
source with multiple movement meant
concurrent parallel customer did not
IBM Spectrum Scale with 5 ESS3200s chosen as the highest paths for multiple need multiple
performing file system for AI workloads on NVIDIA SuperPOD
concurrent solutions copies of data

Highest performance
solution that supports new NVIDIA SuperPOD that is
enterprise grade and can scale NVIDIA DGX SuperPOD with IBM ESS

Multiple applications
will access data concurrently providing better resource
utilization of SuperPOD and simpler data management

Multiple use cases for multiple AI teams including network


Proven scalability and support optimization, customer service, NLP, and Covid 19 risk assessment
with demonstrated performance and high-quality support
67
Collapse silos Integrated Solution
Sharing data efficiently with
Retail the Machine Learning team
that was using and planning
Thorough testing and
reference architectures for
Case Study growth in the use of NVIDIA NVIDIA DGX and NVIDIA
DGXA100 GPUS Networking with IBM Storage
Data and AI
Availability of enterprise
grade features that they can An Integrated Elastic Storage
IBM Spectrum Scale with IBM ESS3200s grow to: multi-site capabilities System was easy for Kroger
for stretch cluster and global to deploy, manage, and
data sharing or ILM grow.

Highest performance
Strong performance to supportt the demanding needs of an AI
data scientist team using NVIDIA DGX A100 servers Solution Description Solution Components
• 5 – IBM ESS 3200
• approximately 1.1 PB usable
• ~400 GB/sec Read
Multiple applications 1

2
3

4
5

6
7

8
9

10
11

12
13

14
15

16
17

18
19

20
21

22
23

24
25

26
27

28
29

30
31

32
33

34
35

36
37

38
39

40
1

2
3

4
5

6
7

8
9

10
11

12
13

14
15

16
17

18
19

20
21

22
23

24
25

26
27

28
29

30
31

32
33

34
35

36
37

38
39

40
• ~275 GB/sec Write
• Appliance solution
Requiring shared data so that data scientists can share data • Includes 5105 Management Node
• 1 - ESS 5000
and collaborate more effectively • approximately 373TB usable
• 2 - Power9 5105 Protocol Nodes
• Allow data scientist flexibility in cluster access
• InfiniBand (IB) QM8700 switches
• Lab Services Included
Proven scalability and support • 16 Days for initial configuration and setup

With NVIDIA DGX systems was a key component of earning


credibility with this AI team that lacked prior IBM experience 1

2
3

4
5

6
7

8
9

10
11

12
13

14
15

16
17

18
19

20
21

22
23

24
25

26
27

28
29

30
31

32
33

34
35

36
37

38
39

40
1

2
3

4
5

6
7

8
9

10
11

12
13

14
15

16
17

18
19

20
21

22
23

24
25

26
27

28
29

30
31

32
33

34
35

36
37

38
39

40

Not for public distribution


“As a result of our “The collaboration
new infrastructure, between
Automotive we can now run 20, Continental, IBM
Case Study 40, 80 GPUs Storage and
simultaneously to NVIDIA is bringing a
really speed up our promise to life in
IBM Spectrum Scale with multiple ESS3000s chosen as training,” terms of safety.”
the fastest AI storage system for all data
Balazs Lorand, PhD
Head of AI Competence Centre,
at Continental.

Faster results
with more GPUs simultaneously being utilized

70%
improved AI training time

IBM Spectrum Scale


14x and ESS
more deep learning experiments per month
69
Network Analysis on Telco CSR Center:
Telco AI Case Studies for IBM Spectrum Scale Customer Traffic:
Elastic Storage System (ESS) - High speed back-up: Millions
of calls per day in a limited
- Customer data records back-up window
processing. IBM Streams is
used to ingest 50B records
Large Telco Provider: and to determine network
Cell Provisioning:
- Use IBM Spectrum Scale and ESS for almost everything: ingestion of health. IBM Spectrum Scale - Used to add new cell users
data from the network, networks statistics, network usage. Tiering is gives them the performance onto the network. Two
used to deliver an NVME Flash tier for fast performance of 1- and simple scalability needed systems collapsed into one
200GB/s of throughput. for streaming applications and across more than 10 sites
ML
- Data is moved to an ESS tier with disk where they have one of the
world’s largest SAP HANA/Sybase implementations for analytics on
capacity planning.

- External analytics for minutes used, data usage, texts sent etc. to Large Telco Provider:
support prepaid customers
- Data ingestion at more than 20 Points of Presence (POPS). Data
- Internal analytics on call records that is fed into ML algorithms used gets pushed to a central location where ETL and analytics gets
to identify influencers that they can focus special attention on for performed. Network threat analysis is the biggest use case.
customer retention Analytics on capacity usage for planning

Cloudera Data Lakes:


- Personalized marketing via customer
360-degree views
- Several other Hadoop use cases
70
Journey of A fast path to
Major US financial institution application modernization is
Success Story modernization chosen

Customer developing new Customer chose a dual


applications and migrating cluster that not only
Integrated OpenShift solution provides faster some existing apps to connects to each other but
path to application modernization and hybrid containers and using cloud also connects to IBM Cloud
resources. Journey has been Satellite and the IBM Cloud
cloud slowed with operational and existing data resources.
complexity and integration of Customer is now able to
new technologies. drive modernization forward
OpenShift made easy by many months.
with a comprehensive private cloud experience including
integrated data services to leverage existing resources was a
key driver for IT operations

Global Data Platform


provides a fast and easy way for DevOps to move more Connected
quickly with application modernization and leverage global with a
resources including the public cloud Global Data
Platform

Completeness of hybrid cloud vision


was key contributor to IBM success vs. others considered

You might also like