Professional Documents
Culture Documents
Expert Talks
Episode 3:
Spectrum Scale Strategy
Ted Hoover
Program Director Spectrum Scale Development
Wayne Sawdon
CTO for Spectrum Scale and ESS
https://www.surveygizmo.com
/s3/5727746/47520248d614
Spectrum Scale Expert Talks / Episode 3 / Spectrum Scale Strategy © 2020 IBM Corporation
2
The first 20+ years
Spectrum Scale Expert Talks / Episode 3 / Spectrum Scale Strategy © 2020 IBM Corporation
3
GPFS has evolved …
Spectrum Scale Expert Talks / Episode 3 / Spectrum Scale Strategy © 2020 IBM Corporation
4
IBM Spectrum Scale
• GPFS is known for scale-out high performance on
the world’s largest supercomputers…
• BUT: If you still just think GPFS, you miss:
– Support for workflows which for example
inject data via object, analyze results via
Hadoop/Spark and view results via POSIX
– Storing and accessing large and small objects
(S3 and Swift) with low latency
POSIX HDFS NFS SMB Swift/S3
– Storing and starting OpenStack VMs without
copying them from object storage to local file IBM Spectrum Scale
system
– Common namespace between Spectrum
Scale clusters on-prem and in the cloud
– Namespace includes Data Management to
automatically destage cold data to on premise
NVMe SSD Disk Tape
or off premise tape or object storage
– GUI , REST API, Grafana Bridge
– HA, DR, Real time Audit & Security
– And much, much more
Spectrum Scale Expert Talks / Episode 3 / Spectrum Scale Strategy © 2020 IBM Corporation
5
Spectrum Scale Expert Talks / Episode 3 / Spectrum Scale Strategy © 2020 IBM Corporation
57 of the Global 100 run IBM Spectrum Scale
9 of the top 10 automobile manufacturers
9 of the top 10 investment banks
18 of the top 25 banks
8 of the top 10 global retailers
4 of the top 5 insurance companies
Spectrum Scale Expert Talks / Episode 3 / Spectrum Scale Strategy © 2020 IBM Corporation
Strategic Trends
Connected Clouds
Dev Ops
Inescapable AI
Security
Performance
Spectrum Scale Expert Talks / Episode 3 / Spectrum Scale Strategy © 2020 IBM Corporation
8
Companies
average almost
Reasons to Hybrid
migrate from
5 public cloud multicloud
private and
public clouds • Security is the platform
• Performance
• Cost
80% • Control of companies
of companies
moved their IDC Survey
85% operate in a
hybrid multicloud
environment today
applications or
data from public
clouds in 2018
of companies
Spectrum Scale Expert Talks / Episode 3 / Spectrum Scale Strategy © 2020 IBM Corporation
11
Evolving Storage Market
Traditional Storage
• Deliver underlying infrastructure Container-Ready Storage
needs to support enterprise
requirements. • Leverage existing investments in Container-Native Storage
• Centralized administration for traditional storage to support
organization. container deployments. • Storage deployed inside
containers with enterprise level
Examples: DS8900, FlashSystem, • Allows use of snapshots, clones, data management services to
IBM Spectrum Scale and replication but doesn’t take
support mission critical
advantage of container
applications deployed in
framework and related benefits.
containers.
• Not optimized for Kubernetes so
• Direct attach and external
can be a bottleneck to achieving
storage support varying
increased agility and elasticity.
performance and capacity
Examples: DS8900, FlashSystem, needs.
IBM Spectrum Scale
• Kubernetes control plane allows
self service capabilities driving
higher levels of efficiencies.
13
Spectrum Scale Expert Talks / Episode 3 / Spectrum Scale Strategy © 2020 IBM Corporation
Evolution of IBM Spectrum Scale Containers
Scale in a Container
w/ CloudPaks
Scale for Containers
Scale for Containers v2 Scale in a
v1 Container
Spectrum
Scale
(bare metal deployment) CSI 2.x
OpenShift Interoperability
Kubernetes
Kubernetes
CSI 2.x
Spectrum Scale
SEC 2.0 CSI 1.0
Spectrum
Scale
Spectrum Spectrum
Spectrum Scale Scale Scale
OpenShift
Common Services
OpenShift
Kubernetes
Kubernetes
OS Support OS Support OS Support RH CoreOS
RHEL / RH CoreOS
Infrastructure Infrastructure Infrastructure Infrastructure
Infrastructure IBM & Expanded Partner Ecosystem
Spectrum Scale Expert Talks / Episode 3 / Spectrum Scale Strategy © 2020 IBM Corporation 14
Evolution of IBM Spectrum Scale on Cloud
Current Future
Partner and Scale Offerings Partner and Scale Offerings
Common Services
Spectrum Scale
Spectrum Scale
Infrastructure Scale on AWS CSI
Infrastructure
IBM & Partners
Spectrum Scale
OpenShift
AMI
AWS Common Services Common Services
Kubernetes
OS Support OS Support RHEL
Spectrum Scale
Infrastructure
Infrastructure IBM & Partners
Infrastructure
IBM & Expanded Partner Ecosystem
Spectrum Scale Expert Talks / Episode 3 / Spectrum Scale Strategy © 2020 IBM Corporation
15
Evolution of Hybrid Cloud with IBM Spectrum Scale
Spectrum Scale Scale in a Container
Spectrum Scale on AWS w/ CloudPaks
(bare metal deployment)
Spectrum Scale (Multi-Cloud)
(bare metal deployment)
AMI
AWS Common Services Single Name Space
Spectrum Scale w/AFM CSI
Single Name Space Spectrum Scale
Spectrum Scale
w/AFM
Spectrum Scale
OS Support
OS Support
Infrastructure OS Support
IBM & Partners
Infrastructure
Scale in a Container OpenShift
Infrastructure IBM & Partners On Cloud Common Services
IBM & Partners
Kubernetes
Spectrum Scale RH CoreOS
IBM Cloud Infrastructure
IBM & Expanded Partner Ecosystem
Common Services
Scale in a Container
CSI
Spectrum Scale
Spectrum Scale
OS Support OS Support
Infrastructure
Infrastructure
IBM & Partners
IBM & Partners
Current Future
Spectrum Scale Expert Talks / Episode 3 / Spectrum Scale Strategy © 2020 IBM Corporation 16
Why DevOps?
Spectrum Scale Expert Talks / Episode 3 / Spectrum Scale Strategy © 2020 IBM Corporation
17
Spectrum Scale Deployment: Strategy
Microservices based reusable Ansible infrastructure
(Provides installation, configuration and upgrade capabilities for all Spectrum Scale form factors)
Customer
Ansible ReST API SDI ESS Cloud Containers
Infrastructure
CLI
(Install Toolkit) Cloud Provisioning Container Deployment
Ansible Tower Hardware
(AWX) Ansible Playbooks (Terraform) Ansible Orchestrators
Scale Ansible Scale Ansible Scale Ansible Scale Ansible
Playbook Playbook Playbook Playbook
Protocols Install AFM Install GUI Install ECE Install Callhome Install File Audit Install
Protocols Configure AFM Configure GUI Configure ECE Configure Callhome Configure File Audit Configure
Microservices Protocols Upgrade AFM Upgrade GUI Upgrade ECE Upgrade Callhome Upgrade File Audit Upgrade
based
Reusable
Ansible Roles Spectrum Scale Core Install
Spectrum Scale Core Configuration
Spectrum Scale Core Upgrade
18
Spectrum Scale Expert Talks / Episode 3 / Spectrum Scale Strategy © 2020 IBM Corporation
Spectrum Scale Deployment: Strategy
Infrastructure specific resource provisioning
Unified Installation and Configuration through reusable Ansible playbooks
Spectrum Scale Expert Talks / Episode 3 / Spectrum Scale Strategy © 2020 IBM Corporation
20
IBM Storage and SDI
The Goal: Move Data from Ingest to Insights
Transient Storage
SDS/Cloud Insights Out
Throughput-oriented,
software defined
temporary landing
zone Trained Model
Fast Ingest /
Real-time Analytics ETL Archive
SSD SSD/Hybrid HDD Cloud Tape Inference
High throughput High throughput, random
performance tier I/O, performance & High scalability, large/sequential I/O capacity tier
capacity Tier
1. Single name space across storage platforms Spectrum Scale & ESS
2. Global collaboration / Hybrid Multi-Cloud Cloud Object Storage
IBM
3. Indexing, Auto tagging / metadata management Cloud Paks Spectrum Discover
4. Integrated analytics platform IBM Cloud Paks
© Copyright IBM Corporation 2018
Spectrum Scale Expert Talks / Episode 3 / Spectrum Scale Strategy © 2020 IBM Corporation 22
Data Accelerator for AI and Analytics
The Problem
We see:
ML / DL
Hadoop / Spark
Prep ⇨ Training ⇨ Inference
• Customers across all verticals are creating
SSD/NVMe HDD Cloud Tape
large PB to EB data stores.
HOT COLD
Spectrum Scale Expert Talks / Episode 3 / Spectrum Scale Strategy © 2020 IBM Corporation
23
Moving Data as Close as Possible to Compute
➢ 2 storage tiers with different storage characteristics
▪ Data Lake – minimize storage cost
▪ High Performance Tier (HPT) – maximize storage
performance
➢ HPT – high-performance data analytics on shared data,
scale-out cluster FS, common namespace, no data
transformations, data on-demand or prefetch,
periodically revalidates cache
➢ S.Discover – curates data lake, metadata search engine,
loads HTP, starts analytics, overall governance
➢ S.Conductor – Intelligent workload manager
➢ Data Lake – COS, Cloud or any high capacity data store
➢ Performance – single node & scale out
➢ End to end security and monitoring
➢ Can be deployed on-prem to on-prem, one-prem to
cloud or cloud to cloud.
Spectrum Scale Expert Talks / Episode 3 / Spectrum Scale Strategy © 2020 IBM Corporation
Spectrum Scale as Data Acceleration for AI and Analytics (DAAA)
- Accelerate model training output by prefetching selected dataset real-time in your
ML/DL environment from the Hadoop/Spark data lake.
- Accelerate real time analytics / inference output by prefetching selected dataset real-
time in a near-edge environment from the remote centralized data lake.
Accelerated Insight
Data Scientist
Data ingest to capacity Select the right data set for Cache selected dataset into Spectrum
tier caching Scale namespace
NAS
NVMe Storage
Filers High Performance Tier
Capacity
Tier /Data Lake Complete solution across your data’s life cycle
Spectrum Scale Expert Talks / Episode 3 / Spectrum Scale Strategy © 2020 IBM Corporation
Spectrum Scale Strategic Areas: Security Feature Outlook
Strategic Areas
Spectrum Scale Expert Talks / Episode 3 / Spectrum Scale Strategy © 2020 IBM Corporation
26
IBM Spectrum Scale and IBM QRadar: Threat Detection and Data Protection
Motivation Solution Architecture
• Attacks against businesses have almost doubled in
five years, and incidents that would once have been
considered extraordinary are becoming more and
more commonplace.
Benefits to Customers
Integrating IBM Spectrum Scale with IBM QRadar
allows: Blueprint & Redpapers: http://www.redbooks.ibm.com/redpieces/abstracts/redp5591.html
• Customers to proactively safeguard their data
residing on Spectrum Scale or be alerted on New!
potential threats (internal / external) in real
time.
✓ Eventual Durability
www.spectrumscaleug.org