Professional Documents
Culture Documents
Solution
N g u yễ n T h ế H ù n g
T h e h u n g.n gu yen@de ll .co m
0 9 3 6 3 91 525
U D S So l u tio n En g in e e r
Innovate using leading storage innovations
Hyperconverged infrastructure
Modern storage portfolio Software-Defined Storage (SDS)
Dell
Dell Dell Dell Dell Dell Dell Dell
PowerScale/
PowerVault Unity XT PowerStore PowerMax ECS VxRail PowerFlex
Isilon
✓ Block (SAN/DAS) ✓Simple ✓ Scale-up/out ✓ Scale-up/out ✓ Scale-out ✓ Cloud-scale ✓ Turnkey system ✓ Scalable SDI
✓ Affordable/simple ✓All-flash/hybrid ✓ NVMe ✓ Cyber resiliency ✓ All-flash to archive ✓ Deep archive ✓ VMware vSAN ✓ Multi-hypervisor
✓ CloudIQ support ✓Virtual option ✓ AppsON ✓ Highest availability ✓ Multiprotocol file ✓ Mobile App and ✓ Lifecycle mgt ✓ 2 layer/
✓CloudIQ support ✓ CloudIQ support ✓ CloudIQ support ✓ CloudIQ Modern app ✓ CloudIQ support HCI/storage
✓ CloudIQ support
Gartner Magic Quadrant – 7th Consecutive Year Leader (Oct’22)!!
“Distributed file systems and object storage deployments are
growing faster than ever in both volume and size as the
consolidated platform for unstructured data services in
global data center.”
- Gartner Magichttps://www.gartner.com/doc/reprints?id=1-2BHJ4VIQ&ct=221024&st=sb
Quadrant for Distributed File Systems and Object Storage – Oct. 2022
Gartner, Inc. “Magic Quadrant™ for Distributed File Systems and Object Storage” byJulia Palmer, Jerry Rozeman, Chandra Mukhyala, Jeff Vogel, October 19, 2022.
Gartner does not endorse any vendor, product or service depicted in its research publications, and does not advise technology users to select only those vendors with the highest
ratings or other designation. Gartner research publications consist of the opinions of Gartner’s research organization and should not be construed as statements of fact. Gartner
disclaims all warranties, express or implied, with respect to this research, including any warranties of merchantability or fitness for a particular purpose.
This graphic was published by Gartner, Inc. as part of a larger research document and should be evaluated in the context of the entire document. The Gartner document is
available upon request from Dell Technologies.
GARTNER and MAGIC QUADRANT are registered trademarks and service marks of Gartner, Inc. and/or its affiliates in the U.S. and internationally and are used herein with
permission. All rights reserved
Home directories
Energy
Virtual server of the world’s data
stored is unstructured* Manufacturing
Virtual desktop
Financial services
Video surveillance
Life sciences
EDA
Dell Internal and Partner Confidential Copyright © Dell Inc. All Rights Reserved.
Unstructured data workload
Artificial intelligence
Data analytics
Home directories
Workloads suitable for
Energy
Manufacturing
Financial services
Video surveillance
Life sciences
EDA
Dell Internal and Partner Confidential Copyright © Dell Inc. All Rights Reserved.
Different in the way of accessing
Dell PowerScale
Scale-Out NAS
Dell Internal and Partner Confidential Copyright © Dell Inc. All Rights Reserved.
Powerscale Scale-out architecture
Client
Multi-protocol Applications
Intra-cluster
communication
REST
NDMP
HDFS
Ethernet layer
HTTP
FTP
S3
SMB
NFS
Dell Internal and Partner Confidential Copyright © Dell Inc. All Rights Reserved.
Traditional NAS
NFS
SMB
GUI
Establish pools/LUNs
NAS #1 Marketing
HR Finance Engineering
Set Data protection/RAID
Shelf #1 Marketing
2 LUN 6 LUN 4
Permissions/Access/Protocol
LUN 3
Scaling compounds complexity
Dell Internal and Partner Confidential Copyright © Dell Inc. All Rights Reserved.
Powerscale Architecture
Dell Internal and Partner Confidential Copyright © Dell Inc. All Rights Reserved.
ARCHITECTURE REALLY MATTERS
Traditional – Scale-Up Modern - Scale Out
PowerScale Storage Nodes
ALL-FLASH (Performance)
HYBRID (Blended)
F900
F600
H700 / H7000
F200
ARCHIVE (Capacity)
Dell Internal and Partner Confidential Copyright © Dell Inc. All Rights Reserved.
Multiple pools, one filesystem Drive lowest blended TCO
Optimize Data Placement to Lower Blended TCO without compromising
performance
Transparently leverage
POLICY multiple pools of storage
1 week
move to hybrid tier SMARTPOOLS
Policy based automated
1 month
move to archive tier tiering within one filesystem
CLOUDPOOLS
1 year
move to cloud Extends filesystem to the
cloud or on-prem object store
Transparent to clients
No impact to users
Dell Internal and Partner Confidential Copyright © Dell Inc. All Rights Reserved.
Eliminate data migrations
Add new generation node to
existing cluster
No impact to users
No administrator intervention
required
Round Robin
Ensures CPU is evenly utilized
CPU
across cluster
No impact to users
No impact to users
No impact to users
No impact to users
Directory and
/ifs N+3 file level granularity
change just a single directory to a
higher level
N+4
Directory and
file level granularity
change just a single directory to a
higher level
Directory and
file level granularity
change just a single directory to a
FAILED higher level
Directory and
file level granularity
change just a single directory to a
higher level
All of the nodes rebuild the failed units into free space
Dell Internal and Partner Confidential Copyright
29 of 58 © Copyright
© Dell Inc.
2020AllDell
Rights
Inc.Reserved.
Policy based data protection
Protection with no downtime
ADVANTAGES
N+3 Protected @ N+3 Change data protection
USED FREE on the fly
Directory and
file level granularity
change just a single directory to a
higher level
Directory and
file level granularity
change just a single directory to a
higher level
Dell Internal and Partner Confidential Copyright © Dell Inc. All Rights Reserved.
OneFS In-line Data Reduction
Dell Internal and Partner Confidential Copyright © Dell Inc. All Rights Reserved.
Secure multi-protocol access
Global permissions structure shared
across ALL users with ALL protocols
InsightIQ
SmartPools SyncIQ
Policy-based automated tiering Asynchronous replication for DR
CloudPools SmartLock
Cloud tiering to a choice of providers Policy-based compliance WORM protection
SmartQuotas SmartConnect
Quota management and thin provisioning Policy-based client failover load balancing
DataIQ
“Create an isolated recovery “Ensure that backups are not “Secure tertiary data backup should “It is important that the backup data
environment: Make ransomware connected to the business be disconnected … so that it can is stored offline and not connected
recovery via an IRE part of your network” withstand targeted cyber attacks … to your network.”
disaster recovery plan & include it or threats from malicious insiders.”
in future disaster recovery tests.”
“It is critical to maintain offline, “Data Vault requirement: “Ensure backups are not “Daily backups of important data,
encrypted backups of data” ‘Air gapped’” connected to the networks they software and settings, stored
back up.” disconnected, retained for at least
three months.”
Ransomware Defender
AirGap Cluster
Real-Time Ransomware detection and
prevention. Monitors user behavior &
shuts down access when ransomware
activity is identified.
SnapshotIQ AirGap
OneFS Snapshots are read-only. Impact of Manages SyncIQ policy replication schedule to 3rd cluster
attack is limited to changes between last automatically and keeps the Airgap closed when active
snapshot policy and when attack happened. threats are detected. Longest Data Retention capability
OneFS has granular policy options and with block level snapshot differencing. Fastest Restore
rapidly recovers from an attack Speed with rapid recovery of data in hours not days or
weeks “Get PB’s of data usable in hours”
• Metropolitan System
• Airports, Railways, Public transportation
• Hospitals
• Prisons Surveillance
Body worn
video
• Strategic objects – power plants etc.
• Logistics companies Aerial video
Audio
In car video
Crime science
GIS data
CCTV – Traditional approach
LUN 1 LUN 2 LUN 3 LUN 4 LUN 5 LUN 6 LUN 7 LUN 8 LUN 9 LUN 10 LUN 11 LUN 12
(E:) (F:) (G:) (H:) (E:) (F:) (G:) (H:) (E:) (F:) (G:) (H:)
CCTV –Traditional approach
LUN 1 LUN 2 LUN 3 LUN 4 LUN 5 LUN 6 LUN 7 LUN 8 LUN 17 LUN 9 LUN 10 LUN 11 LUN 12
(E:) (F:) (G:) (H:) (E:) (F:) (G:) (H:) (I:) (E:) (F:) (G:) (H:)
CCTV – smart & modern approach
by UPP
Prague
Data Analytics/Data Management
with Data Lake
Data architecture evolution to Data Lakehouse
Data Warehouses
Data Warehouses
Metadata, Caching
ETL ETL
and Indexing Layer
1 1 0 0 0 1 1 0 1 1 0 0 0 1 1 0 1 1 0
ETL 0 0 1 1 0
ETL 0 0
0
0 0 0 1 1 1 0
1 0 1 0 0 1
1
1 1
0 0
1 0 0
0
0 0 0 1 1 1 0
1 0 1 0 0 1
1
1 1
0 0
1 0 0
0
0 0 0 1 1 1 0
1 0 1 0 0 1
1
1 1
0 0
1
1 1
1 1 1 1 1 1 1 1
1 1 1 1 1 1 1 1
1 1 1 1 1 1
1 1
0
1 0 0 1 1 0
0 1 0 0 0
0
0 1 1
0
1 0 0 1 1 0
0 1 0 0 0
0
0 1 1
0
1 0 0 1 1 0
0 1 0 0 0
0
0
1 0
0 0 1 1 0
0 0 1 1 0
0 0 1
0 0 1 0 0 1 0 0 1
Data Lake Data Lake Data Lake
External Data
Data Node + Compute Node Data Node + Compute Node Data Node + Compute Node
Ethernet
NameNode
Data Node + Compute Node Data Node + Compute Node Data Node + Compute Node
name
node
data nodes
name
node
Ethernet
name
node
name
node
Compute Node Compute Node Compute Node
2、3 4
1 2
5
10 NameNode
6
9 9 3 NameNode
9
NameNode
7
8 NameNode
/ifs N+3
• A copy of the data
• Better usable rate
• Change the protection level online
• Granulariy at directory/file level
• Global hotspare space improves
Protection Description
Level
data security and reconstruction
+1n Tolerate failure of 1 drive OR 1 node efficiency
+2d:1n Tolerate failure of 2 drives OR 1 node
+2n Tolerate failure of 2 drives OR 2 nodes • Matured, 22 years old products.
Version 6.5 since 2001
+3d:1n Tolerate failure of 3 drives OR 1 node
+3d:1n1d Tolerate failure of 3 drives OR 1 node AND
1 drive
+3n Tolerate failure of 3 drives or 3 nodes
+4d:1n Tolerate failure of 4 drives or 1 node
+4d:2n Tolerate failure of 4 drives or 2 nodes
+4n Tolerate failure of 4 nodes
2x to 8x Mirrored over 2 to 8 nodes, depending on
config
61 Copyright © Dell Inc. All Rights Reserved.
Flexible configuration in a cloud environment
Hadoop Version 1
Hadoop on Container
server server server
Hadoop Version 2
Hadoop on Virtual Machine
server server server
Hadoop Version 3
서버 서버 서버 New Hadoop
server server server
✓ If you need different versions of ✓ When upgrading an existing version ✓ Hadoop based on on-demand
Hadoop clusters of Hadoop containers/virtual machines
➔ Data can be shared ➔ Ensure data safety ➔ Efficient HDFS zone configuration
62 of Y © Copyright 2021 Dell Inc.
PowerScale vs DAS HDFS
Item PowerScale Hadoop DAS
Operation Multi-Protocol HDFS/S3/NFS/SMB/FTP/HTTP/RESTful API Only HDFS, require gateway
Scalability Separate compute and storage resources and scale Storage and compute resources must scale
with actual demand to avoid waste together, and in most cases, wasted resources
Upgrade Storage and computing can be upgraded Both storage and compute need to be upgraded,
independently, and Powerscale can upgrade online, new clusters need to be established, and data
without the need to build new systems or migrate migration might required
data
Balancing data Auto, self-heal and little/no impact Manually, can impact performance, require tuning
Efficiency Capacity Data lake technology ensures multi-protocol sharing Hadoop requires 3 copies of data, which requires
of one piece of data, supporting more than 80% 6 times the space consumption if disaster
storage efficiency recovery or other replication is considered
i
F200
EMC PowerScale
i
F200
EMC PowerScale
i
F200
EMC PowerScale
10GbE
i
F200
EMC PowerScale
i
F200
EMC PowerScale
VMware i
EMC
F200
PowerScale
EMC PowerScale
i
F200
EMC PowerScale
i
F200
EMC PowerScale
Back-end
Network
Spine Switch x 4
Case Study –
Ngân hàng tại Việt Nam
Bank Network
Data Lake
Ethernet
40/100Gb
Ethernet
40Gb
Replication
8N – H5600
Ethernet
25GbEthernet
Workload mobility
40Gb
6N – EX500 5N – EXF900
69 of Y Dell - Internal Use - Confidential
PowerScale Resources
• OneFS Manuals
• White Papers
• PowerSizer
• BLogs
• Hands-on Labs
• OneFS Simulator
• CloudIQ Simulator