
D1SDB108

Let’s Map AWS Storage Service to Your Needs

June Park (he/him), APJ Storage Specialist Solutions Architect Leader, AWS
Ameen Khan S (he/him), Sr. Storage Specialist Solutions Architect, AWS

© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Every Business is a Data Business

Exponential Data Growth is The New Normal

3x growth of enterprise data stored by organisations by 2025
87% of enterprise data will be stored in a cloud environment by 2025

Source: IDC Market Spotlight, Sponsored by AWS, “Organizations Rely on Cloud Storage to Optimize Cost, Increase Agility, and Drive Innovation,” Doc. #USUS48291421, October 2021

Exponential Data Growth is a . . .

Challenge: Creating data silos, exceeding ability to scale, and driving higher costs for data
Risk: Caused by increased complexity in protecting data and continuity
Opportunity: To innovate by shifting from collecting to using data

BUILD IN THE CLOUD
  Born in the cloud
  New applications, cloud first
MOVE TO THE CLOUD
  Application by application
  Crown jewels in the cloud

AWS Storage is the Foundation for Cloud Data Architecture

AWS storage:
  Always protected
  Put to work
  When and where you need it
  Cost-optimised

Define Application Requirements

AWS Storage Services
Data services: AWS Backup, Amazon EBS snapshots, AWS Transfer Family, Replication, Lifecycle management, Intelligent-Tiering, Metrics reporting, Compression and dedupe

Core services
  Object: Amazon S3 and Amazon S3 Glacier
  Block: Amazon EBS
  File: Amazon FSx Family, Amazon EFS

Hybrid/edge storage and data movement services: AWS DataSync, AWS Storage Gateway, AWS Snow Family, AWS Outposts

Object: Amazon S3 and Amazon S3 Glacier
Block: Amazon EBS
File: Amazon FSx Family, Amazon EFS

Strike a Balance Between Cost and Retrieval Speed

Use data cost-effectively
Turn data into value
Address workloads such as data lakes, backups, and archives

Amazon S3
✓ Virtually unlimited scalability
✓ Most performant
✓ Most secure
✓ Easy to manage
✓ Lowest-cost storage for archive data
✓ Built on 16 years of innovation
✓ New archive instant retrievals

Gain Insights, Optimize Costs, and Strengthen Data Security with Amazon S3
Take advantage of
✓ Cost-effective storage classes for virtually any workload
✓ New features like S3 Object Lambda and Multi-Region Access Points
✓ Training to optimize your use of S3

How are Customers using Amazon S3?
Compliance records; Analytics; Geospatial or lunar imagery; IoT sensor data; Medical images and records
Media master files; Customer call-center records; Data lakes; Digital record preservation; Mobile sync and storage
Home-recording video; Seismic and reservoir simulation data; Origin storage for CDN; Pharmaceutical study data
Durable backups; DNA sequences; ML training data; Surveillance video/closed-circuit television; Media assets
Financial transaction records; Website hosting; Log files; Meteorological and environmental research
User-generated content; Autonomous vehicle data; Oil and gas topography
Operating at Scale in Amazon S3
Amazon S3
  Industry-leading scalability, availability, and durability
  Wide range of cost optimization capabilities
  Broadest data movement and hybrid cloud storage options
  Industry-leading performance

Optimizing using Amazon S3 Classes

Decreasing storage prices and accelerating innovation, 2006 to 2021:
  S3 Standard (2006)
  S3 Glacier (2012)
  S3 Standard-IA (2015)
  S3 One Zone-IA (2018)
  S3 Intelligent-Tiering (2018)
  S3 Glacier Deep Archive (2019)
  S3 Outposts (2020)
  S3 Intelligent-Tiering, Archive Instant Access (2021), new
  S3 Glacier Instant Retrieval (2021), new

Scenarios

Building a data lake
  Building a new data lake
  Migrating an existing data lake

Building a New Data Lake

Amazon S3
  Most ways to get data in
  Broadest portfolio of analytics tools
  More ways to optimize costs, easiest to automate: Intelligent-Tiering, Batch Operations
  Easiest to manage access at scale: Access points, most object-level controls
  Unmatched durability, availability, and scalability
  Best security, compliance, and audit capabilities: Block Public Access, Access Analyzer for S3, Inventory reports

Migrating Existing Data Lake into Amazon S3
Lift & shift
  What: Run third-party analytic tools on EC2; use EBS and S3 as data stores; self-managed environments
  Why: Simplify on-premises migrations; use existing tools, code, and customizations; minimize application changes
  Consider: You provision, manage, and scale; you monitor and manage availability; you own upgrades and versioning

AWS Managed Services (Amazon Redshift, AWS Glue, Amazon EMR, Amazon Athena)
  What: AWS managed & serverless platforms; AWS Glue, Athena, Amazon EMR, Amazon Redshift; more options to process data in place
  Why: Focus on data outcomes, not infrastructure; speed adoption of new capabilities; more tightly integrated with AWS security
  Consider: Utilizing AWS Lake Formation; flexibility and choice with open data formats; leverage AWS pace of innovation

Amazon S3 is the storage foundation for both approaches


Backup and Archiving

Backing up data to the cloud
  Using backup ISV software (e.g. Veritas, Commvault, etc.)
  Using local file shares or volumes to store backup files
  Replacing physical tape library
  Backing up data for compliance and regulatory purposes

Choosing Between Amazon S3 Archive Storage Classes

Storage cost
  S3 Glacier Instant Retrieval: $0.004 per GB-month
  S3 Glacier Flexible Retrieval: $0.0036 per GB-month
  S3 Glacier Deep Archive: $0.00099 per GB-month

Data retrieval
  S3 Glacier Instant Retrieval: Instant
  S3 Glacier Flexible Retrieval: Expedited: 1–5 minutes; Standard: 3–5 hours; Bulk: 5–12 hours
  S3 Glacier Deep Archive: Standard: Within 12 hours; Bulk: Within 48 hours

Minimum object duration
  S3 Glacier Instant Retrieval: 90 days
  S3 Glacier Flexible Retrieval: 90 days
  S3 Glacier Deep Archive: 180 days

Bulk retrievals are now FREE!
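
To make the storage-cost row concrete, here is a minimal Python sketch that multiplies a data volume by the per GB-month prices listed on the slide. It covers storage charges only; request and retrieval fees are left out, prices vary by Region and over time, and the function names are illustrative. The `billable_days` helper assumes that an object removed before its minimum duration is still charged for the full minimum.

```python
# Storage prices (USD per GB-month) and minimum object durations (days),
# copied from the comparison on the slide. Real prices vary by Region.
ARCHIVE_CLASSES = {
    "S3 Glacier Instant Retrieval": {"price": 0.004, "min_days": 90},
    "S3 Glacier Flexible Retrieval": {"price": 0.0036, "min_days": 90},
    "S3 Glacier Deep Archive": {"price": 0.00099, "min_days": 180},
}


def monthly_storage_cost(size_gb: float) -> dict[str, float]:
    """Storage-only cost per month for each archive class, in USD."""
    return {
        name: round(size_gb * spec["price"], 2)
        for name, spec in ARCHIVE_CLASSES.items()
    }


def billable_days(storage_class: str, days_stored: int) -> int:
    """Days charged for an object (assumption: early deletion is billed
    up to the class's minimum object duration)."""
    return max(days_stored, ARCHIVE_CLASSES[storage_class]["min_days"])


if __name__ == "__main__":
    for name, cost in monthly_storage_cost(10_000).items():
        print(f"{name}: ${cost:.2f} per month for 10,000 GB")
```

For 10,000 GB the gap between Flexible Retrieval and Deep Archive is roughly a factor of four, which is why the retrieval-time row matters as much as the price row when picking a class.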

How to Move Data into Amazon S3

PUT, COPY
Replication
Lifecycle
AWS data transfer services
APN
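
As an illustration of the Lifecycle option, the sketch below builds a lifecycle configuration that moves objects under a prefix to S3 Glacier Instant Retrieval and later to S3 Glacier Deep Archive. The dictionary follows the shape accepted by boto3's `put_bucket_lifecycle_configuration`; the function name, rule ID, prefix, and bucket name are hypothetical, and the 90-day gap check is an assumption taken from the minimum object duration quoted for Instant Retrieval.

```python
def archive_lifecycle_configuration(prefix: str, days_to_instant: int,
                                    days_to_deep: int) -> dict:
    """Build a lifecycle configuration with two archive transitions.

    Assumption: objects should stay in Glacier Instant Retrieval for at
    least its 90-day minimum duration before moving to Deep Archive.
    """
    if days_to_instant < 0:
        raise ValueError("days_to_instant must not be negative")
    if days_to_deep - days_to_instant < 90:
        raise ValueError("leave at least 90 days between the two transitions")
    return {
        "Rules": [
            {
                "ID": "archive-" + (prefix.strip("/") or "all"),
                "Filter": {"Prefix": prefix},
                "Status": "Enabled",
                "Transitions": [
                    {"Days": days_to_instant, "StorageClass": "GLACIER_IR"},
                    {"Days": days_to_deep, "StorageClass": "DEEP_ARCHIVE"},
                ],
            }
        ]
    }


if __name__ == "__main__":
    config = archive_lifecycle_configuration("logs/", 30, 365)
    print(config)
    # To apply it (requires boto3 and AWS credentials; bucket name is hypothetical):
    # import boto3
    # boto3.client("s3").put_bucket_lifecycle_configuration(
    #     Bucket="example-bucket", LifecycleConfiguration=config)
```

Building the configuration as plain data keeps it easy to review and test before anything is sent to S3.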

Understand, Analyze and Optimize

Organization-wide S3 usage and activity dashboard for cost optimization
Bucket-level analysis of retrievals for predictable workloads
Object-level analysis for analytics and auditing

Object: Amazon S3 and Amazon S3 Glacier
Block: Amazon EBS
File: Amazon FSx Family, Amazon EFS

Migrating Existing Applications to the Cloud

Migrating existing applications to the cloud
  Existing applications with data on a SAN or direct-attached storage
  Existing applications with data on a NAS or file share

Choosing an Amazon EBS Volume Type
IOPS is more important
  >160,000 | ≤160,000
  Latency? <1 ms | Single-digit ms
  Which is more important? Cost | Performance

Throughput is more important
  Small, random I/O | Large, sequential I/O
  Aggregate throughput? ≤4,750 MB/s | >4,750 MB/s
  Which is more important? Cost | Performance

Volume types (left to right): io2 Block Express, gp3, io2, sc1, st1
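
The flowchart can be read as a small decision function. The Python sketch below encodes one plausible reading of it, and every branch assignment is an assumption: sub-millisecond latency leads to io2 Block Express, single-digit-millisecond latency leads to gp3 (cost) or io2 (performance), and large sequential workloads lead to sc1 (cost) or st1 (performance). The extracted slide text does not say where the ">160,000 IOPS" and ">4,750 MB/s" branches end, so the function returns `None` there, and the "small, random I/O" branch is not modeled. The function and parameter names are illustrative.

```python
from typing import Optional


def choose_ebs_volume(priority: str, *, iops: int = 0,
                      sub_ms_latency: bool = False,
                      throughput_mb_s: float = 0.0,
                      favor: str = "cost") -> Optional[str]:
    """Return an EBS volume type for one reading of the slide's flowchart.

    Returns None for branches whose outcome is not given in the slide text.
    """
    if favor not in ("cost", "performance"):
        raise ValueError("favor must be 'cost' or 'performance'")

    if priority == "iops":
        if iops > 160_000:
            return None  # outcome not stated in the slide text
        if sub_ms_latency:
            return "io2 Block Express"  # assumed leaf for the <1 ms branch
        return "gp3" if favor == "cost" else "io2"

    if priority == "throughput":
        if throughput_mb_s > 4_750:
            return None  # outcome not stated in the slide text
        return "sc1" if favor == "cost" else "st1"

    raise ValueError("priority must be 'iops' or 'throughput'")


if __name__ == "__main__":
    print(choose_ebs_volume("iops", iops=20_000, favor="cost"))
    print(choose_ebs_volume("throughput", throughput_mb_s=500,
                            favor="performance"))
```

Returning `None` rather than guessing keeps the sketch honest about the two branches the text leaves open.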
Object: Amazon S3 and Amazon S3 Glacier
Block: Amazon EBS
File: Amazon FSx Family, Amazon EFS

Scenario: Choosing the Right File Service
On-premises comparison
  Amazon EFS: General file systems
  Amazon FSx for Windows File Server: Windows file servers
  Amazon FSx for NetApp ONTAP: NetApp, commodity NAS
  Amazon FSx for Lustre: Scale-out file storage (Lustre, GPFS, Isilon)

Unique features
  Amazon EFS: Cloud native NFS v4, fully elastic, no provisioning required, dynamically grow *and* shrink, file-level restore, IAM for file system access control
  Amazon FSx for Windows File Server: Native Windows features*, file access auditing, FSx File Gateway for on-premises caching
  Amazon FSx for NetApp ONTAP: Multi-protocol, replication, cloning, intelligent tiering, Varonis and anti-virus integration
  Amazon FSx for Lustre: Scale-out performance, S3 data processing capabilities

Use cases
  Amazon EFS: Serverless, containers, modern apps, CMS, test/dev, backup and DR
  Amazon FSx for Windows File Server: Windows-based user and group shares, Windows applications, SQL Server with HA
  Amazon FSx for NetApp ONTAP: Enterprise IT, databases, containers (Trident), line-of-business apps, test/dev, backup and DR (NetApp)
  Amazon FSx for Lustre: Machine learning, containers, HPC, media processing, data analytics, compute intensive applications

Decision criteria: Access
Protocol
  Amazon EFS: NFSv4 (4, 4.1)
  Amazon FSx for Lustre: Lustre (POSIX compliant)
  Amazon FSx for NetApp ONTAP: SMB (2.0, 2.1, 3.0, 3.1, 3.1.1), NFS (3, 4, 4.1)
  Amazon FSx for Windows File Server: SMB (2.0, 2.1, 3.0, 3.1, 3.1.1)

Client compatibility
  Amazon EFS: Linux, MacOS, containers (EKS, ECS), serverless
  Amazon FSx for Lustre: Linux, containers (EKS, ECS)
  Amazon FSx for NetApp ONTAP: Linux, Windows, MacOS, containers (EKS, ECS)
  Amazon FSx for Windows File Server: Linux, Windows, MacOS, containers (ECS)

On-prem access
  All four services: AWS Direct Connect, AWS VPN

On-prem caching
  Amazon FSx for NetApp ONTAP: NetApp Global FileCache, NetApp FlexCache
  Amazon FSx for Windows File Server: Amazon FSx File Gateway

Automatic import/export of S3 data sets
  Amazon FSx for Lustre: √
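
The protocol and client compatibility rows lend themselves to a simple lookup. The following Python sketch shortlists services for a given protocol and client operating system; the data is transcribed from the access criteria as reconstructed from the slide, SMB and NFS versions are collapsed to the protocol family, and the structure and function name are illustrative rather than any AWS API.

```python
# Protocol families and client operating systems per service, transcribed
# from the access criteria. Protocol versions are deliberately collapsed.
SERVICES = {
    "Amazon EFS": {
        "protocols": {"nfs"},
        "clients": {"linux", "macos"},
    },
    "Amazon FSx for Lustre": {
        "protocols": {"lustre"},
        "clients": {"linux"},
    },
    "Amazon FSx for NetApp ONTAP": {
        "protocols": {"smb", "nfs"},
        "clients": {"linux", "windows", "macos"},
    },
    "Amazon FSx for Windows File Server": {
        "protocols": {"smb"},
        "clients": {"linux", "windows", "macos"},
    },
}


def candidate_services(protocol: str, client_os: str) -> list[str]:
    """Services that list both the protocol and the client OS."""
    wanted_protocol = protocol.strip().lower()
    wanted_client = client_os.strip().lower()
    return [
        name
        for name, spec in SERVICES.items()
        if wanted_protocol in spec["protocols"]
        and wanted_client in spec["clients"]
    ]


if __name__ == "__main__":
    print(candidate_services("NFS", "Linux"))
    print(candidate_services("SMB", "Windows"))
```

A shortlist from access criteria is only a first filter; the deployment, performance, data protection, and security criteria still decide between the remaining candidates.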
Decision criteria: Deployment and Flexibility
Choice of throughput and IOPS levels
  Amazon EFS: Throughput
  Amazon FSx for Lustre: Throughput
  Amazon FSx for NetApp ONTAP: Throughput & IOPS (network & disk)
  Amazon FSx for Windows File Server: Throughput & network IOPS

Deployment options
  Amazon EFS: Standard (multi-AZ), One-zone (single-AZ)
  Amazon FSx for Lustre: Scratch (single-AZ), Persistent (single-AZ)
  Amazon FSx for NetApp ONTAP: Multi-AZ
  Amazon FSx for Windows File Server: Single-AZ, Multi-AZ

Other criteria on the slide: Auto-tiering *; Automatic storage elasticity

* Coming soon with file cache enhancements

Decision Criteria: Performance and Scale
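Latency
  Amazon EFS: Low single-digit ms (GP); low double-digit ms (Max I/O)
  Amazon FSx for Lustre: Sub-ms
  Amazon FSx for NetApp ONTAP: Sub-ms
  Amazon FSx for Windows File Server: Sub-ms

Max throughput per file system
  Amazon EFS: 1-5+ GB/s
  Amazon FSx for Lustre: 100+ GB/s
  Amazon FSx for NetApp ONTAP: 2 GB/s; up to 3 GB/s w/comp or cache
  Amazon FSx for Windows File Server: 2 GB/s; up to 3 GB/s w/comp or cache

Max throughput per client
  Amazon EFS: 500 MB/s
  Amazon FSx for Lustre: Up to 12 GB/s
  Amazon FSx for NetApp ONTAP: 2 GB/s; up to 3 GB/s w/comp or cache
  Amazon FSx for Windows File Server: 2 GB/s; up to 3 GB/s w/comp or cache

Number of concurrent client connections
  All four services: Thousands

Max IOPS per file system
  Amazon EFS: 35,000 for GP mode; millions for Max I/O mode
  Amazon FSx for Lustre: Millions
  Amazon FSx for NetApp ONTAP: Hundreds of thousands
  Amazon FSx for Windows File Server: Hundreds of thousands

Max file system size
  Amazon EFS: Virtually unlimited
  Amazon FSx for Lustre: Virtually unlimited
  Amazon FSx for NetApp ONTAP: Virtually unlimited
  Amazon FSx for Windows File Server: 64 TiB; extends to multiple PiBs with DFS Namespaces

Min file system size
  Amazon EFS: 0
  Amazon FSx for Lustre: 1200 GiB (SSD), 6000 GiB (HDD)
  Amazon FSx for NetApp ONTAP: 1024 GiB
  Amazon FSx for Windows File Server: 32 GiB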
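
The size limits can be checked with a few lines of arithmetic. This Python sketch, using the minimum sizes and the 64 TiB Windows limit from the slide, returns the smallest file system each service would start at for a required capacity. It is a rough sizing aid under stated assumptions: provisioning increments above the minimum are not modeled, Amazon EFS is elastic rather than provisioned, and the names are illustrative.

```python
# Minimum file system sizes in GiB, as listed on the slide.
MIN_SIZE_GIB = {
    "Amazon EFS": 0,
    "Amazon FSx for Lustre (SSD)": 1200,
    "Amazon FSx for Lustre (HDD)": 6000,
    "Amazon FSx for NetApp ONTAP": 1024,
    "Amazon FSx for Windows File Server": 32,
}

# Single file system limit for FSx for Windows File Server (64 TiB).
WINDOWS_MAX_GIB = 64 * 1024


def smallest_file_system_gib(required_gib: int) -> dict[str, int]:
    """Smallest size per service that holds the data and meets the minimum."""
    if required_gib < 0:
        raise ValueError("required_gib must not be negative")
    return {
        name: max(required_gib, minimum)
        for name, minimum in MIN_SIZE_GIB.items()
    }


def needs_dfs_namespaces(required_gib: int) -> bool:
    """True when the data exceeds one FSx for Windows file system."""
    return required_gib > WINDOWS_MAX_GIB


if __name__ == "__main__":
    for name, size in smallest_file_system_gib(500).items():
        print(f"{name}: {size} GiB")
    print("DFS Namespaces needed for 100 TiB:",
          needs_dfs_namespaces(100 * 1024))
```

For small data sets the minimum size, not the data volume, can dominate what gets provisioned, which is worth weighing alongside the throughput figures.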

Decision Criteria: Data Protection
Services compared: Amazon EFS, Amazon FSx for Lustre, Amazon FSx for NetApp ONTAP, Amazon FSx for Windows File Server

Fully-managed backups
  Crash consistent (listed for three of the four services)

Cross-region replication
  Amazon FSx for Lustre: Using S3 CRR
  Amazon FSx for NetApp ONTAP: NetApp SnapMirror

Backup and DR from on-prem to AWS
  Amazon FSx for NetApp ONTAP: NetApp SnapMirror

Other criteria on the slide: End-user file restore; Cross-region / cross-account backup

Decision Criteria: Security
Encryption-at-rest
  All four services: AWS KMS

Encryption-in-transit
  Amazon EFS: TLS 1.2
  Amazon FSx for Lustre: AEAD-256-bit *
  Amazon FSx for NetApp ONTAP: SMB encryption, AES-128-GCM or AES-128-CCM; session signing w/ SMB Kerberos session keys
  Amazon FSx for Windows File Server: SMB encryption, AES-128-GCM or AES-128-CCM; session signing w/ SMB Kerberos session keys

Identity AuthN & AuthZ
  Amazon FSx for NetApp ONTAP: Active Directory
  Amazon FSx for Windows File Server: Active Directory

Other criteria on the slide: File access auditing; Anti-virus scanning; Client access policies; Resource management access policies

Decision Criteria: FSx Supports all NAS Features; Examples of Common Ones…
Services compared: Amazon FSx for Lustre, Amazon FSx for NetApp ONTAP, Amazon FSx for Windows File Server
Features: Snapshots; User-quotas; Instant cloning; Compression and deduplication

Key Takeaways
Amazon S3 is the foundation for the data lake, whether it is self-managed or AWS managed

Lifting and shifting on-premises applications to the AWS Cloud requires choosing the right EBS volume type for the application's requirements, balancing cost and performance

Choose the right AWS file storage service based on access, performance, data protection, and security

Learn in-demand AWS Cloud Skills

AWS Skill Builder
  Access 500+ free digital courses and Learning Plans
  Explore resources with a variety of skill levels and 16+ languages to meet your learning needs
  Deepen your skills with digital learning on demand
  Train now

AWS Certifications
  Earn an industry-recognized credential
  Receive Foundational, Associate, Professional, and Specialty certifications
  Join the AWS Certified community and get exclusive benefits
  Access new exam guides

Thank you!
June Park (he/him), AWS
Ameen Khan S (he/him), AWS

Please complete the session survey
