You are on page 1of 37

DAT202-R

What's new in Amazon Aurora

Tony Petrossian
GM Amazon Aurora
Amazon Web Services

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Aurora
Enterprise database at open source price

Delivered as a managed service


Drop-in compatibility with MySQL and PostgreSQL

Simplicity and cost-effectiveness of open-source databases

Throughput and availability of commercial databases


Amazon Aurora

Simple pay-as-you-go pricing


4

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Database layers

SQL

Transactions
Multiple layers
Caching of processing
Logging & Storage
all in a single
engine
Aurora decouples storage and query processing

SQL Database
Transactions node

Caching
Amazon Aurora Storage
Processing

Shared storage volume

Storage
nodes
Scale-out, distributed storage processing architecture
Purpose-built log-structured distributed Availability Zone 1 Availability Zone 2 Availability Zone 3

Instance nodes
storage system designed for databases
SQL SQL SQL
Transactions Transactions Transactions
Storage volume is striped across hundreds
of storage nodes distributed over 3 Caching Caching Caching
different Availability Zones

Six copies of data, two copies in each

Storage nodes
Shared storage volume
Availability Zone to protect against AZ+1
failures

Data is written in 10 GB “protection


groups”, growing automatically when
needed
8

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Aurora: Why re-imagine the RDBMS

Customers
Aurora: Why re-imagine the RDBMS
Aurora: Why re-imagine the RDBMS

Applications operating 24x7


and scaling to unimaginable levels
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Aurora distributed storage provides:
Redo log processing
Instant crash recovery
Fault-tolerant and self-healing storage
Fast database cloning
Amazon Aurora Database backtrack
Database snapshots
Delivered as Continuous backups and point-in-time restore
a managed Storage automatic scaling independent of compute
service Read and write scalability
Warm cache on database restart
Low latency replication
Aurora distributed storage provides:
Redo log processing

Instant crash recovery

Fault-tolerant and self-healing storage

Fast database cloning


Amazon aurora
Database backtrack

Database snapshots

Continuous backups and point-in-time restore


transactions SQL Storage Auto Scaling independent of compute

Read and write scalability

Warm cache on database restart

Low latency replication


Aurora
MySQL and PostgreSQL-compatible relational database built for the cloud
Performance and availability of commercial-grade databases at 1/10th the cost

Performance Availability
Highly secure Fully managed
and scalability and durability

5x throughput of standard Fault-tolerant, self-healing Network isolation, Managed by Amazon RDS:


MySQL and 3x of standard storage; six copies of data encryption at No server provisioning, software
PostgreSQL; scale-out up to across three Availability Zones; rest/transit, compliance patching, setup, configuration,
15 read replicas continuous backup to Amazon S3 and assurance programs or backups
16

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Aurora Global Database
Faster DR and enhanced data locality

Promote remote readers


to a master for faster
cross-region disaster
recovery (DR)

Bring data close to your


customer’s applications in
different Regions

Subsecond data
replication cross-Region
Aurora Global Database
Faster DR and enhanced data locality
Northern Virginia
(Secondary Region)

Inbound replication
Oregon Ohio
(Secondary Region) (Primary Region)
R R

Inbound replication M R R Storage

R R
Storage
Ireland
Storage
Outbound replication (Secondary Region)

Inbound replication
High throughput: Up to 200K writes/sec R R
Low replica lag: < 1-sec cross-Region lag
Fast recovery: < 1-min. downtime after Region unavailability Storage

Support for multiple secondary Regions


Support for in-place conversion to Global Database
Fast cross-account database cloning Analytics

Dev/test
Create a copy of a database without applications
duplicate storage costs
• Creation of a clone is nearly instantaneous— Clone
we don’t copy data
Production
• Data copy happens only on write—when applications
original and cloned volume data differ Clone

Typical use cases


• Clone a production DB to run tests
• Reorganize a database
• Run analytics workloads
• Save a point in time snapshot for analysis
without impacting production system
Production database

https://aws.amazon.com/blogs/aws/amazon-aurora-fast-database-cloning/
Aurora Serverless for PostgreSQL and MySQL
Application

Request routers

Scalable DB capacity Warm pool of instances

DB storage
Challenges with integrating machine learning (ML) with
your database
Select and train the model

Create application code to read data from the database

Query and format the data for the ML algorithm

Call an ML service to run the algorithm

Format the output


Amazon Aurora ML
Simple, optimized, and secure Aurora, Amazon SageMaker, and Amazon Comprehend
(in preview) integration

ML predictions Integration with Familiar SQL Low-latency, Security &


on relational data Amazon language, no real time governance
SageMaker & ML expertise
Amazon
Comprehend
Aurora optimized ML query processing

Select * from
user_feedback where
aws_comprehend.detect
_sentiment(review_text,
‘EN’)' = ‘POSITIVE'"

user_feedback
ID Feedback
1 Great product!
Good job
Mediocre
I didn’t like it
Loved it
Terrible service
50 Great service
Amazon RDS Proxy (preview)
Fully managed, highly available database proxy for Amazon RDS

Supports a large number of application connections


Applications

RDS Database Deployed across multiple AZs and fails over without
Instance
losing a connection
RDS
Proxy
Integrates with AWS Secrets Manager and IAM

Get started with a few clicks in the console


Connection Pooling
Preview: Aurora MySQL and RDS MySQL
Coming soon: Aurora PostgreSQL and RDS PostgreSQL
What is Performance Insights

• Analyze and tune Database Performance • Database Load is determined by Average Active
• Available through AWS Management Console Sessions (AAS)
and AWS API SDK • Categorized data by Wait Events, SQL, Hosts,
• Set up alarms for key issues and Users
• SQL statistics for queries new!
SQL statistics in Aurora
Database Activity Streams
DAS architecture
AWS Cloud

Amazon Kinesis
Users Partner Database Alerts
Amazon RDS Security Apps
Database Instance
/Replica

Amazon Kinesis
Data Firehose
Aurora read scaling options

15 promotable read replicas per cluster


Application Application Application BI/reporting
Auto scaling to automatically add servers servers servers application
and remove replicas server

Physical replication across Regions Read/write Read only


(Aurora Global Database)

Logical (binlog) replication to any Database Read


Primary
server replica
MySQL database Asynchronous
replication
Aurora read replicas

Read/write endpoint Read endpoint

Read
Master
replica

Shared distributed storage volume


Aurora MySQL multi-master

Read-write end-point Read-write end-point

Master Master

Shared distributed storage volume

Continuous Availability with Multi-Master


- https://aws.amazon.com/blogs/database/building-highly-available-mysql-applications-
using-amazon-aurora-mmsr/
Federated Query for Amazon Athena (preview)
Run SQL queries on data spanning multiple data stores

Run SQL queries on relational, non-relational, object, or


custom data sources; in the cloud or on premises

Open-source connectors for common data sources


Amazon S3/ S3 Glacier

Amazon Redshift Build connectors to custom data sources


Amazon ElastiCache
Run connectors in AWS Lambda: No servers to manage
Amazon Aurora

Amazon DynamoDB

Amazon DocumentDB
Amazon Redshift federated query (preview)

Queries on RDS and Aurora PostgreSQL databases


JDBC/ODBC

Analytics on live data without data movement

Unified analytics across data warehouse, data


lake, and operational databases

Flexible and easy way to ingest data

Performant and secure access to data


Related breakouts
DAT309 Amazon Aurora storage demystified: How it all works
DAT321 Deep dive on Amazon Aurora with MySQL compatibility
DAT328 Deep dive on Amazon Aurora with PostgreSQL compatibility
DAT350 Migrating open-source databases from Amazon EC2 to
Amazon Aurora PostgreSQL
DAT207-R What’s new in Amazon RDS
DAT382 Amazon Aurora Multi-Master: Scaling out database write
performance
DAT404 Amazon Aurora Multi-Master: Scaling out database write
performance
Learn databases with AWS Training and Certification
Resources created by the experts at AWS to help you build and validate database skills

25+ free digital training courses cover topics and services


related to databases, including:
• Amazon Aurora
• Amazon Neptune
• Amazon DocumentDB
• Amazon DynamoDB
• Amazon ElastiCache
• Amazon Redshift
• Amazon RDS
Validate expertise with the new AWS Certified Database - Specialty beta
exam

Visit aws.training

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Thank you!
Tony Petrossian
tonypet@amazon.com

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.

You might also like