Professional Documents
Culture Documents
Kapil Pendse
Sr. Solutions Architect, Amazon Web Services
© 2020, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark.
Agenda
© 2020, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark.
Definition
© 2020, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark.
Where does this data come from?
© 2020, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark.
Actionable insights come from proper tools
• Data warehouses can scale horizontally but suffer due to lack of indexes
Traditional data analytics tools are simply not built to handle the variety and volume of
rapidly proliferating machine data.
© 2020, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark.
Elasticsearch for turning logs into insights
Elasticsearch stores, and indexes application and log data in near real time, providing fast
retrieval, filtering, and analysis for monitoring. It's also great for search!
© 2020, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark.
Simple to use – it’s a database
1 2 3
Send data as Data is indexed— REST APIs for field matching,
JSON via REST APIs all fields searchable, Boolean expressions, sorting
including nested JSON and analysis
1 3
Server, application, Application data
network, AWS, and
Elasticsearch cluster Application users, analysts,
other logs
DevOps, security
© 2020, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark.
Amazon Elasticsearch Service Infrastructure
AWS Cloud - region
Amazon ES domain
VPC
© 2020, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark.
Machine Learning for Logs
© 2020, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark.
Use cases
© 2020, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark.
Anomaly Detection in Amazon Elasticsearch Service
© 2020, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark.
Anomaly Detection in Amazon Elasticsearch Service
WD DI DI DI DI DI
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23
Time
Now (T)
© 2020, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark.
Anomaly Detection in Amazon Elasticsearch Service
RCF algorithm
WD DI DI DI DI DI
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23
Time
Now (T)
© 2020, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark.
Anomaly Detection in Amazon Elasticsearch Service
RCF algorithm
WD DI DI DI DI DI
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23
Time
Now (T)
© 2020, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark.
Demo
© 2020, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark.
Reference Architecture – S3 Access Logs
© 2020, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark.
Reference Architecture – Application Logs
© 2020, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark.
Reference Architecture – VPC Flow Logs
© 2020, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark.
Getting Started
• Start ingesting log data into an index. As part of the ingestion pipeline, transform non-numeric
fields to numeric fields if needed. For example, one-hot encoding can be used for things like
HTTP response codes, error codes, exceptions etc.
• Configure a monitor using the Amazon Elasticsearch Service Alerting feature, so that you can
receive alerts via Slack or SNS.
© 2020, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark.
Things to Remember
• Amazon Elasticsearch Service Anomaly Detection only supports continuous numerical features,
categorical features are not supported
• The values of Detector Interval and Window Delay have to be carefully configured. Sometimes it
can take several hours for a detector to finish initializing. If your detector stays in “initializing”
state for longer than a day, you can use the “profile detector” API to check if there are any issues
that are affecting your detector.
• Anomaly Detection can be computationally intensive, so monitor the CPU utilization of your
cluster nodes and try to use larger instances if CPU utilization becomes a bottleneck.
© 2020, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark.
Thank you!
kapilpen@amazon.com
@kapilpendse
© 2020, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark.