Kala Aditya
19E51A0551
Introduction
• A data pipeline is a series of steps that moves and processes data.
• A pipeline can include stages such as extract, transform, and load (ETL),
and can also analyse the data.
• Data pipeline security ensures that data is protected throughout the process.
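The stages listed above can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions: the sample CSV data, the column names, and the function names (extract, transform, load, analyse) are hypothetical and not taken from the slides, and the "data store" is a plain in-memory list rather than a real database.

```python
import csv
import io

# Hypothetical sample input; a real pipeline would read from a file, database, or API.
RAW_CSV = """order_id,region,amount
1,north,120.50
2,south,80.00
3,north,not_a_number
4,east,42.25
"""


def extract(text):
    """Extract: parse CSV text into a list of row dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: convert types and drop rows whose amount cannot be parsed."""
    cleaned = []
    for row in rows:
        try:
            amount = float(row["amount"])
        except ValueError:
            continue  # skip malformed rows instead of failing the whole run
        cleaned.append(
            {"order_id": int(row["order_id"]), "region": row["region"], "amount": amount}
        )
    return cleaned


def load(rows, store):
    """Load: append cleaned rows to the target store (a plain list in this sketch)."""
    store.extend(rows)
    return store


def analyse(store):
    """Analyse: total the amount per region."""
    totals = {}
    for row in store:
        totals[row["region"]] = totals.get(row["region"], 0.0) + row["amount"]
    return totals


warehouse = load(transform(extract(RAW_CSV)), [])
totals = analyse(warehouse)
print(totals)
```

Keeping each stage as a separate function means a stage can be tested or replaced on its own, for example swapping the in-memory list for a real data store.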
Comparison of Existing and Advanced
• AWS services used for advanced data pipelines include Glue, Data Pipeline
and Kinesis.
• Traditional methods are prone to human error, time-consuming and less
efficient compared to advanced methods.
Advanced Model/Topic/Area
• Machine learning models can be used to analyze sales data and predict
future demand.
• Data pipeline security measures include encrypting data and controlling access.
• AWS Glue is a fully managed ETL service for moving data between data
stores.
• AWS Lambda is a serverless compute service for running code in response
to events.
• Amazon Kinesis is a real-time data streaming service for processing and
analyzing large data streams.
• Glue, Lambda and Kinesis can be used together in a data pipeline.
• Glue moves data, Lambda runs code, and Kinesis allows for real-time
processing.
• These services can be used in a variety of data pipeline use cases such as
data warehousing, log analysis, and data lake creation.
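As a rough sketch of how Lambda and Kinesis fit together in the log analysis use case, the Python handler below decodes a batch of Kinesis records and counts log levels. When a Kinesis stream triggers a Lambda function, each record arrives with its payload base64-encoded under Records[i].kinesis.data. The JSON log format with a "level" field and the make_event helper are assumptions made for illustration. The event is built locally, so no AWS call is made.

```python
import base64
import json


def lambda_handler(event, context):
    """Decode Kinesis records delivered to Lambda and count entries per log level."""
    counts = {}
    for record in event["Records"]:
        # Kinesis payloads reach Lambda base64-encoded.
        payload = base64.b64decode(record["kinesis"]["data"])
        entry = json.loads(payload)
        # "level" is a hypothetical field of the assumed log format.
        level = entry.get("level", "UNKNOWN")
        counts[level] = counts.get(level, 0) + 1
    return counts


def make_event(entries):
    """Hypothetical helper: build a minimal Kinesis-style event for local testing."""
    return {
        "Records": [
            {
                "kinesis": {
                    "data": base64.b64encode(json.dumps(e).encode("utf-8")).decode("ascii")
                }
            }
            for e in entries
        ]
    }


sample_event = make_event(
    [
        {"level": "INFO", "message": "job started"},
        {"level": "ERROR", "message": "row rejected"},
        {"level": "INFO", "message": "job finished"},
    ]
)
result = lambda_handler(sample_event, None)
print(result)
```

In a deployed pipeline the function would typically write its results to a data store instead of returning them. That step is left out here to keep the sketch self-contained.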
Applications