
IITM Pravartak and AWS Certified Advanced Professional course in Big Data and Cloud Analytics
Live Classes with 100% Job Placement Assistance
About IIT MADRAS & Incubation Cell

The Indian Institute of Technology Madras, or IIT Madras, is recognized globally and is ranked the No. 1 engineering university in India. With internationally renowned faculty and a highly motivated student community, IIT Madras stands true as an Institute of Eminence. It is a public technical and research university established by the Government of India. IITM Incubation Cell nurtures technology ventures through their start-up phase by providing all the support

GUVI In A Glance

Grab Ur Vernacular Imprint (GUVI), an IIT-Madras incubated company, is the world’s first vernacular ed-tech learning platform. Founded by ex-PayPal employees, GUVI empowers students to master programming skills in the comfort of their native language. Its mission is to impart technical skills to all through focussed pedagogical tools.
About the Co-Founders

Sridevi
Co-Founder at GUVI
20+ years of Technical Expertise & more
Tech Women Entrepreneur who was selected for Google Developers’ Launchpad Program

Arun Prakash
CEO & Founder at GUVI
20+ years of Technical Expertise & more
Built 7 Products from Scratch, Mentored 1000+ students, Hosted 200+ sessions & 25+ webinars

Bala Murugan
Co-Founder at GUVI
17+ years of experience with IT industry
Technologist with 9+ years of Entrepreneurial experience & Member of the Syllabus Sub-Committee at Anna University

Data Engineering Program


Delivered as 5 months of weekend live online classes, our Data Engineering Program covers every technology in depth. Hands-on training with 20+ industry projects, backed by bi-weekly hackathons, mock interviews, and unlimited practice sessions on coding practice platforms like CodeKata, WebKata, and MicroArc, helps students with absolutely no tech knowledge to skill up and get ready for Data Engineering job roles. Along with an industry-oriented curriculum and industry-recognized certification, we offer 50+ guaranteed interviews with 100% job placement support.
Why GUVI’s Data Engineering Class?

175% Highest Hike
₹21 Lakhs Highest Salary
40% Hike in Demand
200+ Hiring Partners
360+ Best Industry Experts
100% Job Placement Support
Pre-Bootcamp Phase

Book a Seat with ₹8000 (100% Refundable)
Attend Pre-Bootcamp Session (2 sessions)
Take Assessment
If Selected/Interested: Proceed to Data Engineering Program with ₹1,23,900 (Course Fee) - ₹8000 (Booking Fee) = ₹1,15,900 (Remaining Fee)
If Not Selected/Not Interested: Immediate Refund of ₹8000
Data Engineering Program

5-Months Weekend Live Online Class
Hands-on Industry Projects + Bi-weekly Hackathons
Technical Mentorship by Industry Experts + Periodic Developer Meetups
Practice on Coding Practice Platforms: CodeKata, WebKata, MicroArc
Mock Interviews + Group Discussion Test
Live Cumulative Coding Test + Capstone Project
Proceed to Placement Phase


Placement Phase

Tech-Skilled Data Engineering Learner Enters Placement Window
Exclusive Skill-based Job Notifications
Company-specific Expectations Setting (Hits Interviews Arranged)
Tech-Guidance + 100% Placement Support

Top skills you’ll learn!

Strong command of Python.
Solid foundation in databases.
Cloud services.
Hands-on experience in Big Data.
Excellent knowledge of data cleaning & data visualization techniques.
Technologies covered

Python (Primary)
Database
Shell Script
Orchestrator
Cloud Services
Big Data
Data Cleaning
Data Pipelines
Infrastructure as Code
Data Security & Privacy
Capstone Project
Program Curriculum

Module 1 : Python

Students explore Python, a versatile and beginner-friendly programming language. Python is known for its readability and wide range of applications, from web development and data analysis to artificial intelligence and automation. It offers a rich ecosystem of libraries and tools, making it a popular choice for both novice and experienced programmers.

Why Python?
Python IDE
Hello World Program
Variables & Names
String Basics
List
Tuple
Dictionaries
Conditional Statements
For and While Loops, Try and Except
Numbers and Math Functions
Common Errors in Python
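
As a taste of what these topics look like in practice, here is a minimal sketch that combines variables, a list, a dictionary, a for loop, and try/except. The function name `safe_divide` and the sample values are made up for illustration and are not taken from the course material.

```python
def safe_divide(numerator, denominator):
    """Return numerator / denominator, or None when the division is not possible."""
    try:
        return numerator / denominator
    except ZeroDivisionError:
        # Raised when the denominator is 0
        return None
    except TypeError:
        # Raised when an operand is not a number, e.g. a string
        return None


# A list with one good value, one zero, one wrong type, and another good value
values = [10, 0, "five", 4]

# Collect the outcome of each division in a dictionary keyed by the input
results = {}
for value in values:
    results[str(value)] = safe_divide(100, value)

print(results)
```

Two of the most common beginner errors, dividing by zero and mixing strings with numbers, are caught here instead of stopping the program.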


Module 2 : Python (Advanced)

Students will dive into advanced concepts such as comprehensions, file handling, regular expressions, object-oriented programming, pickling, and many more essential concepts.

Functions, Lambda, Filter and Map
Python Functions as Arguments
List Comprehension
Debugging in Python
Class and Objects
Inheritance, polymorphism, abstraction
Linear and non-linear data structures
Singly, doubly, and circular linked lists, binary tree
Bubble, insertion, merge, quick, and heap sorting
File Handling (Text, JSON, CSV)
Iterators
Pickling, Multi-threading
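
Several of the topics above can be sketched in a few lines. The example below is an illustrative sketch only: it shows `map` and `filter` with lambdas, the equivalent list comprehension, a function passed as an argument, and one of the listed sorting algorithms (merge sort). The helper names `apply_twice` and `merge_sort` and the sample numbers are assumptions chosen for the example.

```python
numbers = [5, 2, 9, 1, 7]

# map applies a function to every element; filter keeps elements that pass a test
squares = list(map(lambda n: n * n, numbers))
odds = list(filter(lambda n: n % 2 == 1, numbers))

# The same filter-then-map logic written as a list comprehension
squares_of_odds = [n * n for n in numbers if n % 2 == 1]


def apply_twice(func, value):
    """Functions are ordinary values, so they can be passed as arguments."""
    return func(func(value))


def merge_sort(items):
    """Sort a list by splitting it in half, sorting each half, and merging."""
    if len(items) <= 1:
        return list(items)
    mid = len(items) // 2
    left = merge_sort(items[:mid])
    right = merge_sort(items[mid:])

    merged = []
    i = j = 0
    # Repeatedly take the smaller front element of the two sorted halves
    while i < len(left) and j < len(right):
        if left[i] <= right[j]:
            merged.append(left[i])
            i += 1
        else:
            merged.append(right[j])
            j += 1
    # One half is exhausted; append whatever remains of the other
    merged.extend(left[i:])
    merged.extend(right[j:])
    return merged


print(squares, odds, squares_of_odds)
print(apply_twice(lambda n: n + 3, 10))
print(merge_sort(numbers))
```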


Module 3 : SQL

Students dive into SQL (Structured Query Language) to acquire the skills needed for managing and querying relational databases. SQL enables them to retrieve, update, and manipulate data, making it a fundamental tool for working with structured data in various applications.

Sub module
Joins
SQL Outer Join
SQL Left Join
SQL Right Join
SQL Full Join
SQL Cross Join
Integrating Python with SQL
Window functions (rank, dense rank, row number, etc.)

Sub module
Data Types, Variables, and Constants
Conditional Structures (IF, CASE, GOTO, and NULL)
Stored Procedures and Functions
Subqueries

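
To illustrate how Python and SQL fit together, here is a small sketch of a left join and the three window functions named above. It uses Python's built-in `sqlite3` module as a stand-in so that it runs without a database server (window functions need SQLite 3.25 or newer); the course itself works with MySQL. The table names, column names, and rows are invented for the example.

```python
import sqlite3

# In-memory database: nothing is written to disk
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE departments (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE employees (
        id INTEGER PRIMARY KEY,
        name TEXT,
        salary INTEGER,
        department_id INTEGER REFERENCES departments(id)
    );
    INSERT INTO departments VALUES (1, 'Data'), (2, 'Cloud');
    INSERT INTO employees VALUES
        (1, 'Asha', 900, 1),
        (2, 'Ravi', 900, 1),
        (3, 'Meena', 700, 1),
        (4, 'Kiran', 800, NULL);
""")

# LEFT JOIN keeps every employee, even one without a department
left_join_rows = conn.execute("""
    SELECT e.name, d.name
    FROM employees AS e
    LEFT JOIN departments AS d ON d.id = e.department_id
    ORDER BY e.id
""").fetchall()

# RANK leaves gaps after ties, DENSE_RANK does not, ROW_NUMBER is always unique
ranked_rows = conn.execute("""
    SELECT name, salary,
           RANK()       OVER (ORDER BY salary DESC)       AS rnk,
           DENSE_RANK() OVER (ORDER BY salary DESC)       AS dense_rnk,
           ROW_NUMBER() OVER (ORDER BY salary DESC, name) AS row_num
    FROM employees
    ORDER BY row_num
""").fetchall()

for row in left_join_rows:
    print(row)
for row in ranked_rows:
    print(row)

conn.close()
```

The two employees tied on salary share rank 1; the next salary gets rank 3 under `RANK` but 2 under `DENSE_RANK`.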
Module 4 : RDBMS

Students explore RDBMS (Relational Database Management System) to understand the database technology that organizes data into structured tables with defined relationships.

MySQL
SQL Keys
Primary Key
Foreign Key
Unique Key
Composite Key
Triggers
Indexes
Transactions
Views

Module 5 : MongoDB

Students delve into MongoDB to understand this popular NoSQL database, which stores data in flexible, JSON-like documents. They learn how MongoDB's scalability and speed make it suitable for handling large volumes of unstructured data.

Sub module
CAP Theorem
Structured and unstructured data
OLTP vs OLAP
Schema vs Schema-less
Dimensional modelling
Cluster setup and monitoring
Insert First Data
CRUD Operations
Insert Many
Update and Update Many
Delete and Delete Many

Sub module
Projection
Intro to Embedded Documents
Embedded Documents in Action
Adding Arrays
Fetching Data From Structured Data
Schema Types
Types of Data in MongoDB
Relationships between data
Aggregation

Sub module
One to One using Embed Method
One to One using Reference Method
One to Many using Embed Method
One to Many using Reference Method
Assessment-MongoDB

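
The difference between the embed method and the reference method can be sketched with plain Python dictionaries, which have the same shape as MongoDB's JSON-like documents. This is an illustrative sketch only: it does not connect to MongoDB, and the field names, sample data, and the helper `orders_for` are assumptions made for the example.

```python
# One-to-many, embed method: the orders live inside the customer document,
# so a single read returns the customer together with all orders.
customer_embedded = {
    "_id": 1,
    "name": "Asha",
    "orders": [
        {"item": "keyboard", "qty": 1},
        {"item": "mouse", "qty": 2},
    ],
}

# One-to-many, reference method: orders are separate documents that point
# back to the customer through a customer_id field.
customers = [{"_id": 1, "name": "Asha"}]
orders = [
    {"_id": 101, "customer_id": 1, "item": "keyboard", "qty": 1},
    {"_id": 102, "customer_id": 1, "item": "mouse", "qty": 2},
]


def orders_for(customer_id, order_docs):
    """Resolve the reference by collecting the orders that point at the customer."""
    return [doc for doc in order_docs if doc["customer_id"] == customer_id]


embedded_items = [order["item"] for order in customer_embedded["orders"]]
referenced_items = [order["item"] for order in orders_for(1, orders)]

print(embedded_items)
print(referenced_items)
```

Embedding suits data that is always read together; referencing suits data that grows without bound or is shared between documents.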
Module 6 : Shell Script

Students explore shell scripting in the Linux environment, where they learn to write and execute scripts using the command-line interface. Shell scripts are text files containing a series of commands, and students discover how to automate tasks.

Introduction to Linux
Basic Shell Script Commands
Creating Frameworks
Cron Jobs, Email Alerts
Running Batch Jobs

Module 7 : Git

Students study Git, a distributed version control system, to learn how it tracks changes in software code. Git allows collaborative development, enabling multiple people to work on the same project simultaneously while managing different versions of code. It is essential for software development, as it tracks revisions, facilitates collaboration, and helps in code management.

Introduction to Git
Git commands
Cloning a repository in VS Code
Working with branches, add, commit, push, and merge from VS Code

Module 8 : AWS Cloud

Students delve into cloud computing, which involves delivering various computing services (such as servers, storage, databases, networking, software, and analytics) over the internet.

Sub module
Introduction to Cloud
AWS Services Overview
Server vs Serverless
IAM, Roles, Policies
EC2, VMs
S3
RDS – MySQL Free Tier Database
Integrating RDS to Local System and Integrating RDS to Python Environment

Sub module
Lambda
CloudWatch
Integrating all the above components and RDS
Glue, Data Crawler, Athena
Monitoring ETL pipeline with Step Functions

System Design
Load Balancer and High Availability
Horizontal vs Vertical Scaling
Monolithic vs Microservice
Distributed Messaging Service and AWS SQS
CDN (Content Delivery Network)
Caching, Scalability
AWS API Gateway


Module 9 : Snowflake

Students study Snowflake to grasp modern cloud-based data warehousing, focusing on its architecture, data sharing, scalability, and data analytics applications.

Introduction to Snowflake
Difference between Data Lake, Data Warehouse, Delta Lake, and Database
Dimension and Fact Tables
Roles and Users
Data Modeling, Snowpipe
MOLAP and ROLAP
Partitioning and Indexing
Data Marts, Data Cubes & Caching
Data Masking
Handling JSON Files
Data Loading from S3 and Transformation

Module 10 : Airflow

Students explore Airflow to understand its role in orchestrating and automating workflows, scheduling tasks, managing data pipelines, and monitoring job execution.

Why and what is Airflow
Airflow UI
Run first DAG
Grid view
Graph view
Landing times view
Calendar view
Gantt view
Code view
Core concepts of Airflow
DAGs
Scope
Operators
Control flow
Task and task instance
Database and executors
ETL/ELT process implementation
Monitoring ETL pipeline with Airflow
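
The core idea behind a DAG (directed acyclic graph) is that each task runs only after the tasks it depends on. The sketch below illustrates that ordering with Python's standard-library `graphlib` module (Python 3.9+). It is not Airflow code and does not use the Airflow API; the task names, the `run_pipeline` helper, and the task functions are hypothetical.

```python
from graphlib import TopologicalSorter

# Each task maps to the set of upstream tasks that must finish first
dependencies = {
    "extract": set(),
    "transform": {"extract"},
    "load": {"transform"},
    "notify": {"load"},
}

# Hypothetical task bodies; a real pipeline would read, reshape, and write data
tasks = {
    "extract": lambda: "extracted",
    "transform": lambda: "transformed",
    "load": lambda: "loaded",
    "notify": lambda: "notified",
}


def run_pipeline(dependency_graph, task_functions):
    """Run every task once, in an order that respects the dependencies."""
    log = []
    for name in TopologicalSorter(dependency_graph).static_order():
        log.append((name, task_functions[name]()))
    return log


execution_log = run_pipeline(dependencies, tasks)
for name, result in execution_log:
    print(name, "->", result)
```

An orchestrator such as Airflow adds scheduling, retries, and monitoring on top of this dependency ordering.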


Module 11 : Big Data

Students delve into big data to learn about handling and analyzing vast datasets, using tools like Hadoop, Hive, HDFS, and Pig for insights and decision-making.

Sub module
Installing Hive, installing MySQL locally, running Hive queries to integrate local and HDFS file systems
Installing Pig
Working with Pig scripts and integrating with local and HDFS file systems
Installing HBase and working with HBase queries
Installing Cassandra and working with Cassandra

Sub module
Installing Sqoop and Flume and performing data migration
Local RDBMS to HDFS
Local RDBMS to Hive
Local RDBMS to HBase
HDFS to local RDBMS
Hive to RDBMS

Module 12 : Kafka

Students learn about Kafka, an open-source stream processing platform. Kafka is used for ingesting, storing, processing, and distributing real-time data streams. Students explore Kafka's architecture, topics, producers, consumers, and its role in handling large volumes of data with low latency.

Introduction to Kafka
Producer, Consumer, Consumer Groups
Topics, Offsets, Partitions, Brokers
ZooKeeper, Replication
Batch vs Real-time Streaming
Real-time streaming process
Assignment and Task


Module 13 : Spark

Students will explore Spark, an open-source, distributed computing framework that provides high-speed, in-memory data processing for big data analytics.

Sub module
Introduction to Apache Spark
Spark architecture, Hadoop vs Spark
RDDs, DAG, transformations, actions
Data Partitioning and Shuffling
DataFrame & Spark SQL
Streaming data handling in Spark

Sub module
Spark batch data processing (CSV, JSON, Parquet files)
AWS Data Management Tools [AWS EMR, Glue jobs]
Assignment & Assessments

Module 14 : Data cleaning

Students engage in data cleaning to understand the process of identifying and correcting errors or inconsistencies in datasets, ensuring data accuracy and reliability for analysis and reporting.

Structured vs Unstructured Data using Pandas
Common data issues and how to clean them
Data cleaning with Pandas and PySpark
Handling JSON Data
Meaningful data transformation (Scaling and Normalization)
Example: Movies Data Set Cleaning
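
To make the scaling and normalization topic concrete, here is a minimal sketch using only the Python standard library; the course itself uses Pandas and PySpark for this. The ratings list and the helper names `drop_missing`, `min_max_scale`, and `standardize` are assumptions made for the example.

```python
from statistics import mean, pstdev


def drop_missing(values):
    """Remove missing entries, represented here as None."""
    return [v for v in values if v is not None]


def min_max_scale(values):
    """Rescale values linearly so the smallest becomes 0.0 and the largest 1.0."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values are identical, so there is no range to scale over
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def standardize(values):
    """Shift and scale values to mean 0 and (population) standard deviation 1."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]


# Hypothetical movie ratings with two missing entries
ratings = [2.0, None, 4.0, 6.0, None, 8.0]

clean = drop_missing(ratings)
scaled = min_max_scale(clean)
z_scores = standardize(clean)

print(clean)
print(scaled)
print(z_scores)
```

Min-max scaling keeps values within a fixed range, while standardization centres them around zero; which one fits depends on the analysis that follows.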


Module 15 : Prometheus

Students study Prometheus to explore its role as an open-source monitoring and alerting toolkit, used for collecting and visualizing metrics from various systems, aiding in performance optimization and issue detection.

Sub module
Server, architecture
Installation
Understanding the Prometheus UI
Node exporters
PromQL (aggregations, functions, operators, data types)
Integrating Python with Prometheus
Counter, gauge, summary, histogram
Recording rules
Alerting rules
Alertmanager, installation of Alertmanager
Grouping, inhibiting, throttling, silencing alerts

Sub module
Slack integration with Prometheus through Alertmanager
PagerDuty integration with Alertmanager
Blackbox exporter, installation
MySQL exporter
Integrating AWS and Prometheus
AWS CloudWatch and Prometheus
Implementing a Grafana dashboard for Prometheus


Module 16 : Datadog

Metrics
Dashboards
Alerts
Monitors
Tracing
Logs monitoring
Integrations

Module 17 : Docker

Students learn about Docker to understand containerization technology, which allows them to package applications and their dependencies into portable, efficient containers. Docker facilitates easy deployment, scaling, and management of applications across various environments.

What is Docker
Installation of Docker
Docker images, containers
Dockerfile
Docker volume
Docker registry
Containerizing an application with Docker (hands-on)

Module 18 : Kubernetes

Nodes
Pods
ReplicaSets
Deployments
Namespaces
Ingress

Hear it from our learners

“They are very approachable and friendly when we ask any doubt or any clarification. Before joining Guvi I have already done a course of data science in another institution. When comparing these two institutions, there is a lot of difference in teaching. I love that the mentor who is teaching the course is not only a mentor but a professional too. This is a very unique thing about Guvi. I will rate 5/5 to Guvi.”
Vishally

“GUVI is one of the best platforms to start a new course and a new career. Advanced Programming and Master Data Science is one of the best programs which are been trained with industry experts. It has its own software to practise and a huge number of exercises to master any topic.”
Tejas Samanthapudi

“Guvi helps me to improve my self-confidence in coding skills. The zoom classes are totally comfortable, friendly and easy to learn. It helps me to understand the basic and the core concepts and it helped me to build logical skills. I got great mentors which helped me to bridge between the academics. I'm very proud. Thanks to Guvi.”
Gokila

“I have attended several classes of Masters in Data science course conducted by Guvi. It is really helpful to gain knowledge as it is different from other online courses. Here, we have mentors in live sessions, so we will be more concentrated than other online courses where we watch pre recorded videos. Also we are getting weekly tasks that would make us learn even if there is no class. I am thankful for all the people in Guvi for building up such a valuable program for our career.”
Gokul

“Guvi offers a cordial, supportive and friendly environment to learners. With excellent support and 24*7 assistance from the mentors Guvi does not leave any stone unturned to improvise your learning. Thanks for being such an inspiration to us.”
Gokak Mohd Ishtiyaque

“I always liked coding but I didn't really get a good platform to learn things as per industrial requirements. When I was in search I got to know about Guvi, I really felt trustworthy by their response. When I joined the Data Science course the weekend live classes and recorded course videos has made learning easy to me. Eventually I started spending more time practicing in Codekata. I loved the way Guvi took care of clarifying doubts asap. Thank you!”
Sonia Kola

“Hello folks, if you are thinking of a career transition in the ‘Data Science’ field then, “GUVI” is the best platform to get nourished, indulged and protruded in this upcoming field and also, it doesn’t matter from which engineering background you are or whether you are a working fellow. The best thing I found here is you will always get motivated unknowingly and become curious to learn more & more from the tutorial videos conducted by the IITM professors. GUVI helps me to think about the problem in multidimensional ways. Thanks to the GUVI team”
Shubham Nehete

“The data science course is very good, the concepts are being explained in a crisp manner. The instructors have good depth in the subject and solve every doubt one might have. Thanks to GUVI for setting a great structured program.”
Diliban Sibi

“The course videos help you to learn the tools by yourself and you can track the progress. The mentors are very patient and ensure that students understand the concept, sometimes going the extra mile and explaining. Sometimes the mentors try to teach in your native language, if needed. The practice platforms are easy to learn and practice. By completing this data science course, sure you can become a Data Scientist.”
Sridharan K

“This course is designed being dynamic, interactive and range of materials to refer. This is very well structured in such a way that it makes the participants to perform, discuss, and to participate in assessments that will help the participants to maximize the utilization. This program is suitable for all students, freshers and working professionals. This course is excellent for those who would like to learn the basics of program like Python and would like to broaden their knowledge in Data Science. I enjoyed seeing videos in GUVI website from experts that also explains the concepts in a detailed manner.”
Anbazhagan
Instructors
Learn from India’s top Industry Leaders

Dr Sanatan Sukhija, Doctorate - IIT-Ropar
Dr Yayati Gupta, Doctorate - IIT-Ropar
Shabarinath Premlal, Founder of ResPro Labs
Mr Koushik Krishnan, Data Science Analyst at Credit Suisse
Mr Nethaji Nirmal, Co-Founder, Webdojo
Mr Revanth Guthala, Lead Data Analyst, Yulu
Mr Bala Chandar, Data Scientist, US-based client
Mr Abhishek, Data Scientist, Bosch
Our Placements

"I got a 57% Hike,

Thank you GUVI Team"

Sonia Kola, Data Scientist


“Every topic was covered from scratch”

Rakesh, Python Developer

Program Details

5-Months Weekend Live Online Classes

Please contact our Data Engineering coordinator

Deepak: +91-97360 97320

Total Course Fee ₹1,23,900

Pre-BootCamp Booking Fees -₹8000

Remaining Fee ₹1,15,900

Now become a proficient Data Engineer at Affordable Installments! Master Data Engineering at just ₹10893/Month, Up to 12 Months

Note: Valid documents are required for EMI Process. Additional processing fee will be applied. EMI Amount might vary with Vendors.

No - Eligibility / Restrictions!!!

Students & Working Professionals seeking opportunities to upskill their Data Engineering proficiency for faster career growth.
Develop your Data Engineering skills & Unlock a challenging & rewarding Career
Begin your Skill Development Journey Today!

For further information:


Deepak@guvi.in
+91 9736097320
IITM Research park - phase 2, module #9, 3rd floor, D block, Kanagam Rd, Tharamani, Chennai, Tamil Nadu, India. 600113
