You are on page 1of 9

18ECP101L-MASSIVE OPEN ONLINE COURSE-I

SEMESTER V
YEAR: NOV 2022

SQL FOR DATA SCIENCE

Report Submitted by
KATAKAM SAIKUMAR [RA2011053010059]

Faculty in-charge
Mrs S Hannah Pauline

DEPARTMENT OF ELECTRONICS AND COMMUNICATION ENGINEERING

FACULTY OF ENGINEERING AND TECHNOLOGY

SRM INSTITUTE OF SCIENCE AND TECHNOLOGY

S.R.M. Nagar, Kattankulathur - 603203, Kancheepuram District


TABLE OF CONTENTS
S.NO TITLE PAGE NO.
1. COURSE DETAILS 2

2. INSTRUCTOR PROFILE 3

3. INTRODUCTION OF THE COURSE 4

4. COURSE TIMELINE

5. SYLLABUS

6. WEEK-1 CONTENTS

7. WEEK-2 CONTENTS

8. WEEK-3 CONTENTS

9. WEEK-4 CONTENTS

10. COURSE GRADE DETAILS

11. COURSE OUTCOME

12. PROOF OF COURSE COMPLETION

THE END
1. COURSE DETAILS

COURSE PLATFORM- COURSERA


COURSE TITLE- SQL FOR DATA SCIENCE

OFFERED BY- University of California, Davis

COURSE DURATION- 4 weeks

DASHBOARD:

This Specialization is intended for a learner with no previous coding


experience seeking to develop SQL query fluency. Through four
progressively more difficult SQL projects with data science
applications, you will cover topics such as SQL basics, data
wrangling, SQL analysis, AB testing, distributed computing using
Apache Spark, Delta Lake and more. These topics will prepare you
to apply SQL creatively to analyse and explore data; demonstrate
efficiency in writing queries; create data analysis datasets; conduct
feature engineering, use SQL with other data analysis and machine
learning toolsets; and use SQL with unstructured data sets.
2. INSTRUCTOR PROFILE

3. INTRODUCTION OF THE COURSE


As data collection has increased exponentially, so has the need for people skilled at using and
interacting with data; to be able to think critically, and provide insights to make better
decisions and optimize their businesses. This is a data scientist, “part mathematician, part
computer scientist, and part trend spotter” (SAS Institute, Inc.). According to Glassdoor, being
a data scientist is the best job in America; with a median base salary of $110,000 and
thousands of job openings at a time. The skills necessary to be a good data scientist include
being able to retrieve and work with data, and to do that you need to be well versed in SQL,
the standard language for communicating with database systems.
You will create new tables and be able to move data into them. You will learn
common operators and how to combine the data. You will use case statements and
concepts like data governance and profiling. You will discuss topics on data, and
practice using real-world programming assignments. You will interpret the
structure, meaning, and relationships in source data and use SQL as a professional
to shape your data for targeted analysis purposes.
4. COURSE TIMELINE
DATE OF ENROLLMENT: 24th August 2022
DATE OF COMPLETION: 2nd November 2022

5. SYLLABUS

 selecting and retrieving data with SQL


 filtering, sorting, and calculating data with

SQL
 subqueries and joins in SQL

 modifying and analysing data with SQL


6. WEEK-1 CONTENTS
 selecting and retrieving data with SQL
In this module, you will be able to define SQL and discuss how
SQL differs from other computer languages. You will be able to
compare the roles of a database administrator and a data
scientist, and explain the differences between one-to-one, one-
to-many, and many-to-many relationships with databases. You
will be able to use the SELECT statement and talk about some
basic syntax rules. You will be able to add comments in your
code and synthesize its importance.
7. WEEK-2 CONTENTS
 filtering, sorting, and calculating data with

SQL
In this module, you will be able to use several more new clauses
and operators including WHERE, BETWEEN, IN, OR, NOT,
LIKE, ORDER BY, and GROUP BY. You will be able to use the
wildcard function to search for more specific or parts of records,
including their advantages and disadvantages, and how best to
use them. You will be able to discuss how to use basic math
operators, as well as aggregate functions like AVERAGE,
COUNT, MAX, MIN, and others to begin analyzing our data.
8. WEEK-3 CONTENTS
 subqueries and joins in SQL
In this module, you will be able to discuss subqueries,
including their advantages and disadvantages, and when
to use them. You will be able to recall the concept of a key
field and discuss how these help us link data together with
JOINs. You will be able to identify and define several
types of JOINs, including the Cartesian join, an inner join,
left and right joins, full outer joins, and a self join. You will
be able to use aliases and pre-qualifiers to make your
SQL code cleaner and efficient.
9. WEEK-4 CONTENTS
 modifying and analyzing data with SQL

In this module, you will be able to discuss how to modify


strings by concatenating, trimming, changing the case, and
using the substring function. You will be able to discuss the
date and time strings specifically. You will be able to use
case statements and finish this module by discussing data
governance and profiling. You will also be able to apply
fundamental principles when using SQL for data science.
You'll be able to use tips and tricks to apply SQL in a data
science context.

You might also like