
DP-900: Microsoft Azure Data Fundamentals

Instructors:
Piyush Wairale
Adarsh Madre
© Copyright Microsoft Corporation. All rights reserved.
About this course

Course format:
One-day instructor-led with hands-on exercises

Course objectives:
• Describe core data concepts
• Identify services for relational data
• Identify services for non-relational data
• Identify services for data analytics



Course agenda

Content                                               Units
Explore fundamentals of data                          • Core data concepts
                                                      • Data roles and services
Explore fundamentals of relational data in Azure      • Explore relational data concepts
                                                      • Explore Azure services for relational data
Explore fundamentals of non-relational data in Azure  • Fundamentals of Azure Storage
                                                      • Fundamentals of Azure Cosmos DB
Explore fundamentals of data analytics                • Large-scale data warehousing
                                                      • Streaming and real-time analytics
                                                      • Data visualization



Lab environment

• Labs in this course are based on exercises in Microsoft Learn
• Go to https://msle.learnondemand.net/
• Sign in with your training key and complete the labs



Explore fundamentals of data



Agenda
• Core data concepts
• Data roles and services



1: Core data concepts



What is data?
Values used to record information, often representing entities that have one or more attributes

Structured

Customer
ID  FirstName  LastName  Email                Address
1   Joe        Jones     joe@litware.com      1 Main St.
2   Samir      Nadoy     samir@northwind.com  123 Elm Pl.

Product
ID   Name         Price
123  Hammer       2.99
162  Screwdriver  3.49
201  Wrench       4.25

Semi-structured

{
  "firstName": "Joe",
  "lastName": "Jones",
  "address":
  {
    "streetAddress": "1 Main St.",
    "city": "New York",
    "state": "NY",
    "postalCode": "10099"
  },
  "contact":
  [
    {
      "type": "home",
      "number": "555 123-1234"
    },
    {
      "type": "email",
      "address": "joe@litware.com"
    }
  ]
}

{
  "firstName": "Samir",
  "lastName": "Nadoy",
  "address":
  {
    "streetAddress": "123 Elm Pl.",
    "unit": "500",
    "city": "Seattle",
    "state": "WA",
    "postalCode": "98999"
  },
  "contact":
  [
    {
      "type": "email",
      "address": "samir@northwind.com"
    }
  ]
}

Unstructured

[Slide graphic only; no text content]
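The difference between the structured and semi-structured examples can be sketched in a few lines of Python. This is only an illustration: the two customer documents come from the slide, but wrapping them in a single JSON array and the helper names `address_fields` and `contact_types` are assumptions made for the example.

```python
import json

# The two customer documents from the slide, wrapped in a JSON array
# (the array wrapper is an assumption made for this sketch).
customers_json = """
[
  {
    "firstName": "Joe",
    "lastName": "Jones",
    "address": {"streetAddress": "1 Main St.", "city": "New York",
                "state": "NY", "postalCode": "10099"},
    "contact": [
      {"type": "home", "number": "555 123-1234"},
      {"type": "email", "address": "joe@litware.com"}
    ]
  },
  {
    "firstName": "Samir",
    "lastName": "Nadoy",
    "address": {"streetAddress": "123 Elm Pl.", "unit": "500",
                "city": "Seattle", "state": "WA", "postalCode": "98999"},
    "contact": [
      {"type": "email", "address": "samir@northwind.com"}
    ]
  }
]
"""

customers = json.loads(customers_json)


def address_fields(customer):
    """Return the sorted attribute names present in a customer's address."""
    return sorted(customer["address"].keys())


def contact_types(customer):
    """Return the contact types recorded for a customer, in document order."""
    return [entry["type"] for entry in customer["contact"]]


# Unlike rows in a table, each document can carry a different set of fields.
for customer in customers:
    print(customer["firstName"], address_fields(customer), contact_types(customer))
```

Only the second document has a "unit" attribute, and the two documents list different numbers of contact entries. A relational table would need the same columns for both rows.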



How is data stored?

Files

{
  "customers":
  [
    { "firstName": "Joe", "lastName": "Jones"},
    { "firstName": "Samir", "lastName": "Nadoy"}
  ]
}

Databases

Customer
ID  Email                Address
1   joe@litware.com      1 Main St.
2   samir@northwind.com  123 Elm Pl.

Product
ID   Name         Price
123  Hammer       2.99
162  Screwdriver  3.49
201  Wrench       4.25

Order
OrderNo  OrderDate  Customer
1000     1/1/2022   1
1001     1/1/2022   2

LineItem
OrderNo  ItemNo  ProductID  Quantity
1000     1       123        1
1000     2       201        2
1001     1       123        2



Transactional data workloads

[Slide diagram: Order table]



Analytical data workloads

[Slide diagram: DW (data warehouse) and DL (data lake)]



1: Knowledge check
How is data in a relational table organized?
❑ Rows and Columns
❑ Header and Footer
❑ Pages and Paragraphs

Which of the following is an example of unstructured data?


❑ A comma-delimited text file with EmployeeID, EmployeeName, and EmployeeDesignation fields

❑ Audio and Video files

❑ A table within a relational database

What is a data warehouse?


❑ A non-relational database optimized for read and write operations

❑ A relational database optimized for read operations

❑ A storage location for unstructured data files


2: Data roles and services



Data professional roles

Database Administrator
• Database provisioning, configuration and management
• Database security and user access
• Database backups and resiliency
• Database performance monitoring and optimization

Data Engineer
• Data integration pipelines and ETL processes
• Data cleansing and transformation
• Analytical data store schemas and data loads

Data Analyst
• Analytical modeling
• Data reporting and summarization
• Data visualization

Microsoft cloud services for data



2: Knowledge check
Which one of the following tasks is the responsibility of a database administrator?
❑ Backing up and restoring databases
❑ Creating dashboards and reports
❑ Creating pipelines to process data in a data lake

Which role is most likely to use Azure Data Factory to define a data pipeline for an ETL process?
❑ Database Administrator

❑ Data Engineer

❑ Data Analyst

Which single service would you use to implement data pipelines, SQL analytics, and Spark analytics?
❑ Azure SQL Database

❑ Microsoft Power BI

❑ Azure Synapse Analytics


2: Explore fundamentals of relational data in Azure



Agenda
• Explore relational data concepts
• Explore Azure services for relational data



1: Explore relational data concepts



Relational tables
• Data is stored in tables
• Tables consist of rows and columns
• All rows have the same columns
• Each column is assigned a datatype

Customer
ID  FirstName  MiddleName  LastName  Email                Address      City
1   Joe        David       Jones     joe@litware.com      1 Main St.   Seattle
2   Samir                  Nadoy     samir@northwind.com  123 Elm Pl.  New York

Product
ID   Name         Price
123  Hammer       2.99
162  Screwdriver  3.49
201  Wrench       4.25

Order
OrderNo  OrderDate  Customer
1000     1/1/2022   1
1001     1/1/2022   2

LineItem
OrderNo  ItemNo  ProductID  Quantity
1000     1       123        1
1000     2       201        2
1001     1       123        2
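As a rough illustration of these points, the Customer table can be built with Python's built-in sqlite3 module. The column types and NOT NULL constraints are assumptions chosen for the sketch (the slide does not specify them). SQLite also treats declared types as loose "affinities", whereas most server databases enforce them strictly.

```python
import sqlite3

# In-memory database so the sketch leaves nothing behind.
conn = sqlite3.connect(":memory:")

# Every column is declared with a datatype (types here are assumed).
conn.execute("""
    CREATE TABLE Customer (
        ID         INTEGER PRIMARY KEY,
        FirstName  TEXT NOT NULL,
        MiddleName TEXT,
        LastName   TEXT NOT NULL,
        Email      TEXT,
        Address    TEXT,
        City       TEXT
    )
""")

# Both rows supply a value for every column; a missing value is stored as NULL.
conn.executemany(
    "INSERT INTO Customer VALUES (?, ?, ?, ?, ?, ?, ?)",
    [
        (1, "Joe", "David", "Jones", "joe@litware.com", "1 Main St.", "Seattle"),
        (2, "Samir", None, "Nadoy", "samir@northwind.com", "123 Elm Pl.", "New York"),
    ],
)
conn.commit()

cursor = conn.execute("SELECT * FROM Customer ORDER BY ID")
column_names = [description[0] for description in cursor.description]
rows = cursor.fetchall()

print(column_names)
for row in rows:
    print(row)
```

Samir's row still has a MiddleName column; it simply holds NULL (Python `None`), which is how a table keeps the same columns for every row.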



Normalization
• Separate each entity into its own table
• Separate each discrete attribute into its own column
• Uniquely identify each entity instance (row) using a primary key
• Use foreign key columns to link related entities

Sales Data
OrderNo  OrderDate  Customer                           Product              Quantity
1000     1/1/2022   Joe Jones, 1 Main St, Seattle      Hammer ($2.99)       1
1000     1/1/2022   Joe Jones- 1 Main St, Seattle      Screwdriver ($3.49)  2
1001     1/1/2022   Samir Nadoy, 123 Elm Pl, New York  Hammer ($2.99)       2
…        …          …                                  …                    …

Customer
ID  FirstName  LastName  Address      City
1   Joe        Jones     1 Main St.   Seattle
2   Samir      Nadoy     123 Elm Pl.  New York

Product
ID   Name         Price
123  Hammer       2.99
162  Screwdriver  3.49
201  Wrench       4.25

Order
OrderNo  OrderDate  Customer
1000     1/1/2022   1
1001     1/1/2022   2

LineItem
OrderNo  ItemNo  ProductID  Quantity
1000     1       123        1
1000     2       201        2
1001     1       123        2
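The four normalization rules above can be tried out with a small sqlite3 sketch. The table and column names follow the slide; the datatypes, the composite primary key on LineItem, and the variable names are assumptions made for the example. SQLite only enforces foreign keys when the `foreign_keys` pragma is switched on.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite leaves FK checks off by default

# One table per entity, one column per attribute, a primary key per table,
# and foreign keys linking related entities.
conn.executescript("""
    CREATE TABLE Customer (
        ID INTEGER PRIMARY KEY,
        FirstName TEXT, LastName TEXT, Address TEXT, City TEXT
    );
    CREATE TABLE Product (
        ID INTEGER PRIMARY KEY,
        Name TEXT, Price REAL
    );
    CREATE TABLE "Order" (
        OrderNo INTEGER PRIMARY KEY,
        OrderDate TEXT,
        Customer INTEGER REFERENCES Customer(ID)
    );
    CREATE TABLE LineItem (
        OrderNo INTEGER REFERENCES "Order"(OrderNo),
        ItemNo INTEGER,
        ProductID INTEGER REFERENCES Product(ID),
        Quantity INTEGER,
        PRIMARY KEY (OrderNo, ItemNo)
    );
""")

conn.executemany("INSERT INTO Customer VALUES (?, ?, ?, ?, ?)", [
    (1, "Joe", "Jones", "1 Main St.", "Seattle"),
    (2, "Samir", "Nadoy", "123 Elm Pl.", "New York"),
])
conn.executemany("INSERT INTO Product VALUES (?, ?, ?)", [
    (123, "Hammer", 2.99), (162, "Screwdriver", 3.49), (201, "Wrench", 4.25),
])
conn.executemany('INSERT INTO "Order" VALUES (?, ?, ?)', [
    (1000, "1/1/2022", 1), (1001, "1/1/2022", 2),
])
conn.executemany("INSERT INTO LineItem VALUES (?, ?, ?, ?)", [
    (1000, 1, 123, 1), (1000, 2, 201, 2), (1001, 1, 123, 2),
])
conn.commit()

# Joining on the key columns rebuilds a flat, sales-style view on demand.
sales = conn.execute("""
    SELECT o.OrderNo, c.FirstName || ' ' || c.LastName, p.Name, li.Quantity
    FROM LineItem AS li
    JOIN "Order" AS o ON o.OrderNo = li.OrderNo
    JOIN Customer AS c ON c.ID = o.Customer
    JOIN Product AS p ON p.ID = li.ProductID
    ORDER BY li.OrderNo, li.ItemNo
""").fetchall()

# A line item that points at a non-existent product is rejected.
try:
    conn.execute("INSERT INTO LineItem VALUES (1001, 2, 999, 1)")
    fk_enforced = False
except sqlite3.IntegrityError:
    fk_enforced = True

print(sales)
print("foreign key enforced:", fk_enforced)
```

Each customer name and product name is stored once, and the join resolves the keys back to readable values whenever a combined view is needed.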



Structured Query Language (SQL)
SQL is a standard language for use with relational databases
Standards are maintained by ANSI and ISO
Most RDBMS systems support proprietary extensions of standard SQL

Data Definition Language (DDL)
CREATE, ALTER, DROP, RENAME

CREATE TABLE Product
(
    ProductID INT PRIMARY KEY,
    Name VARCHAR(20) NOT NULL,
    Price DECIMAL NULL
);

Product
ID  Name  Price

Data Control Language (DCL)
GRANT, DENY, REVOKE

GRANT SELECT, INSERT, UPDATE
ON Product
TO user1;

Data Manipulation Language (DML)
INSERT, UPDATE, DELETE, SELECT

SELECT Name, Price
FROM Product
WHERE Price > 2.50
ORDER BY Price;

Product
ID   Name         Price
123  Hammer       2.99
162  Screwdriver  3.49
201  Wrench       4.25

Results
Name         Price
Hammer       2.99
Screwdriver  3.49
Wrench       4.25
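The DDL and DML statements above can be run with Python's built-in sqlite3 module, as a minimal sketch. SQLite has no user accounts, so the DCL example (GRANT) cannot be run there and is left out. The helper name `products_above` is hypothetical, and the explicit NULL on the Price column is omitted because columns are nullable by default.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# DDL: define the table (Price is nullable by default).
conn.execute("""
    CREATE TABLE Product
    (
        ProductID INT PRIMARY KEY,
        Name VARCHAR(20) NOT NULL,
        Price DECIMAL
    )
""")

# DML: insert the three rows shown on the slide.
conn.executemany(
    "INSERT INTO Product (ProductID, Name, Price) VALUES (?, ?, ?)",
    [(123, "Hammer", 2.99), (162, "Screwdriver", 3.49), (201, "Wrench", 4.25)],
)
conn.commit()


def products_above(min_price):
    """DML: run the slide's SELECT with the price threshold as a parameter."""
    cursor = conn.execute(
        "SELECT Name, Price FROM Product WHERE Price > ? ORDER BY Price",
        (min_price,),
    )
    return cursor.fetchall()


print(products_above(2.50))  # all three products cost more than 2.50
print(products_above(3.00))  # the hammer drops out of the results
```

Passing the threshold as a `?` parameter rather than building the SQL string by hand lets the database driver handle quoting.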



Other common database objects

Views
Pre-defined SQL queries that behave as virtual tables

Deliveries (view over Customer and Order)
OrderNo  OrderDate  Address      City
1000     1/1/2022   1 Main St.   Seattle
1001     1/1/2022   123 Elm Pl.  New York

Stored Procedures
Pre-defined SQL statements that can include parameters

Product (row 201 renamed from Wrench to Spanner)
ID   Name     Price
201  Spanner  4.25

Indexes
Tree-based structures that improve query performance

Product
ID   Name         Price
123  Hammer       2.99
162  Screwdriver  3.49
201  Spanner      4.25
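A view and an index can be sketched with sqlite3 as follows. SQLite does not support stored procedures, so the Python function `rename_product` stands in for one here. That function, the index name `idx_product_name`, and the column types are all assumptions made for this example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE Customer (ID INTEGER PRIMARY KEY, Address TEXT, City TEXT);
    CREATE TABLE "Order" (OrderNo INTEGER PRIMARY KEY, OrderDate TEXT, Customer INTEGER);
    CREATE TABLE Product (ID INTEGER PRIMARY KEY, Name TEXT, Price REAL);

    INSERT INTO Customer VALUES (1, '1 Main St.', 'Seattle'), (2, '123 Elm Pl.', 'New York');
    INSERT INTO "Order" VALUES (1000, '1/1/2022', 1), (1001, '1/1/2022', 2);
    INSERT INTO Product VALUES (123, 'Hammer', 2.99), (162, 'Screwdriver', 3.49),
                               (201, 'Wrench', 4.25);

    -- View: a stored query that can be selected from like a table.
    CREATE VIEW Deliveries AS
        SELECT o.OrderNo, o.OrderDate, c.Address, c.City
        FROM "Order" AS o JOIN Customer AS c ON c.ID = o.Customer;

    -- Index: lets lookups by Name avoid scanning the whole table.
    CREATE INDEX idx_product_name ON Product (Name);
""")

deliveries = conn.execute("SELECT * FROM Deliveries ORDER BY OrderNo").fetchall()


def rename_product(product_id, new_name):
    """Stand-in for a parameterized stored procedure (SQLite has none)."""
    conn.execute("UPDATE Product SET Name = ? WHERE ID = ?", (new_name, product_id))
    conn.commit()


rename_product(201, "Spanner")
product_201 = conn.execute("SELECT Name FROM Product WHERE ID = 201").fetchone()[0]

# Ask SQLite how it would run a lookup by name; the last field of each
# plan row is a human-readable description of the step.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM Product WHERE Name = 'Hammer'"
).fetchall()
plan_details = [row[-1] for row in plan]

print(deliveries)
print(product_201)
print(plan_details)
```

On a server database such as Azure SQL Database, the rename would normally be a real stored procedure created with CREATE PROCEDURE and called with the product ID and new name as parameters.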



1: Knowledge check
Which one of the following statements is a characteristic of a relational database?
❑ All columns in a table must be of the same data type
❑ A row in a table represents a single instance of an entity
❑ Rows in the same table can contain different columns

Which SQL statement is used to query tables and return data?


❑ QUERY
❑ READ
❑ SELECT

What is an index?
❑ A structure that enables queries to locate rows in a table quickly
❑ A virtual table based on the results of a query
❑ A pre-defined SQL statement that modifies data



2: Explore Azure services for relational data



Azure SQL

• Near 100% compatibility with SQL Server on-premises
• Automatic backups, software patching, database monitoring, and other maintenance tasks
• Use a single instance with multiple databases, or multiple instances in a pool with shared resources
• Great for migrating most on-premises databases to the cloud



Azure Database services for open-source

Azure Database for MySQL
• PaaS implementation of MySQL in the Azure cloud, based on the MySQL Community Edition
• Commonly used in Linux, Apache, MySQL, PHP (LAMP) application architectures

Azure Database for MariaDB
• An implementation of the MariaDB Community Edition database management system adapted to run in Azure
• Compatibility with Oracle Database

Azure Database for PostgreSQL
• Database service in the Microsoft cloud based on the PostgreSQL Community Edition database engine
• Hybrid relational and object storage



Lab: Provision Azure relational database services

https://aka.ms/dp900-sql-lab



2: Knowledge check

Which deployment option offers the best compatibility when migrating an existing SQL Server on-premises solution?
❑ Azure SQL Database (single database)
❑ Azure SQL Database (elastic pool)
❑ Azure SQL Managed Instance

Which of the following statements is true about Azure SQL Database?


❑ Most database maintenance tasks are automated
❑ You must purchase a SQL Server license
❑ It can only support one database

Which database service is the simplest option for migrating a LAMP application to Azure?
❑ Azure SQL Managed Instance
❑ Azure Database for MySQL
❑ Azure Database for PostgreSQL

