Professional Documents
Culture Documents
data analytics
Data visualization
▲----
▼----
▲----
• Extract, Transform, and Load (ETL) • Denormalized relational data • Semantic models for analytical • Reports
or Extract, Load, and Transform storage in a data warehouse entities • Charts
(ELT) orchestration • •
Semi-structured file storage Often in the form of aggregated • Dashboards
• Distributed processing to cleanse in a data lake cubes that summarize numeric
and restructure data at scale values across one or more
• Batch and real-time data dimensions
processing
Pipeline
Input dataset Activities Output dataset
Linked services
Use for a single, unified large-scale Use to leverage Databricks skills Use when you need to support
analytical solution on Azure and for cloud portability multiple open-source platforms
In this lab, you will provision an Azure Synapse Analytics workspace, and use it to ingest
and process data
What must you define to implement a pipeline that reads data from Azure Blob Storage?
A linked service for your Azure Blob Storage account
Which open-source distributed processing engine does Azure Synapse Analytics include?
Apache Hadoop
Apache Spark
Apache Storm
Data is collected and processed at regular intervals Data is processed in (near) real-time as it arrives
In this lab, you will use Azure Stream Analytics to process a real-time data stream
Which service would you use to continually capture data from an IoT Hub, aggregate it over
temporal periods, and store results in Azure SQL Database?
Azure Cosmos DB
Azure Storage
Which language would you use to query real-time log data in Azure Synapse Data Explorer?
SQL
Python
KQL
Product
Sales (fact)
Key TimeKey ProductKey CustomerKey Quantity Revenue ∑
er e
om lic
1 01012022 1 1 1 2.99
st ir A
Cu e Sam
2 01012022 2 1 2 6.98
3 02012022 1 2 2 5.98 Jan Feb Mar Apr May …
Jo
Model aggregates measures Time
Time (dimension) Measures at each hierarchy level
Key Year Month Day WeekDay
Year Month Day Revenue
01012022 2022 Jan 1 Sat
2022 8221.48
02012022 2022 Jan 2 Sun
Jan 574.86
1 9.97
Hierarchy 2 5.98
… …
In this lab, you will use Power BI Desktop to create a data model and a report
What should you define in your data model to enable drill-up/down analysis?
A measure
A hierarchy
A relationship
Which kind of visualization should you use to analyze pass rates for multiple exams over time?
A pie chart
A scatter plot
A line chart
To review what you've learned and do additional labs, review the Microsoft Learn
modules for this course: