Welcome to Scribd!

Revision Notes v1

Uploaded by

0% found this document useful (0 votes)

6 views14 pages

This document provides an overview and agenda for revision notes for the Databricks Certified Data Engineer Associate Exam. It covers key topics like Delta Lake, Databricks architecture, cluster types, magic commands, views, managed vs unmanaged tables, and the Medallion architecture. The notes are intended to be reviewed 1-2 hours before the exam and are part of a larger Udemy course for exam preparation.

Original Description:

Original Title

Revision_notes_v1

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

6 views14 pages

Revision Notes v1

Uploaded by

Vinay

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 14

Search inside document

Databricks Certified Data Engineer

Associate Exam Resource

QUICK REVISION
NOTES
By: Certification Champs

Visit here for full Udemy Course

AGENDA

03 Introduction 11 Views in Databricks

04 Delta Lake 12 Managed vs Unmanaged Tables

05 Databricks Architecture 13 Medallion Architecture

08 Magic commands

09 Clusters

10 Types of Clusters
Introduction
These notes are part of our Udemy Course. Visit
here to access our Course
Reading these notes twice is suggested 1-2 hours
before the exam
You may post your queries in the Q&A section of
the course, if you have any doubts.
Features of Delta Lake

ACID Unifies batch

Open
and streaming
compliant Source data
Stores data Time
in Parquet
travel
format
Databricks Architecture
Databricks Architecture consists of two parts

CONTROL Controlled by Customer

PLANE
Controlled by Databricks DATA
Visit here for full Udemy Course
PLANE
DATABRICKS CUSTOMER
CLOUD ACCOUNT CLOUD ACCOUNT
Web Application
Repositories and
Notebooks
Data
Job Scheduling

Cluster Management
ONLY DATA IS STORED IN CUSTOMER CLOUD
ACCOUNT AND IS CONTROLLED BY THE
CUSTOMER
WHILE REST OF THE THINGS LIKE CLUSTERS
AND NOTEBOOKS ARE STORED IN DATABRICKS
CLOUD ACCOUNT AND IS CONTROLLED BY
DATABRICKS
Visit here for full Udemy Course
Magic commands in Notebooks!
Language specific magic commands supported in Databricks

%python %scala %r %sql

Changing the default language of a notebook will result in addition of
%{previous_default_language} at the start of each existing cell of the notebook.
In other words, changing the default language of a notebook from Python to Scala will result in
addition of %python at the start of each existing cell of the notebook.

Some other magic commands include %md - Used for Markdown language and
%run - Used to run a notebook from another notebook
Visit here for full Udemy Course
Clusters In
Databricks
CLUSTERS CAN BE CREATED BY SELECTING
COMPUTE FROM THE LEFT SIDE NAVIGATION BAR

COMPUTE PAGE CAN ALSO BE USED TO UPDATE,

START, STOP OR TERMINATE A CLUSTER
Types of Clusters
All-purpose Cluster Job Cluster

An all-purpose cluster can be created using UI, A job cluster cannot be created manually. It is
CLI or REST API automatically created as you submit a job

It can be restarted or terminated as per the It is terminated automatically when the job
need ends and CANNOT be restatred

It can be shared between multiple users It CANNOT be shared between users

Used for running interactive jobs Used for running automated jobs

Visit here for full Udemy Course

Noteworthy things about
Views! Stored in temporary
Database named global_temp
Accessed by using
Persisted through multiple
global_temp.database_name
sessions

Tied to a
CREATE GLOBAL
CREATE VIEW session, CREATE TEMPORARY
VIEW view_name TEMPORARY VIEW
view_name CANNOT be view_name
accessed when
the session
ends

VIEWS DON'T HAVE PHYSICAL EXISTENCE

Visit here for full Udemy Course
Managed vs Unmanaged tables
Managed Table External/Unmanaged Table

Both data and metadata are controlled by Only metadata is controlled by Databricks.
Databricks Data is controlled by the user

If you drop a managed table, data is also If you drop an unmanaged(external) table, the
deleted data remains intact

The data for a managed table remains at While creating an external table, LOCATION
dbfs:/user/hive/warehouse/{db_name}.db/ keyword must be used

CREATE TABLE table_name (col1 INT, CREATE TABLE table_name (col1 INT,
col2 STRING) col2 STRING) LOCATION '/path/'
Medallion Architecture
ALSO KNOWN AS MULTI-HOP ARCHITECTURE

Bronze SILVER GOLD

JOINS AGGREGATIONS
REPORTING
Streaming
Data

Batch
Data DASHBOARDING

Raw Data Enriched data Aggregated data to

with timestamps be used for
from different columns converted to Reporting and
sources human readable form Dashboarding
Visit here for full Udemy Course
THANK YOU!

1409 Thiruvasagam Lyrics Tamil Thiruvasagam
Document133 pages
1409 Thiruvasagam Lyrics Tamil Thiruvasagam
Sivasangar Seenivasagam
67% (3)
De Mod 1 Get Started With Databricks Data Science and Engineering Workspace
Document27 pages
De Mod 1 Get Started With Databricks Data Science and Engineering Workspace
Jaya Bharathi
No ratings yet
Exstream01PrepApps PDF
Document147 pages
Exstream01PrepApps PDF
Aymen EL ARBI
No ratings yet
Logic and Discrete Mathematics A Computer Science - 1
Document11 pages
Logic and Discrete Mathematics A Computer Science - 1
DeadPool Pool
No ratings yet
Amdocs - Convergent Billing
Document6 pages
Amdocs - Convergent Billing
Adil Ahmad
No ratings yet
The Following Syllabus Will Be Covered in This Online Course of Web Design and Development
Document7 pages
The Following Syllabus Will Be Covered in This Online Course of Web Design and Development
pir zada
No ratings yet
Cloudera Developer Training Exercise Manual
Document131 pages
Cloudera Developer Training Exercise Manual
Khusaila Toufali Lapaz
No ratings yet
Dbms Lab Manual - 2013 - Regulation
Document243 pages
Dbms Lab Manual - 2013 - Regulation
Omprakash D
No ratings yet
Active Directory Managemnt Using PowerShell Sec01 05
Document107 pages
Active Directory Managemnt Using PowerShell Sec01 05
ovodf
100% (1)
Data Engineering With Databricks Da
Document232 pages
Data Engineering With Databricks Da
vitlesh.sf
100% (1)
Top 17 Active Directory Interview Questions
Document4 pages
Top 17 Active Directory Interview Questions
achayan1989
No ratings yet
Oracle Vs Nucleus Vs Sybase IQ Vs Netezza
Document18 pages
Oracle Vs Nucleus Vs Sybase IQ Vs Netezza
enselsoftware.com
100% (4)
DBA Tasks - Imp-Notes
Document30 pages
DBA Tasks - Imp-Notes
Gautam Trivedi
No ratings yet
Databricks Interview Question & Answers
Document10 pages
Databricks Interview Question & Answers
ranjankaul
No ratings yet
Databricks 1667066239
Document10 pages
Databricks 1667066239
Adithya Vardhan Reddy Kothwal Patel
No ratings yet
D77758GC10 Les 03
Document40 pages
D77758GC10 Les 03
pavan0927
No ratings yet
Dev Ops
Document4 pages
Dev Ops
Sai Phanidhar Varanasi
No ratings yet
Question & Answers of Windows Server 2012
Document32 pages
Question & Answers of Windows Server 2012
ksant77
100% (2)
DataBase Administration
Document50 pages
DataBase Administration
Help me get to 10k subscribers
No ratings yet
CMDB Admin
Document140 pages
CMDB Admin
Fireball India
No ratings yet
Snowflake Certification Syllabus
Document4 pages
Snowflake Certification Syllabus
WeAre1
No ratings yet
Oracle Database 12c R2: Administration Workshop Ed 3: Duration
Document6 pages
Oracle Database 12c R2: Administration Workshop Ed 3: Duration
Bugz Binny
100% (1)
8 DBA Tasks For Azure SQL: What's Different From On-Prem
Document33 pages
8 DBA Tasks For Azure SQL: What's Different From On-Prem
Joji K
100% (1)
Construction of A Technical Glossary in English For The Occupational Area
Document6 pages
Construction of A Technical Glossary in English For The Occupational Area
César Trujillo
No ratings yet
Module 05 Implement Infrastructure As A Service Solutions
Document49 pages
Module 05 Implement Infrastructure As A Service Solutions
Xulfee
No ratings yet
Azure Data Factory Interview Questions and Answer
Document12 pages
Azure Data Factory Interview Questions and Answer
Madhumitha Podishetty
No ratings yet
Doctrine
Document24 pages
Doctrine
shambalic
No ratings yet
DBA Interview Questions and Answers
Document2 pages
DBA Interview Questions and Answers
Narmada Devi
No ratings yet
Advanced Admin Pdms
Document78 pages
Advanced Admin Pdms
chandru683
100% (1)
Technical Skills Enhancement - PL/SQL Best Practices Oracle Architecture
Document35 pages
Technical Skills Enhancement - PL/SQL Best Practices Oracle Architecture
Huynh Sy Nguyen
No ratings yet
Top 17 Active Directory Interview Questions & Answers2
Document9 pages
Top 17 Active Directory Interview Questions & Answers2
rakesh ranjan
No ratings yet
Oci Database Migration Service End To End Online Migration Tutorial 1
Document35 pages
Oci Database Migration Service End To End Online Migration Tutorial 1
amornchaiw2603
No ratings yet
SQL Server Database Mirroring Concept
Document75 pages
SQL Server Database Mirroring Concept
nithinvn
No ratings yet
Managing The Application Lifecycle With MSDN
Document28 pages
Managing The Application Lifecycle With MSDN
Hamza Sağ
No ratings yet
Oracle Database 12c Administration Workshop Ed 2
Document6 pages
Oracle Database 12c Administration Workshop Ed 2
Adonis Prince Nani
No ratings yet
D96069GC10 1001 Us
Document3 pages
D96069GC10 1001 Us
William Lee
No ratings yet
Oracle Database 12c R2: Administration Workshop Ed 3: Duration
Document6 pages
Oracle Database 12c R2: Administration Workshop Ed 3: Duration
jackomito
100% (1)
Les 02
Document38 pages
Les 02
Mohammad Nizamuddin
No ratings yet
Enterprise Admin Group Domain Admin Group
Document5 pages
Enterprise Admin Group Domain Admin Group
Khizer
No ratings yet
Dav Institute of Engineering & Technology, Jalandhar
Document66 pages
Dav Institute of Engineering & Technology, Jalandhar
Karan Gupta
No ratings yet
Data Base Administration Level IV: Shashemene Poly Technique College
Document10 pages
Data Base Administration Level IV: Shashemene Poly Technique College
Mahdi Zeyn
No ratings yet
Unit IV - Database
Document18 pages
Unit IV - Database
kunalvarad75
No ratings yet
DBA Skills
Document19 pages
DBA Skills
Nath Alordiah
No ratings yet
Using Ola Hallengrens SQL Maintenance Scripts PDF
Document28 pages
Using Ola Hallengrens SQL Maintenance Scripts PDF
Hana Ibisevic
No ratings yet
Notes: (Noteshub - Co.In) Cse: Advance Database Management Systems (Adbms)
Document73 pages
Notes: (Noteshub - Co.In) Cse: Advance Database Management Systems (Adbms)
Nikhil Tiwari
No ratings yet
Viva Voce Questions
Document6 pages
Viva Voce Questions
Kumar Varun
No ratings yet
Stored Procedure and Its Purpose Ans Advantages
Document7 pages
Stored Procedure and Its Purpose Ans Advantages
Venkateswara Rao
No ratings yet
Google Cloud 2
Document27 pages
Google Cloud 2
AKhil
No ratings yet
Backend Security Project DB2 Hardening (12!03!23)
Document6 pages
Backend Security Project DB2 Hardening (12!03!23)
hugo_obis
No ratings yet
Azure Terraform Pipeline - DevOps
Document119 pages
Azure Terraform Pipeline - DevOps
amit kaishver
No ratings yet
Docu98680 - DD OS, PowerProtect DDMC, and PowerProtect DDVE 6.1.2.70 Release Notes
Document66 pages
Docu98680 - DD OS, PowerProtect DDMC, and PowerProtect DDVE 6.1.2.70 Release Notes
jerome Perriguey
No ratings yet
Snowflake Tables
Document4 pages
Snowflake Tables
pandahomie81
No ratings yet
Oracle Database Cloud For Oracle DBAs (OCI-C) - Oracle University
Document4 pages
Oracle Database Cloud For Oracle DBAs (OCI-C) - Oracle University
vineet
No ratings yet
Day - 2 Tuning Diagnostic
Document34 pages
Day - 2 Tuning Diagnostic
Vikas Kumar
No ratings yet
Top 17 Active Directory Interview Questions
Document4 pages
Top 17 Active Directory Interview Questions
Bharath
No ratings yet
Silo - Tips - Security Concepts in Oracle Multitenant o R A C L e W H I T e P A P e R J A N U A R y
Document28 pages
Silo - Tips - Security Concepts in Oracle Multitenant o R A C L e W H I T e P A P e R J A N U A R y
Fernando
No ratings yet
Logical Standby Database For Reporting: Mark Bole Nocoug Nov 10, 2005
Document40 pages
Logical Standby Database For Reporting: Mark Bole Nocoug Nov 10, 2005
saravanand1983
No ratings yet
SQL Server - Installation - Step by Step
Document38 pages
SQL Server - Installation - Step by Step
Azizi Sungita
No ratings yet
Nur Aqilah Binti Ja'afar - 2021878208 - Jim246 4a
Document10 pages
Nur Aqilah Binti Ja'afar - 2021878208 - Jim246 4a
Nur Aqilah Ja'afar
No ratings yet
Azure Data Engineer Interview Questions and Answers
Document7 pages
Azure Data Engineer Interview Questions and Answers
Aparna Tatavarthy
No ratings yet
15) Azure AD and IAM
Document33 pages
15) Azure AD and IAM
hanuman challisa
No ratings yet
Oracle Tips and Tricks
Document28 pages
Oracle Tips and Tricks
anand
No ratings yet
Monitoring and Administering Database
Document39 pages
Monitoring and Administering Database
Amanuel Kassa
No ratings yet
A Summer Training Presentation On Oracle 10G and
Document24 pages
A Summer Training Presentation On Oracle 10G and
Piyush Jain
No ratings yet
Introduction to Oracle Database Administration
From Everand
Introduction to Oracle Database Administration
Ying Wang
Rating: 5 out of 5 stars
5/5 (1)
Princom Intro
Document44 pages
Princom Intro
Richster Lofranco
No ratings yet
TEMP6
Document2 pages
TEMP6
aazath kalam
No ratings yet
DB2 Problem Determination Using Db2top Utility
Document40 pages
DB2 Problem Determination Using Db2top Utility
pkd007
100% (2)
Java Script For Loops
Document8 pages
Java Script For Loops
mihaelahristea
No ratings yet
Hamming Code For Double Bit Error Detection & Rectification Capability by Using Cadence Tool
Document7 pages
Hamming Code For Double Bit Error Detection & Rectification Capability by Using Cadence Tool
Prabhakar
No ratings yet
Data Modeling and Entity Relationship Diagram (ERD)
Document5 pages
Data Modeling and Entity Relationship Diagram (ERD)
Muhammad Ali Masood
No ratings yet
Constructor in Java
Document6 pages
Constructor in Java
qwermnb
No ratings yet
TD04803001E - Visual Designer Driver List
Document8 pages
TD04803001E - Visual Designer Driver List
rogermantilla08
No ratings yet
System Requirements Comparison Chart (Continued) : Product Specifications
Document2 pages
System Requirements Comparison Chart (Continued) : Product Specifications
Masykur Majid
No ratings yet
AVR Project Book: D I Y Abdul Maalik Khan
Document71 pages
AVR Project Book: D I Y Abdul Maalik Khan
Nguyen Duong
100% (5)
HC 06 Manual
Document3 pages
HC 06 Manual
Anonymous lSmZWoJt
No ratings yet
Overview of Order Management Suite-1
Document94 pages
Overview of Order Management Suite-1
bommakanti.shiva
No ratings yet
Bongaigaon - 488
Document513 pages
Bongaigaon - 488
Ramesh Babu
No ratings yet
A PROJECT REPORT by Justin Mathew
Document61 pages
A PROJECT REPORT by Justin Mathew
Rakesh Kumar
No ratings yet
LG 22ld350-cb Chassis Lc01a
Document34 pages
LG 22ld350-cb Chassis Lc01a
Juan Pablo Montoya Cardenas
No ratings yet
Final Paper
Document139 pages
Final Paper
Jericho
No ratings yet
The Essential of Instructional Design
Document18 pages
The Essential of Instructional Design
Eka Kutateladze
No ratings yet
Subsea PLEM & PLET - Theory & Application PDF
Document127 pages
Subsea PLEM & PLET - Theory & Application PDF
Paolo Bertolli
No ratings yet
Channel Capacity and Models
Document30 pages
Channel Capacity and Models
Ashley Seesurun
No ratings yet
Innovation and Design in The Age of Artificial Intelligence
Document4 pages
Innovation and Design in The Age of Artificial Intelligence
Smamda Agung
No ratings yet
2020 - Employ 6D-BIM Model Features For Buildings Sustainability Assessment
Document12 pages
2020 - Employ 6D-BIM Model Features For Buildings Sustainability Assessment
Ali J. Lubbad
No ratings yet
Format of Synopsis and Report (Project)
Document9 pages
Format of Synopsis and Report (Project)
Shweta Sansaniwal
No ratings yet
Charles Schwab Presentation Original by K.Studioso and T.Wilson
Document11 pages
Charles Schwab Presentation Original by K.Studioso and T.Wilson
Kristina Frost
No ratings yet
Arduino Based Underground Cable Fault Detector (Single Phase)
Document51 pages
Arduino Based Underground Cable Fault Detector (Single Phase)
Irum
96% (26)
Fullcalendar Into Phpmakercode
Document6 pages
Fullcalendar Into Phpmakercode
Sinan Yıldız
No ratings yet