Data Frame Spark SQL
2 Hands on
- Spark SQL Foundations
- DataFrame Reader
- DataFrame Writer
- Spark Catalog
- SQL Joins
- Practice
01
Introduction to DataFrame & Spark SQL
▪ Provides an API, and also lets you query data directly with SQL queries
PySpark
DataFrame (1)
"A DataFrame is an immutable, distributed collection of data that is organized into rows, where each one consists of a set of columns and each column has a name and an associated type"
DataFrame (2)
DataFrame (3)
Catalyst Optimizer
Q&A
02
Hands-on: Spark SQL & DataFrame APIs
Hands on