TOOLS
GROUP 3
KDU | FOC | DSBA
W.A.C.Imasha | M.V.D.Nimsliu | B.K.T.Dhananjana
What is ggplot2?
• ggplot2 is an advanced data visualization package for the R programming language.
ggplot2
Plot = Data + Aesthetics + Geometry
Grammar of Graphics
• Data: a data frame
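The formula Plot = Data + Aesthetics + Geometry can be illustrated with a small toy model. This is only a sketch of the idea in Python, not ggplot2's API: the `Plot` class, its `marks` method, and the sample `cars` rows are all hypothetical and made up for illustration. In R, the equivalent ggplot2 call would look like `ggplot(cars, aes(x = weight, y = mpg)) + geom_point()`.

```python
from dataclasses import dataclass


@dataclass
class Plot:
    """Toy model of the grammar of graphics (hypothetical, not ggplot2's API)."""
    data: list        # data: rows of a data-frame-like table
    aesthetics: dict  # aesthetics: visual channel -> column name
    geometry: str     # geometry: the kind of mark to draw, e.g. "point"

    def marks(self):
        """Resolve every row into the values one drawn mark would use."""
        return [
            {
                "geom": self.geometry,
                **{channel: row[column]
                   for channel, column in self.aesthetics.items()},
            }
            for row in self.data
        ]


# Made-up sample rows standing in for a data frame.
cars = [
    {"weight": 2.6, "mpg": 21.0},
    {"weight": 3.2, "mpg": 18.7},
]

plot = Plot(data=cars, aesthetics={"x": "weight", "y": "mpg"}, geometry="point")
for mark in plot.marks():
    print(mark)
```

Swapping `geometry` or remapping a column in `aesthetics` changes the plot without touching the data, which is the main point of the grammar.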
DataRobot
[Diagram: DataRobot offers Spark MLlib, Python, H2O, and open source libraries]
Line Plot
Polar Plot
Image Plot
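The three plot types listed above can be sketched in Matplotlib roughly as follows, assuming they refer to Matplotlib's line, polar, and image plots. The data, variable names, and output file name are made up for illustration.

```python
import math

import matplotlib
matplotlib.use("Agg")  # non-interactive backend, so no display is needed
import matplotlib.pyplot as plt

# Made-up sample data for the sketch.
xs = [i / 10 for i in range(63)]                      # roughly 0 .. 2*pi
ys = [math.sin(x) for x in xs]
grid = [[math.sin(r / 3) * math.cos(c / 3) for c in range(20)]
        for r in range(20)]

fig = plt.figure(figsize=(12, 4))

# Line plot: y against x on ordinary Cartesian axes.
ax_line = fig.add_subplot(1, 3, 1)
ax_line.plot(xs, ys)
ax_line.set_title("Line Plot")

# Polar plot: the same call, but on axes with a polar projection
# (first argument is the angle, second is the radius).
ax_polar = fig.add_subplot(1, 3, 2, projection="polar")
ax_polar.plot(xs, [1 + 0.5 * y for y in ys])
ax_polar.set_title("Polar Plot")

# Image plot: a 2D grid of values rendered as colored pixels.
ax_image = fig.add_subplot(1, 3, 3)
ax_image.imshow(grid, cmap="viridis")
ax_image.set_title("Image Plot")

fig.savefig("plot_types.png")
```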
Toolkits used in Matplotlib
●Mapping toolkits
Basemap
Cartopy
●General Toolkits
Mplot3D
AxesGrid
MplDataCursor
GTK Tools
Excel Tools
Natgrid
●High level plotting
Seaborn
Holoviews
Ggplot
Prettyplotlib
Introduction to Apache Hadoop
• Hadoop is an open-source software framework that is part of the Apache suite of applications.
• Hadoop was created by Doug Cutting and Mike Cafarella in 2005.
• It is primarily used for storing and analyzing large data sets across clusters of computers.
Uses for Hadoop?
• Low-cost data warehouse
• Internet of Things
• Data lake
• Discovery and analysis
Hadoop Components
• MapReduce
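MapReduce, named among the Hadoop components above, splits a job into a map step that emits key/value pairs and a reduce step that combines the values for each key. The following is a minimal single-process sketch of that idea as a word count. It does not use Hadoop's API, and the function names and sample documents are assumptions made for illustration.

```python
from collections import defaultdict


def map_phase(document):
    """Map: emit a (word, 1) pair for every word in one input record."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle(pairs):
    """Shuffle: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: combine the values for one key into a single result."""
    return key, sum(values)


# Made-up input records; on a real cluster these would be file blocks
# spread across many machines.
documents = ["big data tools", "big data analysis"]

pairs = [pair for doc in documents for pair in map_phase(doc)]
counts = dict(reduce_phase(key, values)
              for key, values in shuffle(pairs).items())
print(counts)  # {'big': 2, 'data': 2, 'tools': 1, 'analysis': 1}
```

Because each map call looks at only one record and each reduce call at only one key, both steps can run in parallel on different machines, which is what lets the model scale.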
Advantages and Disadvantages
• Advantages • Disadvantages