Welcome to Scribd!

Exercise 8

Uploaded by

0% found this document useful (0 votes)

3 views1 page

The document provides instructions for a Spark exercise to be completed individually. It outlines installing Spark locally, downloading lab files and datasets, examining Spark versions and configurations through the web UI, running a sample code to load diabetes data from a CSV file into a DataFrame and analyzing the web UI tabs. Students are asked to create a 3 page maximum Word report answering exercise questions and including screenshots and diagrams.

Original Description:

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

3 views1 page

Exercise 8

Uploaded by

aditi13goyal

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 1

Search inside document

BAN 5753, Big Data & Spark Exercise (10 Points)

You must do it alone (it is not a group activity)

1. Install Spark setup on local machine – Standalone cluster installation

2. Download the lab files - notebooks and datasets to the desired location(Documents or
Downloads ); paths that you can access via read/write operations
3. Examine the Spark versions, configurations and Web UI using the below screenshot

4. Run the below command and examine the spark web UI different tabs and share your
findings.

//Modify the path based on the file location

scala> val rawDF=spark.read.option("inferSchema","true") .option("header","true").csv("/Users

/Downloads/spark_modules/Lab/data/diabetes.csv")
rawDF: org.apache.spark.sql.DataFrame = [Pregnancies: int, Glucose: int ... 7 more fields]

Deliverables:
As you complete the exercise, create a short report in Microsoft Word (max 3 pages) and
in this report answer the questions in the exercise description. Copy and paste supporting
documents/diagrams/screenshots as needed to justify your answer. Make sure you print
your name, section number, student ID# on the report and turn-in the report as
communicated by your instructor.

Snowflake Free Lab Guide
Document58 pages
Snowflake Free Lab Guide
Rudransh Sharma
50% (4)
Learning Apache Spark With Python
Document10 pages
Learning Apache Spark With Python
dalalroshan
No ratings yet
Top Answers To Splunk Interview Questions
Document6 pages
Top Answers To Splunk Interview Questions
Ejaz Alam
100% (2)
Running Winsteps With SAS
Document18 pages
Running Winsteps With SAS
Ezekiel F. Co
No ratings yet
Solaris 11 Advanced System Administrator
Document106 pages
Solaris 11 Advanced System Administrator
dokta
No ratings yet
DataGrokr Technical Assignment - Data Engineering - Internshala
Document5 pages
DataGrokr Technical Assignment - Data Engineering - Internshala
Vinutha M
No ratings yet
BDALab Assn5
Document16 pages
BDALab Assn5
Deepti Agrawal
No ratings yet
Final Manual Practical 1 DE
Document6 pages
Final Manual Practical 1 DE
Mudabbir
No ratings yet
Hands On Lab Guide For Data Lake PDF
Document19 pages
Hands On Lab Guide For Data Lake PDF
PrasadVallura
No ratings yet
Struts by Sarangapani:: Birlasoft LTD
Document16 pages
Struts by Sarangapani:: Birlasoft LTD
rajivdhoundiyal
No ratings yet
Practical Assignment - :: Distributed Data Processing With Apache Spark
Document3 pages
Practical Assignment - :: Distributed Data Processing With Apache Spark
Teshome Mulugeta
No ratings yet
Filg 8
Document631 pages
Filg 8
Katraj Nawaz
No ratings yet
" : " "Genre - STR, Directed - by - STR" "0" "On"
Document11 pages
" : " "Genre - STR, Directed - by - STR" "0" "On"
Lokesh Yadav Narra
No ratings yet
HW 6
Document4 pages
HW 6
Hariharan Shankar
No ratings yet
Computer Science & Engineering: Department of
Document6 pages
Computer Science & Engineering: Department of
Mriganka shekher Mukhopadhyay
No ratings yet
Apache Spark Tutorial (Fast Data Architecture Series) - DZone Big Data
Document5 pages
Apache Spark Tutorial (Fast Data Architecture Series) - DZone Big Data
Ricardo Cardoso
No ratings yet
Dm&pa Lab Manual
Document68 pages
Dm&pa Lab Manual
noamanaijaz38
No ratings yet
Connecting Interbase To Java Applications: Getting and Installing The Driver
Document5 pages
Connecting Interbase To Java Applications: Getting and Installing The Driver
duque_604
No ratings yet
Features of Apache Spark
Document7 pages
Features of Apache Spark
Sailesh Chauhan
No ratings yet
Oracle® Goldengate: Tutorial For Oracle To Oracle
Document18 pages
Oracle® Goldengate: Tutorial For Oracle To Oracle
prajwaldba
No ratings yet
CASIO FX-602P Simulator: User Manual
Document9 pages
CASIO FX-602P Simulator: User Manual
Derrick Boza Carbonelli
No ratings yet
Hands-On Lab Guide For: Virtual Zero-To-Snowflake
Document63 pages
Hands-On Lab Guide For: Virtual Zero-To-Snowflake
nischal vadari
No ratings yet
Tutorials Point, Simply Easy Learning: Apache Struts 2 Tutorial
Document42 pages
Tutorials Point, Simply Easy Learning: Apache Struts 2 Tutorial
Srividhya Ramakrishnan
No ratings yet
Bulk SQL Injection Using Burp To Sqlmap
Document8 pages
Bulk SQL Injection Using Burp To Sqlmap
ntoy sableng
No ratings yet
Azuredatabricks New
Document22 pages
Azuredatabricks New
Madhavi Kareddy
No ratings yet
ADBAFile Vinayak 90696403117 7i789
Document25 pages
ADBAFile Vinayak 90696403117 7i789
heysiri0804
No ratings yet
Apache Spark
Document6 pages
Apache Spark
Tam
No ratings yet
Spark Interview Questions PDF 2
Document19 pages
Spark Interview Questions PDF 2
Varun
No ratings yet
04A - Working With Datastores - Jupyter Notebook PDF
Document11 pages
04A - Working With Datastores - Jupyter Notebook PDF
jh
No ratings yet
UGF9377 Lopez ADF Tips N Tricks
Document53 pages
UGF9377 Lopez ADF Tips N Tricks
Danica Gajevic
No ratings yet
D17108GC21 Setup
Document10 pages
D17108GC21 Setup
Mac Millan
No ratings yet
Installing Spark On A Windows PC: Ukdataservice - Ac.uk
Document15 pages
Installing Spark On A Windows PC: Ukdataservice - Ac.uk
ivan patricio ayala ayala
No ratings yet
Extract Essbase Outline To SQL Database
Document21 pages
Extract Essbase Outline To SQL Database
hoola81
No ratings yet
Week 4 - Automated SQL Injection
Document6 pages
Week 4 - Automated SQL Injection
Paul Crane
No ratings yet
SQL Injection
Document10 pages
SQL Injection
acernitro88588
No ratings yet
Splunk Quick Reference
Document3 pages
Splunk Quick Reference
Vasudeva Nayak
No ratings yet
Lab - Batch Data Ingestion With DMS - Instructor Setup
Document16 pages
Lab - Batch Data Ingestion With DMS - Instructor Setup
Job Llanos Montaldo
No ratings yet
Test: Sun Systems Fault Analysis Workshop: Online Assessment
Document21 pages
Test: Sun Systems Fault Analysis Workshop: Online Assessment
ulrich nobel kouamé
No ratings yet
Cs572 HW Nutch
Document7 pages
Cs572 HW Nutch
Easo Thomas
No ratings yet
Jaq L Exercise 2
Document10 pages
Jaq L Exercise 2
Kanak Tripathi
No ratings yet
Labsheet1 Updated
Document11 pages
Labsheet1 Updated
deftsoftp
No ratings yet
InterPSS Editor User Guide - English - Edition
Document104 pages
InterPSS Editor User Guide - English - Edition
scutparis
No ratings yet
Spark Jobs Stage Shuffle Task Slots 1686774188
Document3 pages
Spark Jobs Stage Shuffle Task Slots 1686774188
chandu.sasidhar
No ratings yet
Apache Struts: Processing Requests With Action Objects
Document25 pages
Apache Struts: Processing Requests With Action Objects
yaagnti23
No ratings yet
Installation and Config Guide
Document21 pages
Installation and Config Guide
habibi722847
No ratings yet
VMTN - Virtual Appliances - Alfresco Community Edition
Document2 pages
VMTN - Virtual Appliances - Alfresco Community Edition
Filip Miclea
No ratings yet
Spark Runtime Architecture Overview
Document5 pages
Spark Runtime Architecture Overview
kolodacool
No ratings yet
Assignment 4 - WK4 Midterm Project
Document3 pages
Assignment 4 - WK4 Midterm Project
Nehemiah Kiplangat
No ratings yet
Spark Ops Final
Document45 pages
Spark Ops Final
jeanluc_orsai185
No ratings yet
Oracle: Question & Answers
Document15 pages
Oracle: Question & Answers
Javier Solis
No ratings yet
Spart Part 2
Document44 pages
Spart Part 2
Aleena Nasir
100% (1)
A4 Resume Parser
Document1 page
A4 Resume Parser
Munthitra Thadthapong
No ratings yet
Labsheet1 Updated
Document11 pages
Labsheet1 Updated
deftsoftp
No ratings yet
Create Database Oracle Database
Document18 pages
Create Database Oracle Database
Backhamla Michivid
No ratings yet
Performacne Tuning Vol2-2
Document10 pages
Performacne Tuning Vol2-2
ChristianQuirozPlefke
No ratings yet
Apex 4.2 Installation With Oracle
Document3 pages
Apex 4.2 Installation With Oracle
Marwan Saad
No ratings yet
Oracle University Oracle Database 11g: Administration Workshop I
Document10 pages
Oracle University Oracle Database 11g: Administration Workshop I
perhacker
No ratings yet
Name: Wable Snehal Mahesh Subject:-Scala & Spark Div: - Mba Ii Roll No: - 57 Guidence Name: - Prof. Archana Suryawanshi - Kadam
Document11 pages
Name: Wable Snehal Mahesh Subject:-Scala & Spark Div: - Mba Ii Roll No: - 57 Guidence Name: - Prof. Archana Suryawanshi - Kadam
Snehal Mahesh Wable
No ratings yet
Introducing .NET for Apache Spark: Distributed Processing for Massive Datasets
From Everand
Introducing .NET for Apache Spark: Distributed Processing for Massive Datasets
Ed Elliott
No ratings yet
Oracle Database Transactions and Locking Revealed: Building High Performance Through Concurrency
From Everand
Oracle Database Transactions and Locking Revealed: Building High Performance Through Concurrency
Darl Kuhn
No ratings yet