ISL76-BDA Lab Syllabus

Uploaded by

Satish B basapur

0% found this document useful (0 votes)

8 views2 pages

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

8 views2 pages

ISL76-BDA Lab Syllabus

Uploaded by

Satish B basapur

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 2

Search inside document

Sub Title : BIG DATA AND ANALYTICS LAB

Sub Code:ISL76 No of Credits : 1:0:1.5 No. of Lecture Hours/Week : 03

Exam Duration : Exam Marks :CIE +Assignment + SEE = 45 + 5 + 50 =100
3hours

Course Objectives:
1. To understand the concept of Big data with hands on.
2. Understand installation of various Big data tools under Hadoop.
3. To apply Hadoop concepts to various applications and NoSQL implementation.

I. LIST OF PROGRAMS
1.Start by reviewing HDFS. You will find that its composition is similar to your local Linux
file system. You will use the hadoop fs command when interacting with HDFS.
a. Review the commands available for the Hadoop Distributed File System:
b. Copy file foo.txt from local disk to the user‘s directory in HDFS
c. Get a directory listing of the user‘s home directory in HDFS
d. Get a directory listing of the HDFS root directory
e. Display the contents of the HDFS file user/fred/bar.txt

2. Start by reviewing HDFS. You will find that its composition is similar to your local Linux
file system. You will use the hadoop fs command when interacting with HDFS.
a. Move that file to the local disk, named as baz.txt
b. Create a directory called input under the user‘s home directory
c. Delete the directory input old and all its contents
d. Verify the copy by listing the directory contents in HDFS.

3. Demonstrate word count on an input file using MapReduce program.

4. Using movie ratings data, Develop the queries in Hive for the following-
a. List all the Users who have rated the movies (Users who have rated at least one
movie)
b. List of all the User with the max, min, average ratings they have given against any
movie
c. List all the Movies with the max, min, average ratings given by any user

5. In this program you will use HiveQL to filter and aggregate click data to build facts about
user‘s movie preferences. The query results will be saved in a staging table used to populate
the Oracle Database.
The moveapp_log_json table contains an activity column. Activity states are as follows:

33
 RATE_MOVIE
 COMPLETED_MOVIE
 PAUSE_MOVIE
 START_MOVIE
 BROWSE_MOVIE
 LIST_MOVIE
 SEARCH_MOVIE
 LOGIN
 LOGOUT
 INCOMPLETE_MOVIE.
 PURCHASE_MOVIE
a. Write a query to select only those clicks which correspond to starting, browsing,
completing, or purchasing movies. Use a CASE statement to transform the
RECOMMENDED column into integers where ‗Y‘ is 1 and ‗N‘ is 0. Also, ensure
GENREID is not null. Only include the first 25 rows.
b. Write a query to select the customer ID, movie ID, recommended state and most
recent rating for each movie.

6. The moveapp_log_json table contains an activity column. Activity states are as follows:

 RATE_MOVIE
 COMPLETED_MOVIE
 PAUSE_MOVIE
 START_MOVIE
 BROWSE_MOVIE
 LIST_MOVIE
 SEARCH_MOVIE
 LOGIN
 LOGOUT
 INCOMPLETE_MOVIE.
a. Load the results of the previous two queries into a staging table. First, create the
staging table:
b. Next, load the results of the queries into the staging table.

7. Write R program to:

a. Create two matrices and perform multiplication & division on those matrices.
b. Create a data frame and print the: data frame, structure of data frame and summary of
data frame.
c. Create a Bar chart and sketch the Bar chart by taking months as input & plot it against
revenue. Also, add legend to the chart that includes regions.

II. OPEN ENDED QUESTIONS

1. Installation and Configuration of Hadoop software on stand alone system.

2. Installation and Configuration of Hadoop software on Ubuntu cluster system.
3. Highest temperature year wise using MapReduce.

Signature of Scrutinizer Q.No. Solutions Marks Allotted 5.c)
Document1 page
Signature of Scrutinizer Q.No. Solutions Marks Allotted 5.c)
Satish B basapur
No ratings yet
Signature of Scrutinizer Q.No - Solutions Marks Allotted 2.a) 10M
Document2 pages
Signature of Scrutinizer Q.No - Solutions Marks Allotted 2.a) 10M
Satish B basapur
No ratings yet
Signature of Scrutinizer Q.No. Solutions Marks Allotted 10.a)
Document1 page
Signature of Scrutinizer Q.No. Solutions Marks Allotted 10.a)
Satish B basapur
No ratings yet
Explanation of Any Five Built in Functions of Awk (Substr, Index, Length, Systeme,..etc) With Examples
Document1 page
Explanation of Any Five Built in Functions of Awk (Substr, Index, Length, Systeme,..etc) With Examples
Satish B basapur
No ratings yet
Select Custid From Moviedetail Where Rating Is Not Null Select Custid, Min (Rating), Max (Rating), Avg (Rating) From Moviedetails Group by Custid, Movieid
Document1 page
Select Custid From Moviedetail Where Rating Is Not Null Select Custid, Min (Rating), Max (Rating), Avg (Rating) From Moviedetails Group by Custid, Movieid
Satish B basapur
No ratings yet
Signature of Scrutinizer Q.No. Solutions Marks Allotted: Subject Title: UNIX Shell Programming Subject Code: IS34
Document1 page
Signature of Scrutinizer Q.No. Solutions Marks Allotted: Subject Title: UNIX Shell Programming Subject Code: IS34
Satish B basapur
No ratings yet
VII-VIII Syllabus PDF
Document52 pages
VII-VIII Syllabus PDF
Satish B basapur
No ratings yet
V-VI Syllabus PDF
Document53 pages
V-VI Syllabus PDF
Satish B basapur
No ratings yet
Descriptive Answer Type Questions-HVPE Unit:1-5
Document65 pages
Descriptive Answer Type Questions-HVPE Unit:1-5
sunil
No ratings yet
Cse.m-Ii-Managing Big Data (14SCS21) - Solution PDF
Document49 pages
Cse.m-Ii-Managing Big Data (14SCS21) - Solution PDF
Satish B basapur
No ratings yet
DIC's Objectives, Functions and Role in Industrial Development
Document5 pages
DIC's Objectives, Functions and Role in Industrial Development
Satish B basapur
No ratings yet
AIT Syllabus Second
Document42 pages
AIT Syllabus Second
Satish B basapur
No ratings yet
USN IS34: B. E. Degree (Autonomous) Third Semester End Examination (SEE), Dec 2018/jan 2019
Document2 pages
USN IS34: B. E. Degree (Autonomous) Third Semester End Examination (SEE), Dec 2018/jan 2019
Satish B basapur
No ratings yet
Struts Notes
Document211 pages
Struts Notes
VKM2013
No ratings yet
Model Question Paper Template (New 2018)
Document2 pages
Model Question Paper Template (New 2018)
Satish B basapur
No ratings yet
Graduate Exit Survey Feedback for ISE Dept 2014-2018
Document3 pages
Graduate Exit Survey Feedback for ISE Dept 2014-2018
Satish B basapur
No ratings yet
C# Syllabus
Document3 pages
C# Syllabus
Satish B basapur
No ratings yet
Machine Learning and Applications: Assignment
Document2 pages
Machine Learning and Applications: Assignment
Satish B basapur
No ratings yet
The Yellow House: A Memoir (2019 National Book Award Winner)
From Everand
The Yellow House: A Memoir (2019 National Book Award Winner)
Sarah M. Broom
Rating: 4 out of 5 stars
4/5 (98)
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
From Everand
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Mark Manson
Rating: 4 out of 5 stars
4/5 (5794)
Shoe Dog: A Memoir by the Creator of Nike
From Everand
Shoe Dog: A Memoir by the Creator of Nike
Phil Knight
Rating: 4.5 out of 5 stars
4.5/5 (537)
Grit: The Power of Passion and Perseverance
From Everand
Grit: The Power of Passion and Perseverance
Angela Duckworth
Rating: 4 out of 5 stars
4/5 (587)
Never Split the Difference: Negotiating As If Your Life Depended On It
From Everand
Never Split the Difference: Negotiating As If Your Life Depended On It
Chris Voss
Rating: 4.5 out of 5 stars
4.5/5 (838)
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
From Everand
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Margot Lee Shetterly
Rating: 4 out of 5 stars
4/5 (895)
Principles: Life and Work
From Everand
Principles: Life and Work
Ray Dalio
Rating: 4 out of 5 stars
4/5 (599)
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
From Everand
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Ashlee Vance
Rating: 4.5 out of 5 stars
4.5/5 (474)
The Little Book of Hygge: Danish Secrets to Happy Living
From Everand
The Little Book of Hygge: Danish Secrets to Happy Living
Meik Wiking
Rating: 3.5 out of 5 stars
3.5/5 (399)
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
From Everand
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Dave Eggers
Rating: 3.5 out of 5 stars
3.5/5 (231)
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
From Everand
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Gilbert King
Rating: 4.5 out of 5 stars
4.5/5 (266)
Rise of ISIS: A Threat We Can't Ignore
From Everand
Rise of ISIS: A Threat We Can't Ignore
Jay Sekulow
Rating: 3.5 out of 5 stars
3.5/5 (137)
Yes Please
From Everand
Yes Please
Amy Poehler
Rating: 4 out of 5 stars
4/5 (1891)
The Emperor of All Maladies: A Biography of Cancer
From Everand
The Emperor of All Maladies: A Biography of Cancer
Siddhartha Mukherjee
Rating: 4.5 out of 5 stars
4.5/5 (271)
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
From Everand
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Ben Horowitz
Rating: 4.5 out of 5 stars
4.5/5 (344)
On Fire: The (Burning) Case for a Green New Deal
From Everand
On Fire: The (Burning) Case for a Green New Deal
Naomi Klein
Rating: 4 out of 5 stars
4/5 (73)
Team of Rivals: The Political Genius of Abraham Lincoln
From Everand
Team of Rivals: The Political Genius of Abraham Lincoln
Doris Kearns Goodwin
Rating: 4.5 out of 5 stars
4.5/5 (234)
Steve Jobs
From Everand
Steve Jobs
Walter Isaacson
Rating: 4.5 out of 5 stars
4.5/5 (806)
Fear: Trump in the White House
From Everand
Fear: Trump in the White House
Bob Woodward
Rating: 3.5 out of 5 stars
3.5/5 (738)
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
From Everand
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Brene Brown
Rating: 4 out of 5 stars
4/5 (1090)
The Unwinding: An Inner History of the New America
From Everand
The Unwinding: An Inner History of the New America
George Packer
Rating: 4 out of 5 stars
4/5 (45)
Bad Feminist: Essays
From Everand
Bad Feminist: Essays
Roxane Gay
Rating: 4 out of 5 stars
4/5 (1015)
The World Is Flat 3.0: A Brief History of the Twenty-first Century
From Everand
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Thomas L. Friedman
Rating: 3.5 out of 5 stars
3.5/5 (2219)
Angela's Ashes: A Memoir
From Everand
Angela's Ashes: A Memoir
Frank McCourt
Rating: 4.5 out of 5 stars
4.5/5 (440)
John Adams
From Everand
John Adams
David McCullough
Rating: 4.5 out of 5 stars
4.5/5 (2409)
The Glass Castle: A Memoir
From Everand
The Glass Castle: A Memoir
Jeannette Walls
Rating: 4.5 out of 5 stars
4.5/5 (1712)
The Light Between Oceans: A Novel
From Everand
The Light Between Oceans: A Novel
M.L. Stedman
Rating: 4.5 out of 5 stars
4.5/5 (789)
The Outsider: A Novel
From Everand
The Outsider: A Novel
Stephen King
Rating: 4 out of 5 stars
4/5 (1839)
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
From Everand
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Viet Thanh Nguyen
Rating: 4.5 out of 5 stars
4.5/5 (119)
Brooklyn: A Novel
From Everand
Brooklyn: A Novel
Colm Tóibín
Rating: 3.5 out of 5 stars
3.5/5 (1937)
A Man Called Ove: A Novel
From Everand
A Man Called Ove: A Novel
Fredrik Backman
Rating: 4.5 out of 5 stars
4.5/5 (4609)
The Woman in Cabin 10
From Everand
The Woman in Cabin 10
Ruth Ware
Rating: 3.5 out of 5 stars
3.5/5 (2322)
The Perks of Being a Wallflower
From Everand
The Perks of Being a Wallflower
Stephen Chbosky
Rating: 4.5 out of 5 stars
4.5/5 (2099)
Little Women
From Everand
Little Women
Louisa May Alcott
Rating: 4 out of 5 stars
4/5 (104)
A Tree Grows in Brooklyn
From Everand
A Tree Grows in Brooklyn
Betty Smith
Rating: 4.5 out of 5 stars
4.5/5 (1929)
Manhattan Beach: A Novel
From Everand
Manhattan Beach: A Novel
Jennifer Egan
Rating: 3.5 out of 5 stars
3.5/5 (792)
Wolf Hall: A Novel
From Everand
Wolf Hall: A Novel
Hilary Mantel
Rating: 4 out of 5 stars
4/5 (3811)
The Art of Racing in the Rain: A Novel
From Everand
The Art of Racing in the Rain: A Novel
Garth Stein
Rating: 4 out of 5 stars
4/5 (4200)
The Constant Gardener: A Novel
From Everand
The Constant Gardener: A Novel
John le Carré
Rating: 3.5 out of 5 stars
3.5/5 (104)
Sing, Unburied, Sing: A Novel
From Everand
Sing, Unburied, Sing: A Novel
Jesmyn Ward
Rating: 4 out of 5 stars
4/5 (1103)
Her Body and Other Parties: Stories
From Everand
Her Body and Other Parties: Stories
Carmen Maria Machado
Rating: 4 out of 5 stars
4/5 (821)
Lab5 v2
Document12 pages
Lab5 v2
Shahab designer
No ratings yet
Administering Microsoft SQL Server Databases
Document4 pages
Administering Microsoft SQL Server Databases
Muthu Raman Chinnadurai
No ratings yet
Semester 2 Mid Term Exam
Document46 pages
Semester 2 Mid Term Exam
Mohanned Ali
No ratings yet
Steps To Perform For Rolling Forward A Standby Database Using RMAN Incremental Backup When Datafile Is Added To Primary (Doc ID 1531031.1)
Document3 pages
Steps To Perform For Rolling Forward A Standby Database Using RMAN Incremental Backup When Datafile Is Added To Primary (Doc ID 1531031.1)
Phan Long
No ratings yet
Properties Primary Key:: Candidate Key - A Column, or Set of Columns, in A Table That Can Uniquely
Document2 pages
Properties Primary Key:: Candidate Key - A Column, or Set of Columns, in A Table That Can Uniquely
Deadly Chiller
No ratings yet
Database Systems Data Modeling: Conceptual Modeling and ERD
Document16 pages
Database Systems Data Modeling: Conceptual Modeling and ERD
Toto Dando
No ratings yet
Mysql Using Sequences
Document2 pages
Mysql Using Sequences
Clyde Mkorombindo
100% (1)
Resume: Sajja Lakshmi Ganapathi
Document7 pages
Resume: Sajja Lakshmi Ganapathi
Ganapathi Sajja
No ratings yet
Visual Basic 5-6 Course Part 8
Document24 pages
Visual Basic 5-6 Course Part 8
Mark Collins
No ratings yet
Exercícios de Banco de Dados - Respostas - SQL
Document4 pages
Exercícios de Banco de Dados - Respostas - SQL
Lucy Anne de Omena
No ratings yet
Hyd 12 CS QP PB1-P
Document10 pages
Hyd 12 CS QP PB1-P
SUMIT KUMAR
No ratings yet
SQL DML and DDL Commands Explained
Document11 pages
SQL DML and DDL Commands Explained
kazsakura
No ratings yet
Database Management Systems - Prelims 2nd Attempt - 26 PDF
Document8 pages
Database Management Systems - Prelims 2nd Attempt - 26 PDF
jikjik
No ratings yet
Bit Arrays Bitwise Logical Operations: Bitmap Index
Document5 pages
Bit Arrays Bitwise Logical Operations: Bitmap Index
no_me_No_JOY
No ratings yet
DBMS MCQ
Document27 pages
DBMS MCQ
Rathore Yuvraj Singh
No ratings yet
AEM I Test
Document235 pages
AEM I Test
Reetika Jain
No ratings yet
SQL Server 2005 Architecture
Document33 pages
SQL Server 2005 Architecture
api-3715777
100% (2)
Assignment Brief BTEC Level 4-5 HNC/HND Diploma (QCF) : Merit and Distinction Descriptor
Document11 pages
Assignment Brief BTEC Level 4-5 HNC/HND Diploma (QCF) : Merit and Distinction Descriptor
Säthïshkä Priyad
100% (1)
Learner Transcript Report (17-Nov-2021
Document3 pages
Learner Transcript Report (17-Nov-2021
HARSH CHAUHAN 20SCSE1010094
No ratings yet
Semester 1 Final Exam Oracle PL SQL 3
Document24 pages
Semester 1 Final Exam Oracle PL SQL 3
Game GM
No ratings yet
The Definitive Guide To Delta Lake: Fundamentals and Performance
Document26 pages
The Definitive Guide To Delta Lake: Fundamentals and Performance
FabioSantanaReis
No ratings yet
DBMS 2 MCQ
Document6 pages
DBMS 2 MCQ
barunduari
No ratings yet
Daily Tasks in Oracle RDS
Document17 pages
Daily Tasks in Oracle RDS
ganesh_24
No ratings yet
Rsran046 - Sho Adjacencies-Cellpair
Document10 pages
Rsran046 - Sho Adjacencies-Cellpair
Onny
No ratings yet
Introduction To Database Management System, DBMS
Document13 pages
Introduction To Database Management System, DBMS
Mohammad Zakir Lasker
No ratings yet
Dbms Lab File: L ' L D I O M & S
Document4 pages
Dbms Lab File: L ' L D I O M & S
jtnshrma007
No ratings yet
VIEWS in Database
Document4 pages
VIEWS in Database
21112011024cs
No ratings yet
SQL Server Interview Questions Queries by Vikas Ahlawat
Document34 pages
SQL Server Interview Questions Queries by Vikas Ahlawat
vishnuvr143
0% (1)
Ms Silchar t2 2022 - Xii - Cs
Document5 pages
Ms Silchar t2 2022 - Xii - Cs
anithasancs
No ratings yet
Oracle Database Vault
Document356 pages
Oracle Database Vault
Alica Kollar
No ratings yet