SUBJECT: Big Data Analytics Lab (CSL702)

LIST OF EXPERIMENTS
CLASS: B.E. SEM: VII A.Y: 2023-24

LAB OUTCOMES:
1. To interpret business models and scientific computing paradigms, and apply software
tools for big data analytics.
2. To implement algorithms that use MapReduce on structured and unstructured
data.
3. To perform hands-on exercises with NoSQL databases such as Cassandra, Hadoop HBase, MongoDB, etc.
4. To implement various data stream algorithms.
5. To develop and analyze social network graphs with data visualization techniques.

Each entry lists: Expt No., Title of the Experiment, BLs, LOs, POs.

1a. Case Study on Hadoop Ecosystem
    BLs: BL2
    LOs: LO1
    POs: PO1, PO2, PO5, PO8, PO9, PO10

1b. To execute Hadoop HDFS Commands
    Hadoop HDFS Practical:
    - HDFS Basics, Hadoop Ecosystem Tools Overview.
    - Installing Hadoop.
    - Copying File to Hadoop.
    - Copy from Hadoop File system and deleting file.
    - Moving and displaying files in HDFS.
    - Programming exercises on Hadoop
    BLs: BL3
    LOs: LO1
    POs: PO1, PO2, PO3, PO4, PO5, PO8, PO9, PO10

2. To use Sqoop tool to transfer data between Hadoop & Relational database servers
    Use of Sqoop tool to transfer data between Hadoop and relational database servers.
    a. Sqoop - Installation.
    b. To execute basic commands of Hadoop ecosystem component Sqoop.
    BLs: BL3
    LOs: LO1
    POs: PO1, PO2, PO3, PO4, PO5, PO8, PO9, PO10

3. To perform programming exercises in NoSQL
    To install and configure MongoDB/ Cassandra/ HBase/ Hypertable to execute NoSQL commands.
    BLs: BL3
    LOs: LO3
    POs: PO1, PO2, PO3, PO4, PO5, PO8, PO9, PO10

4. To write a program to implement a word count problem using MapReduce
    Experiment on Hadoop Map-Reduce:
    - Write a program to implement a word count program using MapReduce.
    BLs: BL3
    LOs: LO2
    POs: PO1, PO2, PO3, PO4, PO5, PO8, PO9, PO10

5. To implement Flajolet-Martin algorithm
    BLs: BL3
    LOs: LO4
    POs: PO1, PO2, PO3, PO4, PO5, PO8, PO9, PO10

6. Social Network Analysis using R (e.g.: Community Detection Algorithm)
    BLs: BL3, BL4
    LOs: LO5
    POs: PO1, PO2, PO3, PO4, PO5, PO8, PO9, PO10

7. Data Visualization using Hive/Pig/R/Tableau.
    BLs: BL3, BL4
    LOs: LO5
    POs: PO1, PO2, PO3, PO4, PO5, PO8, PO9, PO10

8. Exploratory Data Analysis using Spark/PySpark
    BLs: BL3, BL4
    LOs: LO1
    POs: PO1, PO2, PO3, PO4, PO5, PO8, PO9, PO10

9. One real life large data application to be implemented (Use standard Datasets available on the web).
    - Streaming data analysis: use Flume for data capture, Hive/PySpark for analysis of Twitter data, chat data, weblog analysis, etc.
    - Recommendation System (for example: Health Care System, Stock Market Prediction, Movie Recommendation, etc.)
    - Spatio Temporal Data Analytics
    BLs: BL2, BL3, BL4, BL5
    LOs: LO1, LO2, LO3, LO4, LO5
    POs: PO1, PO2, PO3, PO4, PO5, PO8, PO9, PO10

Note: BDA MP report, experiment format:
* Problem statement
* Dataset used
* Technology used (theory)
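
For Experiment 4 (word count using MapReduce), the data flow can be previewed without a cluster. The following is a minimal sketch in plain Python, not Hadoop code: the function names `map_phase`, `shuffle_phase`, `reduce_phase`, and `word_count` are illustrative assumptions, and the grouping step stands in for the shuffle that the framework performs between the map and reduce stages.

```python
import re
from collections import defaultdict


def map_phase(line):
    """Mapper: emit a (word, 1) pair for every word in one input line."""
    return [(word, 1) for word in re.findall(r"[a-z0-9']+", line.lower())]


def shuffle_phase(pairs):
    """Shuffle: group emitted values by key (done by the framework on a real cluster)."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(word, counts):
    """Reducer: sum the counts collected for one word."""
    return word, sum(counts)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence over an iterable of text lines."""
    mapped = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle_phase(mapped)
    return dict(reduce_phase(word, counts) for word, counts in grouped.items())


if __name__ == "__main__":
    sample = ["big data is big", "data streams are data"]
    for word, count in sorted(word_count(sample).items()):
        print(word, count)
```

On a real Hadoop job the mapper and reducer run as separate tasks over HDFS blocks; this sketch only mirrors the logic of each phase in a single process.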
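
For Experiment 5 (Flajolet-Martin algorithm), the core idea is to hash each stream element, track the largest number of trailing zero bits R seen in any hash value, and estimate the number of distinct elements as 2^R. The sketch below assumes the simplest single-hash variant; the MD5-based `hash32` helper, the `seed` parameter, and the 32-bit width are illustrative choices and are not prescribed by the experiment list.

```python
import hashlib


def trailing_zeros(n, width=32):
    """Count trailing zero bits of n; return `width` for n == 0 by convention."""
    if n == 0:
        return width
    count = 0
    while n & 1 == 0:
        n >>= 1
        count += 1
    return count


def hash32(item, seed=0):
    """Deterministic 32-bit hash derived from MD5 (illustrative choice of hash)."""
    digest = hashlib.md5(f"{seed}:{item}".encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big")


def fm_estimate(stream, seed=0):
    """Single-hash Flajolet-Martin estimate: 2**R, with R the maximum tail length."""
    max_r = 0
    for item in stream:
        max_r = max(max_r, trailing_zeros(hash32(item, seed)))
    return 2 ** max_r


if __name__ == "__main__":
    stream = [i % 500 for i in range(10_000)]  # 500 distinct values, many repeats
    print("estimated distinct elements:", fm_estimate(stream))
```

A single hash function gives a coarse estimate that is always a power of two. Practical variants usually run several hash functions (different `seed` values here) and combine the results, for example by averaging within groups and taking the median across groups.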

Prof. Vijay Jumb, Faculty In-charge
Dr. Kunal Meher, Head of Department
