
BIG DATA ANALYTICS LAB SYLLABUS (20CSP/ITP-471)

S. No. Name of Experiments Hours / COs Mapped
Unit-1 15 Hours
1.1 Write a program to configure a small Hadoop cluster with at least one master CO1
and two worker nodes.

1.2 Write a MapReduce program to analyze a dataset and understand the CO2
MapReduce workflow by executing it on the Hadoop cluster.

1.3 Write a program to interact with HDFS using the Hadoop File System API to CO2
create a new file, write some content to it, and then read and display the
content from the file.

1.4 Write a program using Apache Spark to process a large dataset. CO2

Unit-2
2.1 Write a program to connect to a MongoDB instance to create a new database and a CO3
collection, insert multiple documents into the collection, and then query and display
the documents.
2.2 Write a program to model a sample dataset for a social networking application in a CO3
NoSQL database.
2.3 Write a program using a suitable NoSQL driver (e.g., pymongo for MongoDB) to CO3
perform data migration from a CSV file to a NoSQL database.
Unit-3
3.1 Write a program using MLlib to perform predictive analytics on a large dataset. CO5
Choose a suitable machine-learning algorithm to predict a target variable.
3.2 Write a Python program using a machine learning library (e.g., scikit-learn) to perform CO5
feature engineering and evaluate a predictive model.
3.3 Write a program to design a prescriptive analytics solution for one of the real-world CO4
case studies (Walmart, Uber, Netflix, or eBay).
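
As a rough illustration of the MapReduce workflow named in Experiment 1.2, the sketch below simulates the map, shuffle, and reduce phases in plain Python on a tiny in-memory dataset. It is not Hadoop code and is not part of the syllabus: the function names (map_phase, shuffle_phase, reduce_phase, word_count) and the sample input are assumptions chosen for illustration only. On a real cluster the framework performs the shuffle step and distributes the work across worker nodes.

```python
from collections import defaultdict


def map_phase(lines):
    """Map: emit a (word, 1) pair for every word in every input line."""
    for line in lines:
        for word in line.lower().split():
            yield word, 1


def shuffle_phase(pairs):
    """Shuffle: group emitted values by key (done by the framework on a real cluster)."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(grouped):
    """Reduce: sum the list of values collected for each key."""
    return {key: sum(values) for key, values in grouped.items()}


def word_count(lines):
    """Chain the three phases to count word occurrences."""
    return reduce_phase(shuffle_phase(map_phase(lines)))


# Hypothetical sample input standing in for a dataset stored in HDFS.
sample = ["big data analytics", "big data lab"]
counts = word_count(sample)
print(counts)
```

The same three-phase structure carries over to a Hadoop job, where the mapper and reducer run as separate tasks and the input is read from HDFS instead of a Python list.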
