Professional Documents
Culture Documents
Big Data Analytic Lab Syllabus
Big Data Analytic Lab Syllabus
1.2 Write a program for Map Reduce to analyses a dataset and understand the CO2
MapReduce workflow with execution on the Hadoop cluster.
1.3 Write a program to interact with HDFS using the Hadoop File System API to CO2
create a new file, write some content to it, and then read and display the
content from the file.
1.4 Write a program using Apache Spark to process a large dataset. CO2
Unit -2
2.1 Write a program to connect to a MongoDB instance to create a new database and a CO3
collection, insert multiple documents into the collection, and then query and display
the documents.
2.2 Write a program in to model a sample dataset for a social networking application in a CO3
NoSQL database.
2.3 Write a program using a suitable NoSQL driver (e.g., pymongo for MongoDB) to CO3
perform data migration from a CSV file to a NoSQL database.
Unit-3
3.1 Write a program using MLlib to perform predictive analytics on a large dataset. CO5
Choose a suitable machine-learning algorithm to predict a target variable.
3.2 Write a Python program using a machine learning library (e.g., scikit-learn) to perform CO5
feature engineering and evaluate a predictive model.
3.3 Write a program to design a prescriptive analytics solution for one of the real-world CO4
case studies (Walmart, Uber, Netflix, or eBay).