Welcome to Scribd!

Big Data Analytic Lab Syllabus

Uploaded by

0% found this document useful (0 votes)

10 views1 page

This document contains a syllabus for a Big Data Analytics lab course divided into 3 units totaling 15 hours. The syllabus lists 9 experiments mapping to course outcomes involving configuring Hadoop clusters, performing MapReduce jobs, interacting with HDFS, using Apache Spark, connecting to MongoDB databases, modeling social network data in NoSQL, performing data migration to NoSQL, performing predictive analytics with MLlib, performing machine learning with scikit-learn, and designing a prescriptive analytics solution for a case study.

Original Description:

Original Title

Big Data Analytic Lab Syllabus (1)

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

10 views1 page

Big Data Analytic Lab Syllabus

Uploaded by

sujaniankratos68

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 1

Search inside document

BIG DATA ANALYTICS LAB SYLLABUS (20CSP/ITP-471)

S. No. Name of Experiments Hours /Cos

Mapped
Unit-1 15 Hours
1.1 Write a program to configure a small Hadoop cluster with at least one master CO1
and two worker nodes.

1.2 Write a program for Map Reduce to analyses a dataset and understand the CO2
MapReduce workflow with execution on the Hadoop cluster.

1.3 Write a program to interact with HDFS using the Hadoop File System API to CO2
create a new file, write some content to it, and then read and display the
content from the file.

1.4 Write a program using Apache Spark to process a large dataset. CO2

Unit -2
2.1 Write a program to connect to a MongoDB instance to create a new database and a CO3
collection, insert multiple documents into the collection, and then query and display
the documents.
2.2 Write a program in to model a sample dataset for a social networking application in a CO3
NoSQL database.
2.3 Write a program using a suitable NoSQL driver (e.g., pymongo for MongoDB) to CO3
perform data migration from a CSV file to a NoSQL database.
Unit-3
3.1 Write a program using MLlib to perform predictive analytics on a large dataset. CO5
Choose a suitable machine-learning algorithm to predict a target variable.
3.2 Write a Python program using a machine learning library (e.g., scikit-learn) to perform CO5
feature engineering and evaluate a predictive model.
3.3 Write a program to design a prescriptive analytics solution for one of the real-world CO4
case studies (Walmart, Uber, Netflix, or eBay).

ACA BigData Consolidated Dump
Document29 pages
ACA BigData Consolidated Dump
Ahimed Habib Husen
No ratings yet
File-439742875-439742875 ArcPyLab 2123286428607349
Document2 pages
File-439742875-439742875 ArcPyLab 2123286428607349
EUGENE AICHA
No ratings yet
Exploring Hadoop Ecosystem (Volume 2): Stream Processing
From Everand
Exploring Hadoop Ecosystem (Volume 2): Stream Processing
Wei Liu
No ratings yet
Bda Lab
Document94 pages
Bda Lab
Dinesh Raj
No ratings yet
Notes
Document53 pages
Notes
Radheshyam Shah
No ratings yet
Pig Mix Benchmark
Document6 pages
Pig Mix Benchmark
abhimonica
No ratings yet
SCS16L
Document3 pages
SCS16L
sanjay mehra
No ratings yet
Daily Data Ingestion/data Node Capacity
Document2 pages
Daily Data Ingestion/data Node Capacity
Sam Unkil
No ratings yet
Big Data HW
Document6 pages
Big Data HW
Adilet Karim
No ratings yet
Lab2 WC
Document2 pages
Lab2 WC
wisesharkwhale
No ratings yet
Big Data Workshop Contents
Document2 pages
Big Data Workshop Contents
Sunil Patil
No ratings yet
20dce017 Bda Pracfil
Document41 pages
20dce017 Bda Pracfil
Raj Chauhan
No ratings yet
Databricks Spark Reference Applications
Document37 pages
Databricks Spark Reference Applications
jose
No ratings yet
Faculty of Informatics MCA V Semester (CBSE) Examination, 2021 Subject: Big Data Analytics Lab Question Bank
Document2 pages
Faculty of Informatics MCA V Semester (CBSE) Examination, 2021 Subject: Big Data Analytics Lab Question Bank
Vinay Kiran
No ratings yet
Final
Document276 pages
Final
Yàssine Hàdry
No ratings yet
SparkinDockerinKubernetes APracticalApproachforScalableNLP byJrgenSchmidl TowardsDataScience
Document23 pages
SparkinDockerinKubernetes APracticalApproachforScalableNLP byJrgenSchmidl TowardsDataScience
Kishan hari
No ratings yet
Bigdata Lab
Document55 pages
Bigdata Lab
Radheshyam Shah
No ratings yet
Kadi Sarva Vishwavidyalaya: LDRP Institute of Technology and Research Gandhinagar
Document44 pages
Kadi Sarva Vishwavidyalaya: LDRP Institute of Technology and Research Gandhinagar
Himanshu M
No ratings yet
Comparative Study of CouchDB and MongoDB - NoSQL Document Oriented Databases
Document3 pages
Comparative Study of CouchDB and MongoDB - NoSQL Document Oriented Databases
NITESHWAR BHARDWAJ
100% (1)
Map Reduce Programming
Document1 page
Map Reduce Programming
llllll
No ratings yet
SkipQ Full Stack Training - Week-By-Week Curriculum
Document3 pages
SkipQ Full Stack Training - Week-By-Week Curriculum
heythereiamanengineer
No ratings yet
Singh 2016
Document10 pages
Singh 2016
Sebastian
No ratings yet
BD - Unit - III - MapReduce
Document31 pages
BD - Unit - III - MapReduce
Prem Kumar
No ratings yet
Sports Day Points Calculator Python and Mysql
Document22 pages
Sports Day Points Calculator Python and Mysql
Rajnish Rajkumar
No ratings yet
Practical Assignment - :: Distributed Data Processing With Apache Spark
Document3 pages
Practical Assignment - :: Distributed Data Processing With Apache Spark
Teshome Mulugeta
No ratings yet
Lab4 Intro & Lab4
Document20 pages
Lab4 Intro & Lab4
Johanson Camasura
No ratings yet
Module-2 - Introduction To Hadoop
Document13 pages
Module-2 - Introduction To Hadoop
shreya
No ratings yet
BDA LabCompendium
Document6 pages
BDA LabCompendium
KiS Mint
No ratings yet
Hands-On Exercises With Big Data: Lab Sheet 1: Getting Started With Mapreduce and Hadoop
Document14 pages
Hands-On Exercises With Big Data: Lab Sheet 1: Getting Started With Mapreduce and Hadoop
moorthykem
No ratings yet
Big Data Module 2
Document23 pages
Big Data Module 2
Srikanth M
No ratings yet
Apache SystemDS
Document4 pages
Apache SystemDS
levin696
No ratings yet
MR YARN - Lab 2 - Cloud - Updated-V2.0
Document22 pages
MR YARN - Lab 2 - Cloud - Updated-V2.0
bender1686
No ratings yet
Apache Spark For Beginners
Document30 pages
Apache Spark For Beginners
ankesh patel
No ratings yet
Bda Lab Manual
Document45 pages
Bda Lab Manual
Srinivas Nani
No ratings yet
Lab Syllabus Format
Document4 pages
Lab Syllabus Format
Narendra Babu
No ratings yet
Final Project Report
Document34 pages
Final Project Report
Sahadev Marik
No ratings yet
A Comparative Study On Apache Spark and Map Reduce With Performance Analysis Using KNN and Page Rank Algorithm
Document6 pages
A Comparative Study On Apache Spark and Map Reduce With Performance Analysis Using KNN and Page Rank Algorithm
Editor IJTSRD
No ratings yet
Shmstreaming: A Shared Memory Approach For Improving Hadoop Streaming Performance
Document8 pages
Shmstreaming: A Shared Memory Approach For Improving Hadoop Streaming Performance
Sebastian
No ratings yet
Spark Interview Questions PDF 2
Document19 pages
Spark Interview Questions PDF 2
Varun
No ratings yet
Kcs 061 PPT Unit 2
Document56 pages
Kcs 061 PPT Unit 2
PRACHI ROSHAN
No ratings yet
20CM1110
Document2 pages
20CM1110
marce.rottenhotmail.com
No ratings yet
Hadoop Bitcoin-BlockChain - A New Era Needed in Distributed Computing
Document7 pages
Hadoop Bitcoin-BlockChain - A New Era Needed in Distributed Computing
pacdox
No ratings yet
Road To Data Engineer
Document9 pages
Road To Data Engineer
dtanonimo
No ratings yet
CSC431 AI - ProjectSLU2023
Document2 pages
CSC431 AI - ProjectSLU2023
muazu muhammad
No ratings yet
Pig Hive Bench Marking
Document33 pages
Pig Hive Bench Marking
MANAA
No ratings yet
COS 126 - Assignment 8
Document2 pages
COS 126 - Assignment 8
ivaneshubham
No ratings yet
Spark Streaming Research
Document6 pages
Spark Streaming Research
reshmashaik4656
No ratings yet
Mongodb Spark
Document13 pages
Mongodb Spark
Atif Fayaz Ali
No ratings yet
End Term Report
Document26 pages
End Term Report
Mohit Akerkar
No ratings yet
Institute of Technology: Practical List
Document4 pages
Institute of Technology: Practical List
Alex Tiwari
No ratings yet
ML Final Lab Manual
Document68 pages
ML Final Lab Manual
Awanit Kumar
No ratings yet
CS-702 (D) BigData
Document61 pages
CS-702 (D) BigData
garima bh
No ratings yet
BDA Module2-Chapter1
Document21 pages
BDA Module2-Chapter1
Lahari bilimale
No ratings yet
Oracle Data Integrator For Big Data: Alex Kotopoulis
Document42 pages
Oracle Data Integrator For Big Data: Alex Kotopoulis
mateusmfs1
No ratings yet
CSE 5311: Design and Analysis of Algorithms Programming Project Topics
Document3 pages
CSE 5311: Design and Analysis of Algorithms Programming Project Topics
Fgv
No ratings yet
Electronics 05 00029
Document14 pages
Electronics 05 00029
Wahyu
No ratings yet
Hadoop Job Runner UI Tool
Document10 pages
Hadoop Job Runner UI Tool
International Journal of Engineering Inventions (IJEI)
No ratings yet
A4 Resume Parser
Document1 page
A4 Resume Parser
Munthitra Thadthapong
No ratings yet
Big Data Glossary - HPE
Document8 pages
Big Data Glossary - HPE
maximaximo
No ratings yet
Bigdata and Hadoop
Document27 pages
Bigdata and Hadoop
Sauham Joshi
No ratings yet