The document outlines 9 experiments to be conducted as part of an IT course. The experiments cover topics like Hadoop, MapReduce, MongoDB, R programming, text mining, Bloom filters, clustering, and TF-IDF. Each experiment is allotted 2 hours and covers installing and using relevant software or programming libraries, performing basic tasks, analyzing sample datasets, and visualizing results. The goal of the experiments is to help students learn key concepts and skills across different technologies.
The document outlines 9 experiments to be conducted as part of an IT course. The experiments cover topics like Hadoop, MapReduce, MongoDB, R programming, text mining, Bloom filters, clustering, and TF-IDF. Each experiment is allotted 2 hours and covers installing and using relevant software or programming libraries, performing basic tasks, analyzing sample datasets, and visualizing results. The goal of the experiments is to help students learn key concepts and skills across different technologies.
The document outlines 9 experiments to be conducted as part of an IT course. The experiments cover topics like Hadoop, MapReduce, MongoDB, R programming, text mining, Bloom filters, clustering, and TF-IDF. Each experiment is allotted 2 hours and covers installing and using relevant software or programming libraries, performing basic tasks, analyzing sample datasets, and visualizing results. The goal of the experiments is to help students learn key concepts and skills across different technologies.
LabNo Experiment Data Set and Related Hours Module
Descriptions Lab 1 To draw and explain Hadoop Architecture Introduction 2 1 and Ecosystem with the help of a case Hadoop Installation study using WorkCount example. To define and install Hadoop. Lab 2 To implement the following file Hadoop - HDFS UI Ports and 2 2 management tasks in Hadoop System Nodes (HDFS): Adding files and directories, Hadoop_Lab - helloHDFS Retrieving files, Deleting files Lab 3 To run a basic Word Count MapReduce Steps to download gutenberg 2 2 program to understand MapReduce dataset Paradigm: To count words in a given WordCount.java file, To view the output file, and To calculate execution time. WordCount.jar WordCount using gutenberg dataset Lab 4 To perform NoSQL database using mongodb MongoDB_Lab - Installation 2 3 to create, update and insert. Steps primer-dataset.json MongoDB_Lab - Operations Lab 5 To study and implement basic functions R Installation Steps 2 5 and commands in R Programming. R Basic Commands Exploratory Data Analysis Lab 6 To build WordCloud, a text mining About WordCloud 2 10 method using R for easy to understand Sample Dataset and visualization than a table data. WorldCloud Example Lab 7 To implement Bloom Filters for filter About Bloom Filter 2 6 on Stream Data in C++/java. Bloom Filter Example Bloom Filter Sample Code Lab 8 To implement clustering program using R About Clutering 2 8 programming. foodstuffs_dataset.txt foodstuffs_clustering_example Lab 9 To find Term Frequency and Inverse TF-IDF Definition 2 5 & Document Frequency (tf-idf) 10 TF-IDF Example Matrix for Recommendation Systems and Plot TF Using R used. Dataset TF-IDF Code in R