Khan (Campus)
Assignment
Roll No:
Class: BS-IT
Submitted to:
Q1. What is Internet traffic measurement and statistical analysis?
In computer networks, network traffic measurement is the process of measuring the amount and
type of traffic on a particular network. Traffic can be measured with active techniques or
passive techniques. Active techniques are more intrusive but are arguably more accurate.
Passive techniques add less network overhead and can therefore run in the background and be used
to trigger network management actions. A limitation of active measurement is that it may
disturb the network by injecting artificial probe traffic into it. The main drawback
of passive measurement is that it assumes the measurer “owns” the network being observed, that
is, has access to the points where traffic can be captured.
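As a rough illustration of passive measurement, the sketch below summarizes the amount and type of traffic from packets that have already been observed, without injecting any probe traffic. The record layout (protocol, size in bytes), the sample values, and the function name summarize_traffic are assumptions made for this example; they do not come from any particular capture tool.

```python
from collections import defaultdict

# Hypothetical captured packet records: (protocol, size in bytes).
# In practice these would come from a capture tool or a flow exporter.
PACKETS = [
    ("TCP", 1500),
    ("UDP", 512),
    ("TCP", 40),
    ("ICMP", 84),
    ("UDP", 128),
]


def summarize_traffic(packets):
    """Return {protocol: (packet_count, total_bytes)} for the observed packets."""
    totals = defaultdict(lambda: [0, 0])
    for protocol, size in packets:
        totals[protocol][0] += 1      # amount of traffic: packet count
        totals[protocol][1] += size   # amount of traffic: byte count
    return {proto: (count, nbytes) for proto, (count, nbytes) in totals.items()}


summary = summarize_traffic(PACKETS)
for proto, (count, nbytes) in sorted(summary.items()):
    print(f"{proto}: {count} packets, {nbytes} bytes")
```

Because the code only reads records that were already collected, it adds no load to the network, which is the property that lets passive measurement run in the background.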
Network traffic measurement faces two main challenges: 1) flow statistics
computation time and 2) single node failure. To address these challenges, I want to implement
Internet traffic measurement and analysis using the MapReduce programming model of the Hadoop
framework. Apache Hadoop is an open-source software framework for the storage and large-scale
processing of datasets, such as NetFlow datasets. MapReduce is a programming model and an
associated implementation for processing and generating large datasets that is amenable to a
broad variety of real-world tasks.
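The MapReduce idea can be sketched in plain Python, without Hadoop, to show how a flow statistic such as total bytes per source address splits into a map phase and a reduce phase. This is only a single-process illustration under assumed inputs: the record layout (source IP, destination IP, bytes), the sample flows, and the function names are hypothetical, and a real Hadoop job would distribute the same two phases across many nodes.

```python
from collections import defaultdict

# Hypothetical NetFlow-like records: (src_ip, dst_ip, bytes).
FLOWS = [
    ("10.0.0.1", "10.0.0.9", 1200),
    ("10.0.0.2", "10.0.0.9", 300),
    ("10.0.0.1", "10.0.0.7", 800),
    ("10.0.0.3", "10.0.0.9", 50),
    ("10.0.0.2", "10.0.0.7", 700),
]


def map_flow(record):
    """Map phase: emit one (key, value) pair per flow record."""
    src_ip, _dst_ip, nbytes = record
    yield src_ip, nbytes


def shuffle(pairs):
    """Group all values by key, as the framework does between the two phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_bytes(key, values):
    """Reduce phase: combine all values that share a key into one result."""
    return key, sum(values)


def run_job(records):
    """Run map, shuffle, and reduce in sequence on one machine."""
    mapped = (pair for record in records for pair in map_flow(record))
    grouped = shuffle(mapped)
    return dict(reduce_bytes(key, values) for key, values in grouped.items())


bytes_per_source = run_job(FLOWS)
for src_ip, total in sorted(bytes_per_source.items()):
    print(f"{src_ip}: {total} bytes")
```

Each call to map_flow depends only on its own record, and each call to reduce_bytes depends only on one key, so both phases can run in parallel on separate machines. That independence is what addresses the computation time challenge.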
Statistical analysis
Statistical analysis is a component of data analytics. In the context of business intelligence (BI),
statistical analysis involves collecting and scrutinizing every data sample in a set of items from
which samples can be drawn. A sample, in statistics, is a representative selection drawn from a
total population.
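The relationship between a sample and its population can be illustrated with a short sketch that draws a random sample and compares its mean with the population mean. The population here is synthetic (values generated around an assumed mean of 500 with a fixed random seed) and stands in for a measured quantity such as packet size; none of the numbers are real measurements.

```python
import random
import statistics

# Fixed seed so that the example is repeatable.
rng = random.Random(42)

# Synthetic population: 10,000 hypothetical values, e.g. packet sizes in bytes.
population = [rng.gauss(500, 100) for _ in range(10_000)]

# A simple random sample drawn without replacement from the population.
sample = rng.sample(population, 200)

population_mean = statistics.mean(population)
sample_mean = statistics.mean(sample)

# Standard error: a rough measure of how far the sample mean is expected
# to fall from the population mean.
standard_error = statistics.stdev(sample) / len(sample) ** 0.5

print(f"population mean: {population_mean:.1f}")
print(f"sample mean:     {sample_mean:.1f}")
print(f"standard error:  {standard_error:.1f}")
```

A representative sample gives a mean close to the population mean at a fraction of the cost of examining every item, and the standard error indicates roughly how close.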
Statistical analysis can be broken down into discrete steps, which include the following:
Create a model to summarize understanding of how the data relates to the underlying
population.
Employ predictive analytics to run scenarios that will help guide future actions.
The goal of statistical analysis is to identify trends. A retail business, for example, might use
statistical analysis to find patterns in unstructured and semi-structured customer data that can be
used to create a more positive customer experience and increase sales.
SAS can read many types of data and can access data from many other software packages and
file formats. Logical operations can also be performed in SAS by using IF-THEN
statements. In a DATA step, SAS runs the statements in an implicit loop, step by step, once
for each observation, and executes the program quickly. The Output Delivery System (ODS) is
used to produce output in other formats, such as HTML, RTF, and Excel. We can also write
macros in SAS to meet various research needs.
SAS Windows:
SAS has the following main windows:
Editor window
Log window
Output window
Results window
Explorer window