Professional Documents
Culture Documents
DATA
ANALYSIS
A Lokeshwari
22N31E0014
DEFINE CORRELATION
Correlation is a statistical measure that shows the strength of the relationship
between two variables. A correlation coefficient is a number between -1 and 1
that shows the direction and strength of the relationship. A correlation
coefficient of 1 indicates a perfect positive correlation, a correlation coefficient
of -1 indicates a perfect negative correlation, and a correlation coefficient of 0
indicates no correlation.
THERE ARE TWO MAIN
TYPES OF CORRELATION
Linear correlation: This type of correlation measures the strength of the linear relationship
between two variables. A linear relationship is a relationship where the points on a scatter plot
form a straight line.
Non-linear correlation: This type of correlation measures the strength of the non-linear
relationship between two variables. A non-linear relationship is a relationship where the points
on a scatter plot do not form a straight line.
TO CALCULATE
CORRELATION USING
EXCEL
We can use the CORREL function. The CORREL function takes two arrays as input and
returns a correlation coefficient. For example, the following code would calculate the
correlation coefficient between the variables "height" and "weight" in a dataset
=CORREL(height, weight)
The CORREL function is a powerful tool that can be used to analyze the relationship between
two variables. However, it is important to remember that correlation does not imply causation.
Just because two variables are correlated does not mean that one causes the other.
EXAMPLES OF
CORRELATION
The height and weight of adults are positively correlated. This means that as people get taller,
they tend to get heavier.
The price of gasoline and the number of miles driven are negatively correlated. This means
that as the price of gasoline goes up, people tend to drive less.
The number of hours students study and their grades are positively correlated. This means that
as students study more, they tend to get better grades.
create a data with 10 rows and 25 columns and sort the data as ascending descending apply
filter then secure your workbook sheet
steps on how to create a data with 10 rows and 25 columns, sort the data in ascending and
descending order, apply filter, and secure the workbook shee
Open Excel.
Create a new workbook.
In the first row, enter the column headers.
In the remaining rows, enter random data.
To sort the data in ascending order, click on the Data tab and then click on the Sort & Filter
button.
In the Sort dialog box, select the column that you want to sort by and then click on the
Ascending option.
To sort the data in descending order, click on the Descending option.
To apply a filter, click on the Data tab and then click on the Filter button.
In the Filter dialog box, select the column that you want to filter by and then select the criteria
that you want to use.
To secure the workbook sheet, click on the File tab and then click on the Protect Sheet button.
In the Protect Sheet dialog box, enter a password and then click on the OK button.
Once you have created the data, you can sort it in ascending or descending order by clicking on
the Data tab and then clicking on the Sort & Filter button. You can also apply a filter by clicking
on the Data tab and then clicking on the Filter button. To secure the workbook sheet, click on the
File tab and then click on the Protect Sheet button.
3 here are the steps on how to create different pivot tables from the above created
data and apply different colors to them:
Select the data that you want to create a pivot table from.
Click on the Insert tab and then click on the PivotTable button.
In the PivotTable dialog box, select the Existing sheet option and then select the
sheet that you want to create the pivot table on.
Click on the OK button.
In the pivot table, drag the column headers that you want to summarize to the
Rows, Columns, or Values areas.
To apply different colors to the pivot table, right-click on any cell in the pivot table
and then click on the PivotTable Styles option.
In the PivotTable Styles dialog box, select the style that you want to use and then
click on the OK button.
A pivot table that summarizes the data by column.
Once you have created the pivot tables, you can apply different colors to them to make
them easier to read and understand.
Here are some tips for applying different colors to pivot tables:
Mean, median, and mode are three measures of central tendency used in statistics to describe a
set of data.
Mean: The mean, also known as the average, is calculated by summing all the values in a
dataset and dividing the sum by the number of data points. It represents the "typical" value of
the data.
Median: The median is the middle value in a dataset when the data is arranged in ascending or
descending order. If the dataset has an odd number of data points, the median is the middle
value. If the dataset has an even number of data points, the median is the average of the two
middle values.
Mode: The mode is the value that occurs most frequently in a dataset. It represents the most
common value or category in the data.
To showcase the mean, median, and mode, let's use a subset of the Global Superstore 2016
data. Assume we have the following dataset representing the sales of a specific product: