# Name: Saptarshi Roy Roll Number: Learning Centre: ACES INFOTECH PRIVATE LIMITED (Code 02783) Subject: Statistics

for Management (MB0024) Assignment No.: Set 1 and 2. Date of Submission at the Learning Centre:

2008 Dec. 2008 Nov. 2008 Sept. 2008 Aug. 10 applications were automatically rejected as they were not satisfying the requirement of minimum balance. 2008 Oct. Monthly Minimum Balance in INR Month s A/C Holder 1 60000 70000 55000 90000 56000 80000 82000 79000 51000 95000 82000 83000 A/C Holder 2 56000 76000 11000 0 89000 88000 52000 58000 95000 86000 90000 82000 55000 A/C Holder 3 66000 74000 11200 0 90000 84000 57000 96000 55000 76000 95000 87000 56000 A/C Holder 4 86000 96000 19000 0 98000 84000 57000 66000 93000 74000 99000 84000 57000 A/C Holder 5 56000 76000 11000 0 89000 88000 52000 58000 95000 86000 90000 82000 55000 A/C Holder 6 59000 96000 12000 0 97000 98000 57000 56000 98000 88000 99000 88000 59000 A/C Holder 7 59000 78000 11500 0 87000 90000 55000 86000 99000 89000 95000 87000 59000 A/C Holder 8 52000 73000 11200 0 93000 89000 54000 55000 96000 97000 99000 88000 59000 A/C Holder 9 53000 98000 11300 0 66000 87000 59000 98000 59000 87000 95000 86000 52000 A/C Holder 10 56000 76000 12000 0 89000 86000 72000 98000 95000 84000 90000 82000 53000 Jan. 2008 You as an Assistant Branch Manager of the Bank are entrusted the task of selecting two account holders for sanctioning the loans. How you will select the two individuals among the 10 applicants to give the loan using appropriate statistical techniques? Give proper justification for your selection. 2008 Apr. the Bank decided that accountholder who had maintained a minimum balance of 50000 INR would only be considered for the loan. As a result. who are businessmen of the locality. Now. In order to reject some of the applications (as the fund was limited). 2008 Jun. the 10 applications remained and it was found that monthly minimum balance in all the cases were more than 50000 INR for the last 12 months. About 20 businessmen had applied for loan in order to develop their business further.Set 1 Case 1 ABC Branch of XYZ Bank has decided to give 10 Lakh of loan each on long term basis to only two of their customers (accountholders). 2008 Mar. 2008 Jul. Ans 1] Taking sum of the minimum balances for each month we get the following data. 2008 Feb. Their account details of monthly minimum balance are given below. 2008 May. Average Minimum Balance = Sum of minimum balances for the year / 12 Months A/C A/C Holde Holde r1 r2 Monthly Minimum Balance in INR A/C A/C A/C Holde Holder Holde r3 4 r5 A/C Holder 6 A/C Holde r7 A/C Holde r8 A/C Holde r9 A/C Holde r 10 .

84.0 00 73. Average Minimum Balance = Sum of min balances for the year / 12 Two account holders are selected who have maintained highest average minimum balance. 2008 Jul. 2008 Nov.583 during the period Jan.015.083 89000 97000 88000 98000 52000 57000 58000 56000 95000 98000 86000 88000 90000 99000 82000 88000 55000 59000 937. 2008. 2008 Jun. 000 84.0 00 78. A/C Holder 4 has maintained minimum average balance of Rs. From the above table we get. 2008 to Dec. 90.583 6600 0 8700 0 5900 0 9800 0 5900 0 8700 0 9500 0 8600 0 5200 0 89000 86000 72000 98000 95000 84000 90000 82000 53000 948.583 953. 2008 to Dec.250 9300 0 8900 0 5400 0 5500 0 9600 0 9700 0 9900 0 8800 0 5900 0 967. . 2008. 2008 Aug.417 90.0 00 79. 00 000 79. Bal 6000 56000 66000 86000 56000 59000 5900 5200 5300 56000 0 0 0 0 7000 76000 74000 96000 76000 96000 7800 7300 9800 76000 0 0 0 0 5500 0 9000 0 5600 0 8000 0 8200 0 7900 0 5100 0 9500 0 8200 0 8300 0 883.083 8700 0 9000 0 5500 0 8600 0 9900 0 8900 0 9500 0 8700 0 5900 0 999. 2008 Apr.001.583 110000 112000 190000 110000 120000 115000 112000 113000 120000 89000 90000 98000 88000 84000 84000 52000 57000 57000 58000 96000 66000 95000 55000 93000 86000 76000 74000 90000 95000 99000 82000 87000 84000 55000 56000 57000 937.000 1.0 00 1.0 00 83.084. 2008 May. 2008 Total Bal Avg. 2008 Mar. A/C Holder 6 has maintained minimum average balance of Rs.0 1.333 during the period Jan.0 00 78.0 00 80. 2008. 2008 Dec. A/C Holder 4 and A/C Holder 6 are selected for sanction of loan based on Average Minimum Balance maintained by them during the period Jan.333 By taking average minimum balance for the whole year of each customer we arrive at the following conclusion. 2008 Oct.Jan. 2008 Feb. 2008 to Dec.417 83. 2008 Sept.

Sampling error is the degree to which a sample might differ from the population. Population is the totality all objects about which the study is proposed. The purpose of sampling is to reduce the cost and/or the amount of work that it would take to survey the entire target population. There are no strict rules to follow. which is selected using certain statistical principles called sampling designs (this is for guaranteeing that a representative sample is obtained for the study). but carefully chosen sample can be used to represent the population. the degree to which the sample differs from the population remains unknown. Sample survey can also be described as the technique used to study about a population with the help of a sample. Sampling is a tool which enables us to draw conclusions about the characteristics of the population. Each member of the population has an equal and known chance of being selected. It is incumbent on the researcher to clearly define the target population. the entire population will be sufficiently small. In non-probability sampling. Once the sample decided information will be collected from this sample. In probability samples. Sample is only a portion of this population. results are reported plus or minus the sampling error. quota sampling. judgment sampling. members are selected from the population in some non-random manner. When inferring to the population. The population is defined in keeping with the objectives of the study. each member of the population has a known non-zero probability of being selected. The advantage of probability sampling is that sampling error can be calculated. but in the context of survey sampling it most often refers to a questionnaire used to measure the characteristics and/or attitudes of people. A survey may refer to many different types or techniques of observation. and snowball sampling. These include convenience sampling. It is also called an Nth name selection technique. Sometimes. Usually. systematic sampling. and stratified sampling. In non-probability sampling. Probability Sampling Methods 1. Ans 1] Sample is a finite subset of a population drawn from it to estimate the characteristics of the population. Systematic sampling is often used instead of random sampling. Survey sampling describes the process of selecting a sample of elements from a target population in order to conduct a survey. the population is too large for the researcher to attempt to survey all of its members. so the pool of available subjects becomes biased. Probability methods include random sampling.Set 2 1. 2. it is often difficult or impossible to identify every member of the population. A small. When there are very large populations. and the researcher must rely on logic and judgment. Random sampling is the purest form of probability sampling. What do you mean by sample survey? What are the different sampling methods? Briefly describe them. MB 0025 –Set 2 Page 2 Sampling methods are classified as either probability or non-probability. and the researcher can include the entire population in the study. A survey that measures the entire target population is called a census. The sample reflects the characteristics of the population from which it is drawn. which process is called sample survey. This type of research is called a census study because data is gathered on every member of the population. After the required sample size has been .

then they are said to be correlated. The researcher selects the sample based on judgment. What is the different between correlation and regression? What do you understand by Rank Correlation? When we use rank correlation and when we use Pearsonian Correlation Coefficient? Fit a linear regression line in the following data – X 12 15 18 20 27 34 28 48 Y 123 150 158 170 180 184 176 130 Ans 2] Correlation When two or more variables move in sympathy with other. Its only advantage over the random sampling technique is simplicity. As the name implies. every Nth record is selected from a list of population members. As long as the list does not contain any hidden order. Snowball sampling relies on referrals from initial subjects to generate additional subjects. This is usually extension of convenience sampling. Systematic sampling is frequently used to select a specified number of records from a computer file. Quota sampling is the non-probability equivalent of stratified sampling. Examples of stratums might be males and females. 3. 2. If both variables move in the same direction then they are said to be . 4. For example. "Sufficient" refers to a sample size large enough for us to be reasonably confident that the stratum represents the population. When MB 0025 –Set 2 Page 3 using this method. It may be extremely difficult or cost prohibitive to locate respondents in these situations. even though the population includes all cities. a researcher may decide to draw the entire sample from one "representative" city. Judgment sampling is a common non-probability method. The researcher first identifies the relevant stratums and their actual representation in the population. Random sampling is then used to select a sufficient number of subjects from each stratum. 3. without incurring the cost or time required to select a random sample. This non-probability method is often used during preliminary research efforts to get a gross estimate of the results. where the stratums are filled by random sampling. Like stratified sampling. the researcher first identifies the stratums and their proportions as they are represented in the population. Stratified sampling is often used when one or more of the stratums in the population have a low incidence relative to the other stratums. 2. the researcher must be confident that the chosen sample is truly representative of the entire population. Stratified sampling is commonly used probability method that is superior to random sampling because it reduces sampling error. the sample is selected because they are convenient. While this technique can dramatically lower search costs. or managers and non-managers. Non Probability Methods 1. Convenience sampling is used in exploratory research where the researcher is interested in getting an inexpensive approximation of the truth. A stratum is a subset of the population that share at least one common characteristic. This differs from stratified sampling. Snowball sampling is a special non-probability method used when the desired sample characteristic is rare. Then convenience or judgment sampling is used to select the required number of subjects from each stratum.calculated. this sampling method is as good as the random sampling method. it comes at the expense of introducing bias because the technique itself reduces the likelihood that the sample will represent a good cross section from the population.

” Correlation analysis attempts to study the relationship between the two variables x and y. If the variables move in opposite direction then they are said to be negatively correlated.e.) and the Y variable is something you measure. You simply quantify how well two variables relate to each other.g. say X and Y. Correlation analysis deals with 1) Measuring the relationship between variables. Xi = 0 or 1. which are used to code some nominal variables. With correlation you don't have to think about cause and effect. With regression.positively correlated. Regression Regression is defined as. You simply are computing a correlation coefficient (r) that tells you how much one variable tends to change when the other one does. MB 0025 –Set 2 Page 4 Different between correlation and regression Correlation and linear regression are not the same.. It rarely is appropriate when one variable is something you experimentally manipulate. 3) Giving confidence interval for population correlation measure. you do have to think about cause and effect as the regression line is determined as the best way to predict Y from X.e. in experiments. On the other hand. Consider these differences: Correlation quantifies the degree to which two variables are related. concentration. The correlation answers the STRENGTH of linear association between paired variables. in an experiment. X in terms of . 2) Testing the relationship for its significance. “the measure of the average relationship between two or more variables in terms of the original units of the data. You'll get the same correlation coefficient if you swap the two.. for example. (2b) Linear regression is used whenever: * at least one of the independent variables (Xi's) is to predict the dependent variable Y. * if one manipulates the X variable. * correlation is not used when the variables are manipulated. With linear regression. Linear regression are not symmetric in terms of X and Y. That is interchanging X and Y will give a different regression model (i. i. Correlation does not find a best-fit line (that is regression). e. The dependence is expressed in the form of the equations. it doesn't matter which of the two variables you call "X" and which you call "Y". (2a) Correlation is calculated whenever: * both X and Y is measured in each subject and quantifies how much they are linearly associated. * in particular the Pearson's product moment correlation coefficient is used when the assumption of both X and Y are sampled from normally-distributed populations are satisfied * or the Spearman's moment order correlation coefficient is used if the assumption of normality is not satisfied. If they move haphazardly then there is no correlation between them. as you'll get a different best-fit line if you swap the two. the regression tells us the FORM of linear association that best predicts Y from the values of X. Regression analysis attempts to predict the average x for a given y. In Regression it is attempted to quantify the dependence of one variable on the other. With correlation. Correlation is almost always used when you measure both variables. The line that best predicts Y from X is not the same as the line that predicts X from Y. the decision of which variable you call "X" and which you call "Y" matters a lot. With linear regression. Note: Some of the Xi's are dummy variables. the X variable is often something you experimental manipulate (time.

i. The raw scores are converted to ranks. however. then ρ is given by: where: di = xi − yi = the difference between the ranks of corresponding values Xi and Yi. it assesses how well an arbitrary monotonic function could describe the relationship between two variables. One has to assign the same rank to each of the equal values. 1.80 The same underlying distribution is assumed for all variables in linear regression. X 12 15 18 20 27 34 28 48 Y 123 150 158 170 180 184 176 130 Linear Regression Line for the above data can be plotted as : .Y) against the original Y in terms of X. Certain other measures of correlation are parametric in the sense of being based on possible relationships of a parameterized form.E can be used. If tied ranks exist. is a nonparametric measure of correlation – that is. MB 0025 –Set 2 Page 6 Conditions under which P. In principle. 2. Samples must have been selected at random. 3.e. if you interchange variables X and Y in the calculation of correlation coefficient you will get the same value of this correlation coefficient. classic Pearson's correlation coefficient between ranks has to be used instead of this formula. >= 0. It is an average of their positions in the ascending order of the values. MB 0025 –Set 2 Page 5 The "best" linear regression model is obtained by selecting the variables (X's) with at least strong correlation to Y. and n = the number of values in each data set (same for both sets). If there are no tied ranks. linear regression will underestimate the correlation of the independent and dependent when they (X's and Y) come from different underlying distributions. and the differences di between the ranks of each observation on the two variables are calculated. Spearman's rank correlation coefficient or Spearman's rho.80 or <= -0. Thus. In practice. The value of “r” must be determined from sample values. a simpler procedure is normally used to calculate ρ. On the other hand. such as a linear relationship. Samples should be drawn from a normal population. without making any other assumptions about the particular nature of the relationship between the variables. named after Charles Spearman and often denoted by the Greek letter ρ (rho) or as rs. ρ is simply a special case of the Pearson product-moment coefficient in which two sets of data Xi and Yi are converted to rankings xi and yi before calculating the coefficient.

The construction of business barometer consists of gross national product.Total Numbers : 8 Slope (b) :0. With the help of these business barometers the trend of fluctuations in business conditions are made known and by forecasting a decision can be taken relating to the problem.65 Regression Equation : 154. short term. MB 0025 –Set 2 Page 7 Following are the main methods of business forecasting:1. so they are also known as “Business Barometers” or “Economic Barometers‟. industrial production.16701 Y-Intercept (a) : 154. stock prices. consumer prices. As business indices are the indicators of future conditions. wholesale prices. Ans 3] Business forecasting refers to the analysis of past and present economic conditions with the object of drawing inferences about probable future business conditions. medium term or any specific term. bank deposits etc. The process of forecasting includes the use of statistical and mathematical methods for long term. These quantities may be concerted into relatives on a certain base.66 + 0. information and facts concerning to economic condition of business for past and present are analyzed. To forecast the future. What do you mean by business forecasting? What are the different methods of business forecasting? Describe the effectiveness of time-series analysis as a mode of business forecasting. various data. Describe the method of moving averages.17x 3. The relatives so obtained may be weighted and their . Business Barometers Business indices are constructed to study and analyze the business activities on the basis of which future conditions are predetermined.