Professional Documents
Culture Documents
Introduction
• Focus on causal models
• So far we measured how much of effort is needed, what amount of
time is needed, and what software tools are needed to build up a
quality product, to fix up bugs and so on.
• One more ultimate goal for software metrics is to help software
professionals (be they developers, testers, managers, or maintainers)
make decisions under uncertainty.
• A software is said to have zero defect only if it is fully completed and
well used by customers.
1. FROM CORRELATION AND REGRESSION TO CAUSAL MODELS
• Correlation is a term that is a measure of the strength of a linear
relationship between two quantitative variables (e.g., height, weight).
• Positive correlation is a relationship between two variables in which both variables
move in the same direction. This is when one variable increases while the other
increases and visa versa. For example, positive correlation may be that the more
you exercise, the more calories you will burn.
• Whilst negative correlation is a relationship where one variable increases as the
other decreases, and vice versa.
• with a reasonably high level of accuracy, the values of one variable is
based on the values of the other then, the relationship between the two
variables is described as a strong correlation. (drying of clothes)
• A weak correlation is one where on average the values of one variable
are related to the other, but there are many exceptions. (purchase
intention on seeing an advertisement)
• R and P value in correlation:
• R-value (correlation co-efficient) defines the correlation between
two variables (positive, negative or zero). The value ranges from -1 to
+1.
• p-value tells us if the result of an experiment is statistically
observed data.
• The correlation coefficient are plotted using scatter diagram
Need for Correlation:
the analysis will change if X and Y are swapped. With correlation, the X and Y
variables, such as height and weight or blood pressure and heart rate.
equation.
• Key advantage of correlation
• Correlation is a more concise (single value) summary of the relationship
between two variables than regression.
Equation No Yes
• You want to predict blood pressure for different doses of a drug - Regression
Causal Relationship:
• Causality means that there is a clear cause-effect relationship between two
variables.
• In statistics and probability theory, the Bayes’ theorem (also known as the
underlying truth of a data generating process than the prior probability since
test results by taking into consideration how likely any given person is
Cause-A
Effect/Evidence-B
Directed acyclic graph representing two independent possible causes of a Defect removable efficiency
Advantages of Bayesian network
• Bayesian Networks offer a graphical representation that is reasonably
interpretable and easily explainable.