Professional Documents
Culture Documents
Malik Rizwan
Contact
0334 5089011
malik_rizwan67@hotmail.com
MATH 361: PROBABILITY AND STATISTICS
Weightage
Quizzes 10%
OHT 1 15% (7th Week)
OHT 2 15% (13th Week)
Project 10 %
Ese 50% (18th Week)
Books
Advance Engineering Mathematics, Erwin Kreyszig, 10th Edition
Probability And Statistics, Murray R. Speigel
Course Content
Graphical Representation Of Data Using Expectation, Moments
Stem-and-leaf Plot, Histogram, box-plot Binomial, Poisson & Hypergeometric
Mean, Mode, Median distributions
EXAMPLE
You Want To Report Velocities Of Cars Passing A Certain Point On The Road (Experiment). Let Us
Consider A Hard And An Easy Way To Do It.
The Hard Way Is Spending Days, Weeks Or Months Measuring The Velocity Of Every Single Car Passing
From That Point And Reporting Your Data In An Endless Excel Spreadsheet.
The Easy Way Is Selecting A Few Say 50 Random Cars As Representative Sample For The Velocities On
A Day Or So And Computing Its Average And Standard Deviation (Statistical Tools) And Reporting
(Representation Of Data) Only Those Two Numbers As An Approximate Description Of This Velocity.
Representing A Quantity By An Average And A Standard Deviation Is A Very Simple Form Of Statistical
Modelling.
What Is P&S And It’s Uses In Engineering World?
Probability Is Used To
Example
Coin Flip In A Toss (Random Experiment) And To Calculate The Probability For A Head
Or Tail (Desired Event).
Testing Materials
Control Of Production Processes
Quality Control Of Production Outputs
Performance Tests Of Systems
Robotics And Automatization In General
Production Planning
Marketing Analysis And So On…….
Representation: Graphical Representation Of Data
Data
A Data Is Series Of Facts From Which Conclusions May Be Drawn.
Raw Data
Values Collected In A Survey Or Experiment That Are Unarranged And Uncategorised Are Called Raw Data.
Raw Data Can Be Arranged Using Different Data Representation Or Arrangement Techniques. E.G.
Velocities For Automotive In Km/H 57, 8, 6, 9, 3, 15 ,3, 57, 9, 9, 10
Variable
In Order To Collect Data, We Need To Measure Some Property Of The Subject Under Consideration. This
Property Is Called Variable.
Gender/Age/Score Of Students In A Class
Velocity Of Cars Passing By Some Point
Tensile Strength Of Different Metal Sheets
Further There Are Two Types Of Variables:
Qualitative
A Variable Is Qualitative If It Is Not Possible For It To Take A Numerical Value Or The Property Measured Can
Only Be Expressed In Words Rather Numbers.
Colour Of Different Specimen Undergoing Same Series Of Chemical Tests
Genders In A Group Of People
A Variety Of Food Whether Served Hot Or Cold
Representation: Graphical Representation Of Data
Quantitative
A Variable Is Quantitative If It Can Take A Numerical Value Or In Simple Words The Measurement Of The
Property Taken Can Be Expressed In The Form Of Numbers E.G. Temperature, Length, Age Etc. It Is
Further Divided In To
Continuous Variable
Such A Variable Which Can Take Any Numerical Value Between A Certain Range Is Called Continuous
Variable.
Tensile Strength Of Materials
Time Required To Do A Certain Task
Mass Of Different Animals In A Zoo
Discrete Variable
Such A Variable That Can Only Take A Specific Numerical Value Out Of Pre-stated Values Or Range Is
Called Discrete Variable And Has Clear Steps Between Its Possible Values.
Ratings Between 1-5 For Different Restaurants
Number Of Red Marbles In A Jar
Outcomes Of A Rolled Dice
Stem-and-leaf Diagram
Absolute Frequency
The Number Of Times A Value Occurs Is Called Its Absolute Frequency
Cumulative Absolute Frequency
The Sum Of The Absolute Frequencies Of The Values Up To The Line Of The Leaf Is Called
Cumulative Absolute Frequency
EXAMPLE
9 12 18 13 19 15 20 23 29 22 39 38 32 35 37 38 38 49 43 42 46 58 59 50 53 55 66 67 65 71
Score
Measures Of Location/ Measure Of Central Tendency
There Are Three Different Measure Of Location Or Central Tendency
Mean
Median
Mode
All These Are Used To Find A Central Point/ Typical Or A Single Value To Describe A Set Of Different Values
And Then Compare With Other Sets Of Data.
Mean
The Mean Of A Data Set Is Equal To The Sum Of The Values In The Data Set Divided By The Total Number
Of Values.
49,56,89,66,39,40
Mean = 49+56+89+66+39+40 = 56.5
6
Mathematically Given By
Measures Of Location/ Measure Of Central Tendency
Median
Median Is Simply The Middle Value Of An Arranged Set Of Data Or It Is A Value In A Data Set That Has
Equal Numbers Of Values Above And Below. If Total Number Of Values Is Odd Then Its Just The Middle
Number When Arranged And For Even Number Of Total Values,
Given By
1/2 (n+1)th Value If n Is Odd
Halfway Between 1/2nth Value And The Following Value If n Is Even
39, 40, 49, 56, 66, 89
Median = 49+56 = 52.5 (Half Of 1/2nth And Following Value)
2
Mode
It Is The Most Frequently Occurring Value Or A Value With Higher Frequency. A Data Set Can Have More
Than One Mode Or None If All Values Have Same Frequency Or Reoccurrence.
49,56,39,49,28,95
Mode Is 49
Measures Of Spread
As Clear From The Name, These Measures Are Used To Check The Spread Of A Data
A: 48 52 60 60 60 68 72
B: 0 10 60 60 60 110 120
Now Both Sets Have Equal Mean=median= Mode=60
In Such Cases We Go For Data Spread. In The Above Case, 'B' Has More Spread Then ‘A’ Due To The
Outliers. On The Other Hand Measures Of Location Are Of Less Help.
Outliers
The Values In A Data Set That Lie Far Away From Other Values Are Called Outliers. Outlier Could Be
More Than One. It May Also Indicate That Something Went Wrong With The Data Collection Process.
A Value More Than A Distance Of 1.5xIQR From Either End Of The Box In A Box Plot Is An Outlier.
$1000, $1000, $900, $5000, $5500, $1200, $1300, $800
$5500 And $5000 Are Outliers
Range
The Spread (Variability) Of The Data Values Can Be Measured By The Range, Given By
R = Highest Value – Lowest Value (Of The Data)
It Ignores Pattern Of Spread But Only Consider The Extreme Values.
39,40,49,56, 66, 89
R = 89 – 39 = 50