Probability Distributions
Ananda V. Mysore
SJSU
San José State University | A. Mysore | Spring2009
Measurement Error and Statistical Analysis
Measurement error can be categorized into two
major types, systematic and random.
For which of these types is statistical analysis
more useful, and why?
Image(s) from Introduction to Engineering Experimentation by A. J. Wheeler and A. R. Ganji
© 2004 Pearson Education, Inc., Upper Saddle River, NJ. All rights reserved.
San José State University | A. Mysore | Spring2009
2
Probability, Distributions, and Random Variables
Probability P is a quantitative expression for the likelihood of
occurrence of some event.
The probability of a particular event is the number of times
the event is occurs divided by the total number of trials.
A probability distribution is a mathematical model that
expresses how the probability of an event varies according to
the value of some variable.
A random variable is one for which its numerical value
follows a probability distribution, while still being subject to
variability.
Random variables may be categorized as continuous (e.g.
temperature) or discrete (e.g. outcome of rolling dice).
San José State University | A. Mysore | Spring2009
3
Common Probability Laws
For any event A, 0 ≤ P{A} ≤ 1
If A is the (exclusive) complement of A, P{A} + P{A} = 1
If A and B are independent events, P{A and B} = P{A}P{B}
If A and B are mutually exclusive, P{A or B} = P{A} + P{B}
If A and B are not mutually exclusive it is necessary to subtract
the joint probability, and P{A or B} = P{A} + P{B} – P{A}P{B}
If B has already known to have occurred, the conditional
probability that A will occur is P{A | B} = P{A and B} / P{B}
San José State University | A. Mysore | Spring2009
4
Measurements, Samples, and Populations
A measurement for a random variable x
produces a numerical value for that variable.
A sample is a subset of a population for which
something is to be quantified (i.e. measured).
In statistical analysis, a sample usually implies more
than a single measurement (e.g. n = 5 measurements
taken from a population of N = 1,000).
The population comprises the entire collection
of possible measurements “whose properties
are under consideration and about which some
generalizations are to be made.”
Quoted text from Introduction to Engineering Experimentation by A. J. Wheeler and A. R. Ganji
© 2004 Pearson Education, Inc., Upper Saddle River, NJ. All rights reserved.
San José State University | A. Mysore | Spring2009
5
Histogram and Cumulative Frequency Plot
Individual values of a variable are sorted
into intervals, then stacked to count the
number of observations that fall into
each interval.
Intervals of constant span are almost
always preferred.
A good default for choosing the number
of bins is the square root of the number
of total observations n.
Histograms may be used for both
continuous (e.g. layer thickness) and
discrete variables (e.g. number of
defects).
The cumulative frequency plot is a
related display that shows what fraction
of the observations fall under a given
value.
Images from Introduction to Statistical Quality Control, 5th Ed.
by Douglas C. Montgomery, ©2005 John Wiley & Sons, Inc.
San José State University | A. Mysore | Spring2009
6
Statistical Inference
Statistical inference draws conclusions about a population
based on a sample selected from that population.
A random sample of size n is a subset of the population,
which has size N.
(Technically, random samples from finite populations
must be drawn with replacement to ensure equal
selection probability.)
Images from Introduction to Statistical Quality Control, 5th Ed.
by Douglas C. Montgomery, ©2005 John Wiley & Sons, Inc.
San José State University | A. Mysore | Spring2009
7
Central Tendency & Variability in Samples
Sample Mean:
Sample Variance:
n
n
∑ xi
x=
2
(
x
−
x
)
∑ i
i =1
s2 =
n
Deviation (from mean):
n
2
(
x
−
x
)
∑ i
s=
Sample Range:
n −1
Sample Standard Deviation:
di = xi − x
i =1
i =1
n −1
R = xmax − xmin
San José State University | A. Mysore | Spring2009
8
Central Tendency & Variability in Populations
Images from Introduction to Statistical Quality Control, 5th Ed.
by Douglas C. Montgomery, ©2005 John Wiley & Sons, Inc.
San José State University | A. Mysore | Spring2009
9
Discrete and Continuous Probability Distributions
In a discrete distribution (left), the probability P that
a random variable x has the specific value xi has a
discrete value.
In a continuous distribution (right), the probability P
of occurrence for a random variable x is expressed in
terms of an interval.
P{x = xi } = p{xi }
Images from Introduction to Statistical Quality Control, 5th Ed.
by Douglas C. Montgomery, ©2005 John Wiley & Sons, Inc.
b
P{a ≤ x ≤ b} = ∫ f {x}dx
a
San José State University | A. Mysore | Spring2009
10
Mean, Variance, and Probability Distributions
Continuous Distribution
Mean:
Mean:
∞
µ = ∑ xi p{xi }
+∞
µ = ∫ xf {x}dx
−∞
i =1
Variance:
+∞
Variance:
σ = ∫ ( x − µ ) f {x}dx
2
Discrete Distribution
2
−∞
Standard Deviation:
∞
σ = ∑ ( xi − µ ) 2 p{xi }
2
i =1
σ = σ2
San José State University | A. Mysore | Spring2009
11
Binomial Distribution
Outcomes are discrete
success or failure.
Probability of success p.
Number of successes x.
Number of independent
trials n.
An example scenario
would be to find
probability of
encountering x number
of non-conforming
items in a random
sample of n items.
Images from Introduction to Statistical Quality Control, 5th Ed.
by Douglas C. Montgomery, ©2005 John Wiley & Sons, Inc.
n
n!
=
x x!(n − x)!
San José State University | A. Mysore | Spring2009
12
Poisson Distribution
Number of defects per
unit (or unit area, unit
volume, etc.).
Parameter λ determines
the shape of the
distribution.
Gives the probability that
x has a particular “defect”
count
Useful in cases for
example in which µ is
known and probability
that x ≤ b is of interest:
e −4 µ x
P{x ≤ b} = ∑
x =0 x!
b
Images from Introduction to Statistical Quality Control, 5th Ed.
by Douglas C. Montgomery, ©2005 John Wiley & Sons, Inc.
San José State University | A. Mysore | Spring2009
13
Normal Distribution
Relevant for “normal”
random variables x.
Most common and arguably
most important distribution in
applied statistics.
Abbreviated notation N(µ,σ²).
Images from Introduction to Statistical Quality Control, 5th Ed.
by Douglas C. Montgomery, ©2005 John Wiley & Sons, Inc.
San José State University | A. Mysore | Spring2009
14
Standard Normal Distribution
A standard normal distribution converts an N(µ,σ²) random
variable to an N(0,1) random variable. Why is this useful?
z=
x−µ
σ
The probability that the normal random variable x is less that or
equal to a threshold a can be determined from the solution to the
following integral expression.
1
P{x ≤ a} = ∫
e
−∞
σ 2π
a
1 x−µ
−
2 σ
2
dx = ∫
a
−∞
1
e
2π
z2
−
2
dz
Results are tabulated (thankfully) based on a single input, z, in
what is called a cumulative standard normal distribution table.
Also: P{x ≥ a} = 1 − P{x ≤ a}
San José State University | A. Mysore | Spring2009
15
Central Limit Theorem
The sum y of n independent random variables x has a distribution that is
approximately normal, regardless of the distribution of each individual
random variable xi in the sum.
The approximation improves as n increases.
In many circumstances this theorem is often used to justify the assumption
of a normal distribution regardless of underlying distribution
Images from Introduction to Statistical Quality Control, 5th Ed.
by Douglas C. Montgomery, ©2005 John Wiley & Sons, Inc.
San José State University | A. Mysore | Spring2009
16
Tabulated Standard Normal Distribution Values
Images from Introduction to Statistical Quality Control, 5th Ed.
by Douglas C. Montgomery, ©2005 John Wiley & Sons, Inc.
San José State University | A. Mysore | Spring2009
17
What’s the Probability of…
San José State University | A. Mysore | Spring2009
18
Discrete Random Variables w/ N “Equal Chances”
Real measurements are generally discrete
measurements.
For discrete random variables with N equally-likely
values, the following is true:
1
p{xi } =
N
N
N
∑ xi
µ=
i =1
N
∑ ( xi − µ )
σ2 =
i =1
N
N
2
(
x
−
µ
)
∑ i
2
σ=
i =1
N
San José State University | A. Mysore | Spring2009
19
© Copyright 2026 Paperzz