Learning and Decision
Making
Lecture 1
Learning and Decision Making
Slide
1
“Anytime that you have
a 50-50 chance of getting something right,
there’s a 90% probability
you’ll get it wrong”
–Andy Rooney
Learning and Decision Making
Slide
2
What is probability?
•
Has roots in games of chance
•
Used to compute the likelihood of a given event's
occurrence
•
Natural tool to model uncertainty
Learning and Decision Making
Slide
3
What is probability?
•
Classical definition of probability of event A
•
N possible events
•
NA ways by which A can occur
NA
P[A] =
N
•
Examples:
•
What is the probability of drawing an even number in a
die?
Learning and Decision Making
Slide
4
What is probability?
•
Classical definition of probability of event A
•
N possible events
•
NA ways by which A can occur
NA
P[A] =
N
•
Examples:
•
What is the probability of drawing a ♤ in a card deck?
Learning and Decision Making
Slide
5
What is probability?
•
Frequentist definition of probability of event A
•
N observed events
•
NA times that event A occurred
NA
P[A] = lim
N!1 N
•
Relative frequency of event A
Learning and Decision Making
Slide
6
What is probability?
•
Subjective definition of probability of event A
•
•
Degree of belief that event A may occur
Example:
•
Probability of Benfica winning the I Liga.
Learning and Decision Making
Slide
7
Formally, …
•
Probability space is a triplet (Ω, 𝓕, P) where:
•
Ω is the sample space
•
𝓕 is the set of events
•
P is a probability measure
Learning and Decision Making
Slide
8
Formally, …
•
Sample space Ω
•
Space of possible outcomes
•
Each and every thing that may happen
•
Example: In a die throw, the possible outcomes are {1, 2, 3, 4, 5, 6}
•
Example: When drawing a card, the possible outcomes are {A♤,
2♤, …, J♢, Q♢, K♢}
Learning and Decision Making
Slide
9
Formally, …
•
Set of events, 𝓕
•
Subsets of Ω that we can “measure”, i.e., assign a
probability
•
Includes the empty set ∅ and the full set Ω
•
Examples of events (die throw):
•
Drawing an even number: {2, 4, 6}
•
Drawing a number larger than 3: {4, 5, 6}
•
Drawing no number: ∅
Learning and Decision Making
Slide
10
Formally, …
•
Set of events, 𝓕
•
Subsets of Ω that we can “measure”, i.e., assign a probability
•
Includes the empty set ∅ and the full set Ω
•
Examples of events (card draw):
•
Drawing a spade: {A♤, 2♤, 3♤, 4♤, 5♤, 6♤, 7♤, 8♤,
9♤, 10♤, J♤, Q♤, K♤}
•
Drawing a figure: {J♤, Q♤, K♤, J♧, Q♧, K♧, J♡, Q♡,
K♡, J♢, Q♢, K♢}
Learning and Decision Making
Slide
11
Formally, …
•
Probability, P
•
“Measures” each event in 𝓕
•
Axioms of probability:
•
P(A) ≥ 0
•
P(Ω) = 1
•
Given disjoint events A1, …, An
P[A1 [ . . . [ An ] =
n
X
P[Ai ]
i=1
Learning and Decision Making
Slide
12
Can you show:
•P(A) ≥ 0
•P(Ω) = 1
• P[A1 [ . . . [ An ] =
n
X
P[Ai ]
i=1
•
P(∅) = 0
•
Given A, B with A ⊂ B, P[A [ B] P[A] + P[B]
Learning and Decision Making
Slide
13
Conditional probability
•
Conditional probability of event A given B:
•
Probability of A occurring, given that B occurred
P[A \ B]
P[A | B] =
P[B]
… assuming P(B) > 0.
•
Verifies all axioms of probability
Learning and Decision Making
Slide
14
Conditional probability
•
Important properties:
•
Independent events:
•
Law of total probability: given disjoint events A1, …, An, with
P[A | B] = P[A]
A1 U … U An = Ω,
P[B] =
n
X
i=1
P[B | Ai ] · P[Ai ]
Learning and Decision Making
Slide
15
Conditional probability
•
Important properties:
•
Bayes theorem:
likelihood prior
P[B | A]P[A]
P[A | B] =
P[B]
evidence
hypothesis observations
Learning and Decision Making
Slide
16
Random variable
•
Given a probability space (Ω, 𝓕, P),
•
Sometimes Ω is a cumbersome set to work with
•
It would be more convenient to instead work in some
other space, more mathematically convenient (for
example, R)
•
A random variable X is a map X : Ω ⟶ E, where E is some
convenient space (usually R)
•
We usually work with random variables, ignoring the
underlying probability space
Learning and Decision Making
Slide
17
Random variable
•
We write P(X = x) to represent
P[X = x] = P[{! 2 ⌦ : X(!) = x}]
•
If the r.v. takes values in a discrete set, we call it a discrete
random variable
•
If the r.b. takes values in a continuous set, we call it a
continuous random variable
Learning and Decision Making
Slide
18
Expectation
•
Given a r.v. X and a function f : X ⟶ R, the expectation of f is
the quantity
X
E[f (X)] =
f (x)P[X = x]
x
•
We can have conditional expectations:
E[f (X) | Y = y] =
X
x
f (x)P[X = x | Y = y]
Learning and Decision Making
Slide
19
Properties
•
Law of total probability with expectations:
X
E[f (X)] =
E[f (X) | Y = y]P[Y = y]
y
•
Conditioning “eliminates” expectation:
E[f (X) | X = x] = f (x)
Learning and Decision Making
Slide
20
Some useful results
•
Law of large numbers: Given i.i.d. real-valued r.v.s X1, …, Xn,
such that
E[X1 ] = E[X2 ] = . . . = E[XN ] = µ
then
N
X
1
a.s.
lim
Xi = µ
N !1 N
i=1
Learning and Decision Making
Slide
21
Some useful results
•
Jensen’s inequality: Given a real-valued r.v. X and a convex
function f, then
f (E[X]) E[f (X)]
x1
x2
Learning and Decision Making
Slide
22
© Copyright 2026 Paperzz