mean field games and mean field type control theory

INTRODUCTION
GENERAL PRESENTATION
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
DIFFERENT POPULATIONS
COALITIONS
1/110
MEAN FIELD GAMES AND MEAN FIELD TYPE
CONTROL THEORY
Alain Bensoussan
1
1 International
Jens Frehse
2
Phillip Yam
3
Center for Decision and Risk Analysis
Jindal School of Management, University of Texas at Dallas
and Department of Systems Engineering and Engineering department,City
University Hong Kong
2 Institute
3 Department
for Applied Mathematics, Bonn University
of Statistics,The Chinese University of Hong Kong
August 16, 2013
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, Ha
INTRODUCTION
GENERAL PRESENTATION
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
DIFFERENT POPULATIONS
COALITIONS
OUTLINE I
1
2
3
4
2/110
INTRODUCTION
GENERAL PRESENTATION
MODEL AND ASSUMPTIONS
DEFINITION OF THE PROBLEMS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL
PROBLEM
HJB-FP APPROACH
STOCHASTIC MAXIMUM PRINCIPLE
TIME CONSISTENCY APPROACH
THE MEAN VARIANCE PROBLEM
TIME CONSISTENCY
DIFFERENT POPULATIONS
GENERAL CONSIDERATIONS
MULTI-CLASS AGENTS
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, Ha
INTRODUCTION
GENERAL PRESENTATION
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
DIFFERENT POPULATIONS
COALITIONS
OUTLINE II
MAJOR PLAYER
5
3/110
COALITIONS
SYSTEM OF HJB-FP EQUATIONS
APPROXIMATE NASH EQUILIBRIUM FOR LARGE
COMMUNITIES
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, Ha
INTRODUCTION
GENERAL PRESENTATION
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
DIFFERENT POPULATIONS
COALITIONS
ACKNOWLEDGEMENTS
Research supported by the Research Grants Council of Hong Kong
Special Administrative Region (City U 500113)
4/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, Ha
INTRODUCTION
GENERAL PRESENTATION
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
DIFFERENT POPULATIONS
COALITIONS
GENERAL COMMENTS
Mean eld games reduces to a standard control problem and
an equilibrium (xed point)
Dynamic Programming: coupled HJB and FP equations
Mean eld type control is a non standard control problem.
Stochastic Maximum Principle ( time inconsistency)
Time inconsistency
Major Playor
Coalitions
5/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, Ha
INTRODUCTION
GENERAL PRESENTATION
MODEL AND ASSUMPTIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
DEFINITION OF THE PROBLEMS
DIFFERENT POPULATIONS
COALITIONS
MODEL
Probability space Ω, A , P , ltration F t generated by an
n -dimensional standard Wiener process w (t ).
The state space is R n and the control space is R d .
( , , ) : R n × L1 (R n ) × R d → R n ;
g x m v
f
(x , m, v ) : R n × L1 (R n ) × R d → R ;
σ (x ) : R n → L (R n ; R n )
( , ) : R n × L1 (R n ) → R
h x m
σ (x ), σ −1 (x ) bounded
m
6/110
(1)
(2)
is a probability density on R n
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, Ha
INTRODUCTION
GENERAL PRESENTATION
MODEL AND ASSUMPTIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
DEFINITION OF THE PROBLEMS
DIFFERENT POPULATIONS
COALITIONS
STATE EQUATION
7/110
( ) ∈ C (0, T ; L1 (R n )) given . Feedback control v (x , t ).
m t
state of the system
dx
x
= g (x (t ), m(t ), v (x (t ))dt + σ (x (t ))dw (t )
(3)
(0) = x0
is a random variable independent of the Wiener process,
probability density m = m(0).
To the pair v (.), m(.) we associate the control objective
x0
0
( (.), m(.)) = E [
Z T
J v
f
(x (t ), m(t ), v (x (t )) dt + h(x (T ), m(T )]
0
Alain Bensoussan, Jens Frehse, Phillip Yam
(4)
Dierential games, Nash equilibrium, Mean Field, Ha
INTRODUCTION
GENERAL PRESENTATION
MODEL AND ASSUMPTIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
DEFINITION OF THE PROBLEMS
DIFFERENT POPULATIONS
COALITIONS
MEAN FIELD GAME
8/110
Find a pair v̂ (.),m(.) such that, denoting by x̂ (.) the solution
of
d x̂
x̂
= g (x̂ (t ), m(t ), v̂ (x̂ (t )))dt + σ (x̂ (t ))dw (t )
(5)
(0) = x0
then
( ) is the probability distribution of x̂ (t ), ∀t ∈ [0, T ]
m t
J (v̂
(6)
(.), m(.)) ≤ J (v (.), m(.))∀v (.)
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, Ha
INTRODUCTION
GENERAL PRESENTATION
MODEL AND ASSUMPTIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
DEFINITION OF THE PROBLEMS
DIFFERENT POPULATIONS
COALITIONS
MEAN FIELD TYPE CONTROL PROBLEM
9/110
For any feedback v (.), let x (t ) = xv (.) (t ) be the solution of (3)
with m(t ) =probability distribution of xv (.) (t ). So (3) becomes
a McKean-Vlasov equation. Denote by mv (.) (t ) =probability
distribution of xv (.) (t ), we thus have
dx
x
v (.) = g (xv (.) (t ), mv (.) (t ), v (xv (.) (t ))dt + σ (xv (.) (t ))dw (t ) (7)
(0) = x0
v (.) (t ) = probability distribution of xv (.) (t )
m
(8)
Find v̂ (.) such that
J (v̂
(.), mv̂ (.) (.)) ≤ J (v (.), mv (.) (.)) ∀v (.)
Alain Bensoussan, Jens Frehse, Phillip Yam
(9)
Dierential games, Nash equilibrium, Mean Field, Ha
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
NOTATION
Set
1
2
and the 2nd order dierential operator
( ) = σ (x )σ ∗ (x )
(10)
ϕ(x ) = −tr a(x )D 2 ϕ(x )
(11)
a x
A
The Dual Operator is
∗
A
ϕ(x ) = −
n
∂2
(akl (x )ϕ(x ))
k ,l =1 ∂x ∂x
∑
k
10/110
Alain Bensoussan, Jens Frehse, Phillip Yam
(12)
l
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
GATEAUX DIFFERENTIABILITY
11/110
Assume that the
m
→ f (x , m, v ), g (x , m, v ), h(x , m)
are dierentiable in m ∈ L
2
Notation
d
d
θ
(R n )
(13)
∂f
(x , m, v )(ξ ) to represent the derivative, so that
∂m
Z
f
(x , m + θ m̃, v )|θ =0 =
∂f
(x , m, v )(ξ )m̃(ξ ) d ξ
∂
R m
Alain Bensoussan, Jens Frehse, Phillip Yam
n
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
OBJECTIVE FUNCTIONAL I
12/110
Consider a feedback v (x ) and the corresponding trajectory
dened by (7), the probability distribution mv (.) (t ) of xv (.) (t )
is solution of the FP equation
∂ mv (.)
+ A∗ mv (.) + div (g (x , mv (.) , v (x ))mv (.) ) = 0
∂t
mv (.) (x , 0) = m0 (x )
(14)
and the objective functional J (v (.), mv (.) ) can be expressed as
follows
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
OBJECTIVE FUNCTIONAL II
13/110
( (.), mv (.) (.)) =
Z TZ
J v
0
Z
+
R
R
f
n
(x , mv (.) (x ), v (x ))mv (.) (x )dxdt + (15)
( , v (.) (x , T ))mv (.) (x , T )dx
h x m
n
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
FURTHER DIFFERENTIABILITY I
Consider an optimal feedback v̂ (x ) and the corresponding
probability density mv̂ (.) (x ) = m(x ).Let then v (.) be any
feedback and v̂ (x ) + θ v (x ). We want to compute
v̂ (.)+θ v (.) (x )
dm
d
θ
|θ =0 = m̃(x )
We can check that
14/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
FURTHER DIFFERENTIABILITY II
∂ m̃
+ A∗ m̃ + div (g (x , m, v̂ (x ))m̃) +
∂t
(16)
Z
+div([
15/110
∂g
∂g
(x , m, v̂ (x ))(ξ )m̃(ξ )d ξ +
(x , m, v̂ (x ))v (x )]m(x )) = 0
∂m
∂v
m̃ (x , 0) = 0
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
COST DIFFERENTIABILITY I
16/110
dJ (v̂
(.) + θ v (.), mv̂ (x )+θ v (x ) (.))
|θ =0 =
dθ
Z TZ
0
Z TZ
0
R
R
f
(17)
(x , m, v̂ (x ))m̃(x )dtdx +
n
Z
n
∂f
(x , m, v̂ (x ))(ξ )m̃(ξ )m(x )dtd ξ dx +
∂
R m
Z TZ
∂f
(x , m, v̂ (x ))v (x )m(x )dtdx +
0
R ∂v
n
n
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
COST DIFFERENTIABILITY II
17/110
Z
+
Z
+
R
R
( , ( ))m̃(x , T )dx
h x m T
n
Z
n
∂h
(x , m(T ))(ξ )m̃(ξ , T )m(x , T )d ξ dx
∂
R m
n
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
FUNCTION
u(x , t )
Introduce the function u (x , t ) solution of
Z
∂u
∂g
−
+ Au − g (x , m, v̂ (x )).Du −
Du (ξ ).
(ξ , m, v̂ (ξ ))(x )m(ξ )d ξ
∂t
∂m
R
Z
∂f
= f (x , m, v̂ (x ))+
(ξ , m, v̂ (ξ ))(x )m(ξ )d ξ
R ∂m
n
n
(18)
Z
( , ) = h(x , m(T ))+
u x T
∂h
(ξ , m(T ))(x )m(ξ , T )d ξ
∂
R m
n
18/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
NECESSARY CONDITION I
19/110
(.) + θ v (.), mv̂ (x )+θ v (x ) (.))
|θ =0 =
dθ
Z TZ
∂f
(x , m, v̂ (x ))v (x )m(x )dtdx +
0
R ∂v
Z TZ
∂g
(x , m, v̂ (x ))v (x )m(x )dtdx
+
Du (x ).
∂v
0
R
dJ (v̂
n
n
Since v̂ (.) is optimal, this expression must vanish for any v (.).
Hence necessarily
∂f
∂g ∗
(x , m, v̂ (x )) +
(x , m, v̂ (x ))Du (x ) = 0
∂v
∂v
Alain Bensoussan, Jens Frehse, Phillip Yam
(19)
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
REWRITING I
It follows that ( at least with convexity assumptions)
( ) = v̂ (x , m, Du (x ))
(20)
(x , m, v̂ (x )) + g (x , m, v̂ (x )).Du = H (x , m, Du )
(21)
v̂ x
We note that
f
Z
∂f
∂g
(ξ , m, v̂ (ξ ))(x ) + Du (ξ ).
(ξ , m, v̂ (ξ ))(x )]m(ξ )d ξ =
∂m
R ∂m
[
n
(22)
Z
∂H
(ξ , m, Du (ξ ))(x )m(ξ )d ξ
R ∂m
n
20/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
REWRITING II
21/110
( , , ( )) = g (x , m, v̂ (x , m, Du (x ))) = G (x , m, Du )
g x m v̂ x
Alain Bensoussan, Jens Frehse, Phillip Yam
(23)
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
HJB-FP SYSTEM
We can nally write the system of HJB-FP P.D.E.
−
Z
∂H
∂u
+ Au = H (x , m, Du ) +
(ξ , m, Du (ξ ))(x )m(ξ )d ξ
∂t
∂m
R
Z
∂h
u (x , T ) = h (x , m (T )) +
(ξ , m(T ))(x )m(ξ , T )d ξ (24)
∂
R m
n
n
∂m
+ A∗ m + div (G (x , m, Du )m) = 0
∂t
m (x , 0) = m0 (x )
22/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
SPIKE MODIFICATION I
Instead of changing v̂ (x , t ) into v̂ (x , t ) + θ v (x , t ) one can use
a spike modication
v̄ (x , s ) = v
( , )
v̂ x s
s
s
∈ (t , t + ε)
6∈ (t , t + ε)
similar to the proof of Pontryagin maximum principle.
One proves directly that v̂ (x , t ) minimizes the Lagrangian in v ,
instead of simply being a stationary point
23/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
NOTATION
24/110
From the optimal feedback v̂ (x ) and the probability distribution
( ) we construct stochastic processes X (t ) ∈ R n , V (t ) ∈ R d ,
n
n n
Y (t ) ∈ R , Z (t ) ∈ L (R ; R ) which are adapted, dened as follows
m t
( ) = x̂ (t ),
X t
( ) = PX (t )
m t
We next dene
Y
(t ) = Du (X (t ), t ),
( ) = v̂ (X (t ), PX (t ) , Y (t ))
V t
and nally
( ) = D 2 u σ (X (t ), t )
Z t
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
STOCHASTIC MAXIMUM PRINCIPLE I
=g (X (t ), PX (t ) , V (t ))dt + σ (X (t ))dw (t )
∂H
−dY =
(X (t ), PX (t ) , V (t ), Y (t ))+
(25)
∂x
∂ 2H
∂ σ (X (t )) ∗
Z (t )
dt
E[
(X (t ), PX (t ) , V (t ), Y (t ))](X (t )) + tr
∂ x∂ m
∂x
− Z (t )dw (t )
(26)
2
∂ h(X (T ), PX (T ) )
∂ h
X (0) = x0 , Y (T ) =
+E[
(X (T ), PX (T ) ] (X (T )
∂x
∂ x∂ m
dX
25/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
STOCHASTIC MAXIMUM PRINCIPLE I
( ) minimizes H (X (t ), PX (t ) , v , Y (t )) in v
V t
(27)
When we write
∂ 2f
(X (t ), PX (t ) , V (t ))](X (t ))
∂ x∂ m
∂f
we mean that we take the function
(ξ , m, v )(x ), where ξ and v
∂m
are parameters and we take the gradient in x , denoted by
∂ 2f
(ξ , m, v )(x ).
∂ x∂ m
We then consider ξ = X (t ), v = V (t ) and take the expected value
∂ 2f
E
(X (t ), m, V (t ))(x ).
∂ x∂ m
E
26/110
[
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
STOCHASTIC MAXIMUM PRINCIPLE II
We take m = PX (t ) (note that it is a deterministic quantity) and
∂ 2f
thus get E
(X (t ), PX (t ) , V (t ))(x ).
∂ x∂ m
Finally, we take the argument x = X (t ).
27/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
POSSIBLE CONFUSION I
∂f
To emphasize the diculty of confusion, consider (x , m, v ). If
∂x
we want to take the derivative with respect to m, then we should
consider x , v as parameters, so change the notation to ξ and
∂ 2f
compute
(ξ , m, v )(x ).
∂ m∂ x
Clearly
∂ 2f
∂ 2f
(ξ , m, v )(x ) 6=
(ξ , m, v )(x )
∂ m∂ x
∂ x∂ m
28/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
PARTICULAR CASE I
29/110
We discuss here the following particular mean eld type problem
dx
x
= g (x (t ), v (x (t ))dt + σ (x (t ))dw (t )
(28)
(0) = x0
Z T
( (.), m(.)) = E [
J v
+
Z T
f
(x (t ), v (x (t )) dt + h(x (T ))]
0
(
(29)
( ))dt + Φ(Ex (T ))
F Ex t
0
We consider a feedback v (x , t ) and m(t ) = mv (.) (t ) is the
probability density of xv (.) (t ) the solution of (28).
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
PARTICULAR CASE II
The functional becomes J (v (.), mv (.) (.)). It is clearly a particular
case of mean eld type control problem.
30/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
NOTATION I
31/110
We have indeed
Z
f
(x , m, v ) = f (x , v ) + F (
ξ m(ξ )d ξ )
Z
( , ) = h(x ) + Φ(
h x m
ξ m(ξ )d ξ )
Therefore
Z
( , , ) = H (x , q ) + F (
H x m q
ξ m(ξ )d ξ )
where
( , ) = inf (f (x , v ) + q .g (x , v ))
H x q
v
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
NOTATION II
32/110
Considering v̂ (x , q ) which attains the inmum in the denition of
H (x , q ) and setting
( , ) = g (x , v̂ (x , q ))
G x q
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
HJB-FP SYSTEM I
The coupled system HJB-FP becomes , see (24),
−
∂u
+ Au = H (x , Du ) + F (
∂t
Z
Z
∂F
( ξ m(ξ )d ξ )xk
k ∂ xk
Z
Z
∂Φ
( ξ m(ξ )d ξ )xk
u (x , T ) = h (x ) + Φ(
ξ m(ξ )d ξ ) + ∑
k ∂ xk
ξ m(ξ )d ξ ) + ∑
(30)
∂m
+ A∗ m + div (G (x , Du )m) = 0
∂t
m (x , 0) = δ (x − x0 )
33/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
REWRITING I
We can reduce slightly this problem, using the following step:
introduce the vector function Ψ(x , t ; s ), t < s , solution of
−
∂Ψ
+ AΨ − D Ψ.G (x , Du ) = 0, t < s
∂t
Ψ(x , s ; s ) = x
(31)
then
Z
ξ m(ξ , t )d ξ = Ψ(x0 , 0; t )
so (30) becomes
34/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
REWRITING II
−
∂u
∂F
+ Au = H (x , Du ) + F (Ψ(x0 , 0; t )) + ∑
(Ψ(x0 , 0; t ))xk
∂t
∂
k xk
∂Φ
(Ψ(x0 , 0; T ))xk (32)
∂
k xk
( , ) = h(x ) + Φ(Ψ(x0 , 0; T )) + ∑
u x T
35/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
PRECOMMITMENT
We now have the system (31), (32). We can also look at u (x , t ) as
the solution of a non-local HJB equation, depending on the initial
state x . The optimal feedback
0
( , ) = v̂ (x , Du (x , t ))
v̂ x t
depends also on x . Note that it does not depend on any
intermediate state. This type of optimal control is called a
pre-commitment optimal control.
0
36/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
GAME CONCEPT
In [8], the authors introduce a new concept, in order to dene an
optimization problem among feedbacks which do not depend on the
initial condition.
A feedback will be optimal only against spike changes, but not
against global changes.
Game interpretation. Players are attached to small periods of
time ( eventually to each time, in the limit). Therefore, if one
uses the concept of Nash equilibrium, decisions at dierent
times correspond to decisions of dierent players, and thus out
of reach.
37/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
NOTATION I
In the spirit of Dynamic Programming, and the invariant
embedding idea, we consider a family of control problems indexed
by the initial conditions, and we control the system using feedbacks
only. So if v (x , s ) is a feedback, we consider the state equation
( ) = xxt (s ; v (.))
x s
dx
= g (x (s ), v (x (s ), s ))ds + σ (x (s ))dw (t )
(33)
( )=x
x t
and the payo
38/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
NOTATION II
39/110
Z T
J
x ,t (v (.)) = E [
+
Z T
t
t
(
f
(x (s ), v (x (s ), s )) ds + h(x (T )] +
(34)
( ))ds + Φ(Ex (T ))
F Ex s
Consider a specic control v̂ (x , s ) which will be optimal . We
dene x̂ (.) to be the corresponding state, solution of (33) and set
( , ) = Jx ,t (v̂ (.))
V x t
Alain Bensoussan, Jens Frehse, Phillip Yam
(35)
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
SPIKE MODIFICATION I
40/110
We make a spike modication and dene
v̄ (x , s ) = v
( , )
v̂ x s
t
< s < t +ε
s > t +ε
where v is arbitrary. The idea is to evaluate Jx ,t (v̄ (.)) and to
express that it is larger than V (x , t ). We introduce the function
Ψ(x , t ; s ) = E x̂xt (s ), t < s
which is the solution of
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
SPIKE MODIFICATION II
41/110
−
∂Ψ
+ AΨ − D Ψ.g (x , v̂ (x , t )) = 0, t < s
∂t
Ψ(x , s ; s ) = x
(36)
We note the important property
( ) = E Ψ(x (t + ε), t + ε; s ), ∀s ≥ t + ε
E x̄ s
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
COMPARISON I
Therefore
x ,t (v̄ (.)) = E [
Z t +ε
J
+
Z T
t +ε
f
t
(x (s ), v )ds +
x (t +ε),t +ε (s ), v̂ (x̂x (t +ε),t +ε (s ), s ))ds + h(x̂x (t +ε),t +ε (T ))]
f (x̂
+
Z t +ε
t
(
( ))ds +
F Ex s
Z T
t +ε
( Ψ(x (t + ε), t + ε; s ))ds +
F E
+Φ(E Ψ(x (t + ε), t + ε; T )
42/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
COMPARISON I
The next point is to compare F (E Ψ(x (t + ε), t + ε; s )) with
EF (Ψ(x (t + ε), t + ε; s )). This is a simple application of Ito's
formula
EF
(Ψ(x (t + ε), t + ε; s )) − F (E Ψ(x (t + ε), t + ε; s )) =
ε ∑ aij (x ) ∑
ij
∂
2
F
kl ∂ xk ∂ xl
(Ψ(x , t ; s ))
(37)
∂ Ψk ∂ Ψl
(x , t ; s )) + 0(ε)
∂ xi ∂ xj
We can similarly compute the dierence
E
43/110
Φ(Ψ(x (t + ε), t + ε; T )) − Φ(E Ψ(x (t + ε), t + ε; T ).
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
EVALUATION OF THE PAYOFF I
44/110
J
x ,t (v̄ (.)) = EV (x (t + ε), t + ε) + ε[f (x , v ) + F (x )−
− ∑ aij (x )
ij
Z T
t
∑∂
kl
∂ 2F
∂ Ψk ∂ Ψl
(Ψ(x , t ; s ))
(x , t ; s ))ds −
xk ∂ xl
∂ xi ∂ xj
∂ 2Φ
∂ Ψk ∂ Ψl
(Ψ(x , t ; T ))
(x , t ; T ))] + 0(ε)
∂ xi ∂ xj
kl ∂ xk ∂ xl
− ∑ aij (x ) ∑
ij
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
HJB EQUATION I
45/110
−
Z T
− ∑ aij (x )[
ijkl
+
t
∂V
+ AV = H (x , DV ) + F (x )−
∂t
∂ 2F
∂ Ψk ∂ Ψl
(Ψ(x , t ; s ))
(x , t ; s ))ds +
∂ xk ∂ xl
∂ xi ∂ xj
(38)
∂ 2Φ
∂ Ψk ∂ Ψl
(Ψ(x , t ; T ))
(x , t ; T ))]
∂ xk ∂ xl
∂ xi ∂ xj
( , ) = h(x ) + Φ(x )
V x T
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
FUNCTION
46/110
Ψ
I
Moreover the equation for Ψcan be written as
−
∂Ψ
+ AΨ − D Ψ.G (x , DV ) = 0, t < s
∂t
Ψ(x , s ; s ) = x
(39)
The optimal feedback obtained from the system (38), (39) is time
consistent.
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
STATEMENT OF THE PROBLEM I
The mean-variance problem is the extension in continuous time for
a nite horizon of the Markowitz optimal portfolio theory. Without
referring to the background of the problem, it can be stated as
follows, mathematically. The state equation is
dx
x
= rxdt + xv .(α dt + σ dw )
(40)
(0) = x0
( ) is scalar, r is a positive constant, α is a vector in R m and σ is
a matrix in L (R d ; R m ). All can depend on time and they are
deterministic quantities. v (t ) is the control in R m .
x t
We note that, conversely to our general framework, the control
aects the volatility term. The objective function is
47/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
STATEMENT OF THE PROBLEM II
γ
( (.)) = Ex (T ) − var(x (T ))
J v
2
(41)
which we want to maximize.
48/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
MEAN FIELD TYPE CONTROL PROBLEM I
Because of the variance term, the problem is not a standard
stochastic control problem. It is a mean eld type control problem,
since one can write
γ
γ
( )2 ) + (Ex (T ))2
(42)
2
We consider a feedback control v (x , s ) and the corresponding state
xv (.) (t ) solution of (40) when the control is replaced by the
feedback.
We associate the probability density mv (.) (x , t ) solution of
( (.)) = E (x (T ) −
J v
49/110
2
x T
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
MEAN FIELD TYPE CONTROL PROBLEM II
∂ mv (.)
∂
1 ∂2 2
+
(xmv (.) (r + α.v (x ))) −
(x mv (.) |σ ∗ v (x )|2 ) = 0
∂t
∂x
2 ∂ x2
(43)
v (.) (x , 0) = δ (x − x )
m
0
The functional (42) can be written as
Z
( (.)) =
J v
50/110
v (.) (x , T )(x −
m
γ
2
x
2
γ
)dx + (
Alain Bensoussan, Jens Frehse, Phillip Yam
2
Z
v (.) (x , T )xdx ) (44)
m
2
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
NECESSARY CONDITIONS I
Let v̂ (x , t ) be an optimal feedback, and m(t ) = mv̂ (.) (t ). Using the
mean eld type control approach, we get a pair u (x , t ),m(x , t )
satisfying
∂u 2
∂u 1 (∂x ) ∗
∂u
−
− xr
+
α (σ σ ∗ )−1 α = 0
∂t
∂ x 2 ∂ 2u
∂ x 2Z
γ 2
u (x , T ) = x −
x + γx
m (ξ , T )ξ d ξ
(45)
2
51/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
NECESSARY CONDITIONS II
52/110

∂u

∂m
∂ (xm)
∂ 
m ∂ x  α ∗ (σ σ ∗ )−1 α −
+r
−
2

∂t
∂x
∂x
∂ u
2
 ∂x

∂u 2
( ) 
1 ∂2 
m ∂ x  α ∗ (σ σ ∗ )−1 α = 0
−
2 ∂ x 2  ∂ 2u 2 
( 2)
∂x
m (x , 0) = δ (x − x0 )

Alain Bensoussan, Jens Frehse, Phillip Yam
(46)
(47)
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
OPTIMAL FEEDBACK I
The optimal feedback is dened by
∂u
∂ x (σ σ ∗ )−1 α
v̂ (x , t ) = −
∂ 2u
x
∂ x2
(48)
We can solve explicitly the system (45), (46). We look for
( , )=−
u x t
We also dene
1
P (t )x + s (t )x + ρ(t )
2
2
Z
( )=
q t
53/110
m
(ξ , t )ξ d ξ
Alain Bensoussan, Jens Frehse, Phillip Yam
(49)
(50)
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
SOLUTION I
54/110
We obtain
( ) = γ exp
P t
Z T
t
(2r − α ∗ (σ σ ∗ )−1 α)d τ
( ) = (1 + γ q (T )) exp
s t
ρ(t ) =
Z T
Z T
t
(51)
(r − α ∗ (σ σ ∗ )−1 α)d τ
1s ∗
α (σ σ ∗ )− α(τ)d τ
t 2P
2
1
We have to x q (T ). Equation (46) becomes
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
SOLUTION II
55/110
∂m
∂ (xm)
∂ s
+r
−
) α ∗ (σ σ ∗ )−1 α
m (x −
∂t
∂x
∂x
P
1 ∂2 s 2
−
m (x −
)
α ∗ (σ σ ∗ )−1 α = 0
2 ∂ x2
P
m (x , 0) = δ (x − x0 )
Alain Bensoussan, Jens Frehse, Phillip Yam
(52)
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
OPTIMAL FEEDBACK
If we test this equation with x we obtain easily
( ) = x0 exp
Z T
rd
q T
0
1
τ + [exp
γ
Z T
0
α ∗ (σ σ ∗ )−1 α)d τ − 1]
(53)
This completes the denition of the function u (x , t ). The optimal
feedback is dened by , see (48)
∗ −1
( , ) = −(σ σ )
v̂ x t
α+
1 1 + γ q (T )
x
γ
exp −
Z T
t
rd
τ
(54)
We see that this optimal feedback depends on the initial condition
x0
56/110
.
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
TIME CONSISTENCY APPROACH I
If we take the time consistency approach, we consider the family of
problems
dx
(55)
= rxds + xv (x , s ).(α dt + σ dw ), s > t
( )=x
x t
and the pay-o
J
γ
γ
x ,t (v (.)) = E (x (T ) − 2 x (T ) ) + 2 (Ex (T ))
2
2
(56)
Denote by v̂ (x , s ) an optimal feedback and set V (x , t ) = Jx ,t (v̂ (.)).
57/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
FUNCTION
Ψ
We dene
Ψ(x , t ; T ) = E x̂xt (T )
where x̂xt (s ) is the solution of (55) for the optimal feedback.
58/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
FUNCTION
V
I
The function Ψ(x , t ; T ) is the solution of
∂Ψ ∂Ψ
1 ∂ 2Ψ
+
(rx + x v̂ (x , t )∗ α) + x 2 2 |σ ∗ v̂ (x , t )|2 = 0
∂t
∂x
2 ∂x
Ψ(x , T ; T ) = x
We can write
( , ) = E (x̂xt (T ) −
V x t
59/110
γ
γ
x̂ (T ) ) +
(Ψ(x , t ; T ))
2 xt
2
Alain Bensoussan, Jens Frehse, Phillip Yam
2
2
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
SPIKE MODIFICATION I
We consider a spike modication
v̄ (x , s ) = v
( , )
v̂ x s
t
< s < t +ε
s > t +ε
then
J
γ
x ,t (v̄ (.)) = E ((x̂x (t +ε),t +ε (T ) − 2 x̂x (t +ε),t +ε (T ) )
2
γ
+ (E Ψ(x (t + ε), t + ε; T ))2
2
where x (t + ε) corresponds to the solution of (55) at time t + ε for
the feedback equal to the constant v .
60/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
APPROXIMATION I
61/110
We note that
( ( + ε), t + ε) = E ((x̂x (t +ε),t +ε (T ) −
EV x t
+
γ
2
E
γ
2
x̂
(T ) )
2 x (t +ε),t +ε
(Ψ(x (t + ε), t + ε; T ))2
so we have to compare (E Ψ(x (t + ε), t + ε; T )) with
E (Ψ(x (t + ε), t + ε; T )) . We see easily that
2
2
(E Ψ(x (t + ε), t + ε; T ))2 − E (Ψ(x (t + ε), t + ε; T ))2 =
= −ε x 2
∂ 2Ψ
(x , t ; T )|σ ∗ v |2 + 0(ε)
∂ x2
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
HJB EQUATION I
We obtain the HJB equation
2
∂V ∂V
∂V ∗
x
∂ 2V
∂ 2Ψ
+
rx + max[x
v α +
( 2 − γ 2 (x , t ; T ))v ∗ σ σ ∗ v ] = 0
v
∂t
∂x
∂x
2 ∂x
∂x
(57)
( , )=x
V x T
A direct checking shows that
1 ZT ∗
V (x , t ) = x exp r (T − t ) +
α (σ σ ∗ )α ds
2γ t
Ψ(x , t ; T ) = x exp r (T − t ) +
62/110
Alain Bensoussan, Jens Frehse, Phillip Yam
1Z T
γ t
(58)
α ∗ (σ σ ∗ )α ds
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
HJB-FP APPROACH
GENERAL PRESENTATION
STOCHASTIC MAXIMUM PRINCIPLE
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
TIME CONSISTENCY APPROACH
DIFFERENT POPULATIONS
THE MEAN VARIANCE PROBLEM
COALITIONS
TIME CONSISTENCY
HJB EQUATION II
and
( , )=
v̂ x t
exp −r (T − t )
x
γ
(σ σ ∗ )α
(59)
This optimal control satises the time consistency property ( it
does not depend on the initial condition).
63/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
GENERAL CONSIDERATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
MULTI-CLASS AGENTS
DIFFERENT POPULATIONS
MAJOR PLAYER
COALITIONS
GENERAL CONSIDERATIONS I
In the preceding slides, we have considered a single population,
composed of a large number of individuals, with identical behavior.
In real situations, we will have several populations. The natural
extension to the preceding developments is to obtain mean eld
equations for each population. A much more challenging situation
will be to consider competing populations. We present rst the
approach of multi-class agents, as described in [16], [18].
64/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
GENERAL CONSIDERATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
MULTI-CLASS AGENTS
DIFFERENT POPULATIONS
MAJOR PLAYER
COALITIONS
MODEL I
Instead of functions f (x , m, v ),g (x , m, v ), h(x , m), σ (x ) we consider
functions fk (x , m, v ),gk (x , m, v ), hk (x , m), σk (x ), k = 1, · · · K .
The index k represents some characteristics of the agents, and a
class corresponds to one value of the characteristics. So there are
K classes. In the model discussed previously, we have considered a
single class. In the sequel, when we consider an agent i ,he will have
a characteristics α i ∈ (1, · · · , K ). Agents will be dened with upper
indices, so i = 1, · · · N with N very large. α i is a known
information. The important assumption is
K
1 N
N
65/110
∑ 1Iα =k → πk , as
i=
i
N
→ +∞
(60)
1
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
GENERAL CONSIDERATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
MULTI-CLASS AGENTS
DIFFERENT POPULATIONS
MAJOR PLAYER
COALITIONS
MODEL II
and πk is a probability distribution on the nite set of
characteristics, which represents the probability that an agent has
the characteristics k .
66/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
GENERAL CONSIDERATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
MULTI-CLASS AGENTS
DIFFERENT POPULATIONS
MAJOR PLAYER
COALITIONS
FURTHER NOTATION I
67/110
Generalizing the case of a single class, we dene
1
ak (x ) =
σ (x )σk (x )∗ and the operator
2 k
k ϕ(x ) = −trak (x )D ϕ(x )
A
2
We dene Lagrangians, Hamiltonians indexed by k ,namely
k (x , m, v , q ) = fk (x , m, v ) + q .gk (x , m, v )
L
L (x , m , v , q )
k (x , m, q ) = inf
v k
H
and v̂k (x , m, q ) denotes the minimizer in the dention of the
Hamiltonian. We also dene
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
GENERAL CONSIDERATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
MULTI-CLASS AGENTS
DIFFERENT POPULATIONS
MAJOR PLAYER
COALITIONS
FURTHER NOTATION II
68/110
k (x , m, q ) = gk (x , m, v̂k (x , m, q ))
G
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
GENERAL CONSIDERATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
MULTI-CLASS AGENTS
DIFFERENT POPULATIONS
MAJOR PLAYER
COALITIONS
SYSTEM OF HJB EQUATIONS I
69/110
Given a function m(t ) we consider the HJB equations, indexed by k
−
∂ uk
+ Auk = Hk (x , m, Duk )
∂t
uk (x , T ) = hk (x , m (T ))
(61)
and the FP equations
∂ mk
+ A∗ mk + div (Gk (x , m, Duk )mk ) = 0
∂t
mk (x , 0) = mk 0 (x )
Alain Bensoussan, Jens Frehse, Phillip Yam
(62)
(63)
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
GENERAL CONSIDERATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
MULTI-CLASS AGENTS
DIFFERENT POPULATIONS
MAJOR PLAYER
COALITIONS
SYSTEM OF HJB EQUATIONS II
in which the probability densities mk are given. A mean eld game
equilibrium for the multi class agents problem is attained whenever
0
( , )=
m x t
K
∑ πk
k=
k (x , t ), ∀x , t
m
(64)
1
70/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
GENERAL CONSIDERATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
MULTI-CLASS AGENTS
DIFFERENT POPULATIONS
MAJOR PLAYER
COALITIONS
GENERAL COMMENTS I
We consider here a problem initiated by Huang [15], in the L.Q.
case. In a recent paper Nourian and Caines [23] have studied a non
linear mean eld game with a major player. In both papers, there is
a simplication in the coupling between the major player and the
representative agent. We will describe here the problem in full
generality and explain the simplication which is done in [23].
The new element is that, besides the representative agent there is a
major player. This major player inuences directly the mean eld
term. Since the mean eld term also impacts the major playor, he
will takes this into account to dene his decisions. On the other
hand, the mean eld term can no longer be deterministic, since it
depends on the major player decisions. This coupling creates new
diculties.
71/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
GENERAL CONSIDERATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
MULTI-CLASS AGENTS
DIFFERENT POPULATIONS
MAJOR PLAYER
COALITIONS
MODEL OF MAJOR PLAYER I
72/110
We introduce the following state evolution for the major player
dx0
x0
(65)
= g0 (x0 (t ), m(t ), v0 (t ))dt + σ0 (x0 )dw0
(0) = ξ0
We assume that x (t ) ∈ Rn0 , v (t ) ∈ Rd0 . The process w (t ) is a
standard Wiener process with values in Rk0 and ξ is a random
variable in Rn0 independent of the Wiener process. The process
m (t ) is the mean eld term, with values in the space of
probabilities on Rn . This term will come from the decisions of the
representative agent.
However, It will be linked to x (t ) since the major player inuences
the decision of the representative agent.
0
0
0
0
0
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
GENERAL CONSIDERATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
MULTI-CLASS AGENTS
DIFFERENT POPULATIONS
MAJOR PLAYER
COALITIONS
MODEL OF MAJOR PLAYER II
If we dene the ltration
F 0t = σ (ξ0 , w0 (s ), s ≤ t )
(66)
then m(t ) is a process adapted to F t . But it is not external.
0
73/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
GENERAL CONSIDERATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
MULTI-CLASS AGENTS
DIFFERENT POPULATIONS
MAJOR PLAYER
COALITIONS
MODEL OF MAJOR PLAYER I
74/110
We will describe the link with the state x in analyzing the
representative agent problem. The control v (t ) is also adapted to
F t . The objective functional of the major player is
0
0
0
( (.)) = E [
Z T
J0 v0
( ( ), m(t ), v0 (t ))dt +
f0 x0 t
0
(67)
+h0 (x0 (T ), m(T ))]
The functions g , f , σ , h are deterministic. We do not specify the
assumptions, since our treatment is formal.
0
0
0
0
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
GENERAL CONSIDERATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
MULTI-CLASS AGENTS
DIFFERENT POPULATIONS
MAJOR PLAYER
COALITIONS
MODEL OF REPRESENTATIVE AGENT I
The representative agent has state x (t ) ∈ Rn and control
d
v (t ) ∈ R .We have the evolution
dx
x
= g (x (t ), x0 (t ), m(t ), v (t ))dt + σ (x (t ))dw
(68)
(0) = ξ
in which w (t ) is a standard Wiener process with values in Rk and ξ
is a random variable with values in Rn independent of w (.).
Moreover, ξ , w (.) are independent of ξ , w (.). We dene
0
0
F t = σ (ξ , w (s ), s ≤ t )
Gt = F t ∪Ft
0
75/110
Alain Bensoussan, Jens Frehse, Phillip Yam
(69)
(70)
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
GENERAL CONSIDERATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
MULTI-CLASS AGENTS
DIFFERENT POPULATIONS
MAJOR PLAYER
COALITIONS
MODEL OF REPRESENTATIVE AGENT II
76/110
The control v (t ) is adapted to G t . The objective functional of the
representative agent is dened by
( (.), x0 (.), m(.)) = E [
Z T
J v
f
(x (t ), x0 (t ), m(t ), v (t ))dt +
0
(71)
+h(x (T ), x0 (T ), m(T ))]
Conversely to the major player problem, in the representative agent
problem, the processes x (.),m(.) are external. In (67) m(t )
depends on x (.).
0
0
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
GENERAL CONSIDERATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
MULTI-CLASS AGENTS
DIFFERENT POPULATIONS
MAJOR PLAYER
COALITIONS
CONDITIONAL PROBABILITY DENSITY OF THE
REPRESENTATIVE AGENT I
The representative agent's problem is similar to the standard
situation except for the presence of x (t ).
We begin by limiting the class of controls for the representative
agent to belong to feedbacks v (x , t ) random elds adapted to F t
. The corresponding state, solution of (68) is denoted by xv (.) (t ).
Of course, this process depends also of x (t ), m(t ) . Note that
t
x (t ), m (t ) is independent from F , therefore the conditional
probability density of xv (.) (t ) given the ltration ∪t F t is the
solution of the F.P. equation with random coecients
0
0
0
0
0
77/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
GENERAL CONSIDERATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
MULTI-CLASS AGENTS
DIFFERENT POPULATIONS
MAJOR PLAYER
COALITIONS
CONDITIONAL PROBABILITY DENSITY OF THE
REPRESENTATIVE AGENT II
78/110
∂ pv (.)
+ A∗ pv (.) + div(g (x , x0 (t ), m(t ), v (x , t ))pv (.) ) = 0
∂t
pv (.) (x , 0) = ϖ(x )
(72)
in which ϖ(x ) is the density probability of ξ .
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
GENERAL CONSIDERATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
MULTI-CLASS AGENTS
DIFFERENT POPULATIONS
MAJOR PLAYER
COALITIONS
OBJECTIVE FUNCTIONAL OF THE REPRESENTATIVE
AGENT I
We can then rewrite the objective functional J (v (.), x (.), m(.)) as
follows
0
Z TZ
( (.), x0 (.), m(.)) = E [
J v
p
0
Z
+
Rn
v (.),x0 (.),m(.) (x , t )f (x , x (t ), m(t ), v (x , t ))d
(73)
v (.),x0 (.),m(.) (x , T )h(x , x (T ), m(T ))dx ]
p
Rn
0
0
We can give an expression for this functional. Introduce the
random eld χv (.) (x , t ) solution of the stochastic backward PDE:
79/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
GENERAL CONSIDERATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
MULTI-CLASS AGENTS
DIFFERENT POPULATIONS
MAJOR PLAYER
COALITIONS
OBJECTIVE FUNCTIONAL OF THE REPRESENTATIVE
AGENT II
80/110
−
∂ χv (.)
+ Aχv (.) = f (x , x0 (t ), m(t ), v (x , t )) + g (x , x0 (t ), m(t ), v (x , t )).D χ
∂t
(74)
χv (.) (x , T ) = h(x , x0 (T ), m(T ))
then we can assert that
Z TZ
v (.),x0 (.),m(.) (x , t )f (x , x (t ), m(t ), v (x , t ))dxdt +
p
0
Rn
0
Z
p
Rn
v (.),x0 (.),m(.) (x , T )h(x , x (T ), m(T ))dx =
Alain Bensoussan, Jens Frehse, Phillip Yam
0
Z
Rn
χv (.) (x , 0)ϖ(x )
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
GENERAL CONSIDERATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
MULTI-CLASS AGENTS
DIFFERENT POPULATIONS
MAJOR PLAYER
COALITIONS
COMPUTING THE OBJECTIVE FUNCTION I
We get
Z
( (.), x0 (.), m(.)) =
J v
Rn
ϖ(x )E χv (.) (x , 0)dx
(75)
Now dene
u
v (.) (x , t ) = E
F 0t
χv (.) (x , t )
From equation (74) we can assert that
−E F
0t
∂ χv (.)
+ Auv (.) = f (x , x0 (t ), m(t ), v (x , t )) + g (x , x0 (t ), m(t ), v (x , t )
∂t
(76)
v (.) (x , T ) = h(x , x (T ), m(T ))
u
81/110
0
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
GENERAL CONSIDERATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
MULTI-CLASS AGENTS
DIFFERENT POPULATIONS
MAJOR PLAYER
COALITIONS
BACKWARD SPDE I
82/110
On the other hand
v (.) (x , t ) −
Z t
u
E
F 0s
0
∂ χv (.)
(x , s )ds
∂s
is a F t martingale. Therefore we can write
0
u
v (.) (x , t )−
Z t
E
0
F 0s
Z t
∂ χv (.)
(x , s )ds = uv (.) (x , 0)+ Kv (.) (x , s )dw0 (s )
∂s
0
where Kv (.) (x , s ) is F s measurable, and uniquely dened. It is
then easy to check that the random eld uv (.) (x , t ) is solution of
the backward stochastic PDE (BSPDE) :
0
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
GENERAL CONSIDERATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
MULTI-CLASS AGENTS
DIFFERENT POPULATIONS
MAJOR PLAYER
COALITIONS
BACKWARD SPDE II
83/110
−∂t uv (.) (x , t ) + Auv (.) (x , t )dt = f (x , x0 (t ), m(t ), v (x , t ))dt +
(77)
+g (x , x0 (t ), m(t ), v (x , t )).Duv (.) (x , t )dt − Kv (.) (x , t )dw0 (t )
v (.) (x , T ) = h(x , x (T ), m(T ))
u
Alain Bensoussan, Jens Frehse, Phillip Yam
0
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
GENERAL CONSIDERATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
MULTI-CLASS AGENTS
DIFFERENT POPULATIONS
MAJOR PLAYER
COALITIONS
NECESSARY CONDITION I
84/110
From ( 75) we get immediately
Z
( (.), x0 (.), m(.)) =
J v
Rn
ϖ(x )Euv (.) (x , 0)dx
(78)
We then write a necessary condition of optimality for a control
v̂ (x , t ).Setting u (x , t ) = uv̂ (.) (x , t ), K (x , t ) = Kv̂ (.) (x , t ) we obtain
the stochastic HJB equation
−∂t u (x , t ) + Au (x , t )dt = H (x , x0 (t ), m(t ), Du )dt − K (x , t )dw0
(79)
( , ) = h(x , x0 (T ), m(T ))
u x T
and
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
GENERAL CONSIDERATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
MULTI-CLASS AGENTS
DIFFERENT POPULATIONS
MAJOR PLAYER
COALITIONS
NECESSARY CONDITION II
85/110
( , ) = v̂ (x , x0 (t ), m(t ), Du (x , t ))
v̂ x t
Alain Bensoussan, Jens Frehse, Phillip Yam
(80)
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
GENERAL CONSIDERATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
MULTI-CLASS AGENTS
DIFFERENT POPULATIONS
MAJOR PLAYER
COALITIONS
FP EQUATION I
86/110
We next have to express the mean eld game condition
( ) = pv̂ (.),x0 (.),m(.) (., t )
m t
we obtain from (72) the FP equation
∂m
+ A∗ m + div(G (x , x0 (t ), m(t ), Du (x , t ))m) = 0
∂t
m (x , 0) = ϖ(x )
(81)
The coupled pair of HJB-FP equations (79),(81) allow to dene the
reaction function of the representative agent to the trajectory x (.)
of the major player. One denes the random elds u (x , t ), m(x , t )
and the optimal feedback is given by (80).
0
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
GENERAL CONSIDERATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
MULTI-CLASS AGENTS
DIFFERENT POPULATIONS
MAJOR PLAYER
COALITIONS
MAJOR PLAYER
I
Consider now the problem of the major player. In [23] and also [15]
for the L.Q. case it is limited to (65), (67) since m(t ) is external.
However since m(t ) is coupled to x (t ) through equations (79),
(81) one cannot consider m(t ) as external, unless limiting the
decision of the major player. So in fact the major player has to
consider three state equations (65), (79), (81). For a a given v (.)
adapted to F t we associate x ,v0 (.) (.), uv0 (.) (., .), mv0 (.) (., .)
solution of the system (65), (79), (81).
Introduce the notation
0
0
0
0
87/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
GENERAL CONSIDERATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
MULTI-CLASS AGENTS
DIFFERENT POPULATIONS
MAJOR PLAYER
COALITIONS
MAJOR PLAYER
88/110
II
( , , ) = inf [f0 (x0 , m, v0 ) + p .g0 (x0 , m, v0 )]
H0 x0 m p
( , , )
v̂0 x0 m p
( , , ) =
G0 x m p
v0
minimizes the expression in brackets
( , , ( , , ))
g0 x0 m v̂0 x0 m p
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
GENERAL CONSIDERATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
MULTI-CLASS AGENTS
DIFFERENT POPULATIONS
MAJOR PLAYER
COALITIONS
NECESSARY CONDITIONS FOR THE MAJOR PLAYER I
We have 3 adjoint equations
k0
−dp = [H0,x0 (x0 (t ), m(t ), p (t )) + ∑ σ0∗l ,x0 (x0 (t ))ql (t )
l=
1
Z
+
Z
∗
Gx (x , x0 (t ), m (t ), Du (x , t ))D η(x , t )m (x , t )dx
0
(82)
k0
ζ (x , t )Hx0 (x , x0 (t ), m(t ), Du (x , t )dx ]dt − ∑ ql dw0l
l=
89/110
+
Alain Bensoussan, Jens Frehse, Phillip Yam
1
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
GENERAL CONSIDERATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
MULTI-CLASS AGENTS
DIFFERENT POPULATIONS
MAJOR PLAYER
COALITIONS
NECESSARY CONDITIONS FOR THE MAJOR PLAYER II
90/110
( ) = h0,x0 (x0 (T ), m(T )) +
p T
Z
ζ (x , T )hx0 (x , x0 (T ), m(T ))dx
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
GENERAL CONSIDERATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
MULTI-CLASS AGENTS
DIFFERENT POPULATIONS
MAJOR PLAYER
COALITIONS
NECESSARY CONDITIONS FOR THE MAJOR PLAYER I
91/110
−∂t η + Aη(x , t )dt = [
∂ H0
(x0 (t ), m(t ), p (t ))(x )
∂m
+D η(x , t ).G (x , x0 (t ), m(t ), Du (x , t ))+
Z
+
D
η(ξ , t ).
Z
+
ζ (ξ , t )
∂G
(ξ , x0 (t ), m(t ), Du (ξ , t ))(x )m(ξ , t )d ξ
∂m
∂H
(ξ , x0 (t ), m(t ), Du (ξ , t ))(x )d ξ ]dt −
∂m
∂ h0
η(x , T ) =
(x0 (T ), m(T ))(x )+
∂m
Z
Alain Bensoussan, Jens Frehse, Phillip Yam
ζ (ξ , T )
∑ µl (
l
(83)
, )
l (t )
x t dw0
∂h
(ξ , x0 (T ), m(T ))(x )d ξ
∂m
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
GENERAL CONSIDERATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
MULTI-CLASS AGENTS
DIFFERENT POPULATIONS
MAJOR PLAYER
COALITIONS
NECESSARY CONDITIONS FOR THE MAJOR PLAYER I
∂ζ
+ A∗ ζ (x , t ) + div (G (x , x0 (t ), m(t ), Du (x , t ))ζ (x , t ))
∂t
+div(Gq∗ (x , x0 (t ), m(t ), Du (x , t ))D η(x , t ) m(x , t )) = (84)
0
ζ (x , 0) = 0
Next x (t ) satises
0
dx0
x0
92/110
=
( ( ), m(t ), p (t ))dt + σ0 (x0 (t ))dw0
G0 x0 t
(85)
(0) = ξ0
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
GENERAL CONSIDERATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
MULTI-CLASS AGENTS
DIFFERENT POPULATIONS
MAJOR PLAYER
COALITIONS
NECESSARY CONDITIONS FOR THE MAJOR PLAYER II
So, in fact the complete solution is provided by the 6 equations
(85),(82), (79), (84), (81), (83) and the feedback of the
representative agent and the contol of the major player are given by
(80) and
( ) = v̂0 (x0 (t ), m(t ), p (t ))
v̂0 t
93/110
Alain Bensoussan, Jens Frehse, Phillip Yam
(86)
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
SYSTEM OF HJB-FP EQUATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
APPROXIMATE NASH EQUILIBRIUM FOR LARGE COMM
DIFFERENT POPULATIONS
COALITIONS
SYSTEM OF HJB-FP EQUATIONS I
We can introduce more general problems
−
∂ ui
+ Au i = H i (x , m, Du )
∂t
i
i
u (x , T ) = h (x , m (T ))
∂ mi
+ A∗ mi + div (G i (x , m, Du )mi ) = 0
∂t
i
i
m (x , 0) = m0 (x )
(87)
(88)
in which m = (m , · · · , mN ) and the functions H i , G i depend on the
full vector m. The interpretation is much more elaborate.
1
94/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
SYSTEM OF HJB-FP EQUATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
APPROXIMATE NASH EQUILIBRIUM FOR LARGE COMM
DIFFERENT POPULATIONS
COALITIONS
DESCRIPTION OF THE GAME I
95/110
We want to associate to problem (87), (88) a dierential game for
N communities, composed of very large numbers of agents. We
denote the agents by the index i , j where i = 1, · · · N and
j = 1, · · · M . The number M will tend to +∞. Each player
i ,j (x ), x ∈ Rn . The state of player i , j is
i , j chooses a feedback v
denoted by x i ,j (t ) ∈ Rn . We consider independent standard Wiener
processes w i ,j (t ) and independent replicas x i ,j of the random
variable x i , whose probability density is mi . They are independent
of the Wiener processes. We denote
0
0
0
v
.j
(.) = (v 1,j (.), · · · , v N ,j (.))
The trajectory of the state x i ,j is dened by the equation
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
SYSTEM OF HJB-FP EQUATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
APPROXIMATE NASH EQUILIBRIUM FOR LARGE COMM
DIFFERENT POPULATIONS
COALITIONS
DESCRIPTION OF THE GAME II
i ,j = g i (x i ,j , v .j (x i ,j ))dt + σ (x i ,j )dw i ,j
i ,j (0) = x i ,j
x
dx
(89)
0
96/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
SYSTEM OF HJB-FP EQUATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
APPROXIMATE NASH EQUILIBRIUM FOR LARGE COMM
DIFFERENT POPULATIONS
COALITIONS
DESCRIPTION OF THE GAME I
97/110
The trajectories are independent. The player i , j trajectory is
inuenced by the feedbacks v k ,j (x ), k 6= i acting on his own state.
When we focus on player i we use the notation
v
.j
(.) = (v i ,j (.), v̄ i ,j (.))
in which v̄ i ,j (.) represents all feedbacks v k ,j (x ), k 6= i . The notation
v (.) represents all feedbacks.
We now dene the objective functional of player i , j . It is given by
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
SYSTEM OF HJB-FP EQUATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
APPROXIMATE NASH EQUILIBRIUM FOR LARGE COMM
DIFFERENT POPULATIONS
COALITIONS
DESCRIPTION OF THE GAME II
J i ,j (v (.)) = E
i ( i ,j (t ),
f0 x
1
M
M
− 1 l =∑
16=j
Z T
[f i (x i ,j (t ), v .j (x i ,j (t ))) +
0
δx , (t ) ))]dt + Ehi (x i ,j (T ),
1
i l
(90)
M
δx , (T ) )
− 1 l =∑
16=j
i l
M
We look for a Nash equilibrium.
98/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
SYSTEM OF HJB-FP EQUATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
APPROXIMATE NASH EQUILIBRIUM FOR LARGE COMM
DIFFERENT POPULATIONS
COALITIONS
APPROXIMATE NASH EQUILIBRIUM I
99/110
Consider next the system of pairs of HJB-FP equations (87), (88)
and the feedback v̂ (x ).
We can show that the feedback
v̂
i ,j (.) = v̂ i (.)
is an approximate Nash equilibrium.
If we use this feedback in the state equation (89) we get
d x̂
i ,j = g i (x̂ i ,j , v̂ (x̂ i ,j ))dt + σ (x̂ i ,j )dw i ,j
i ,j (0) = x i ,j
x̂
0
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
SYSTEM OF HJB-FP EQUATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
APPROXIMATE NASH EQUILIBRIUM FOR LARGE COMM
DIFFERENT POPULATIONS
COALITIONS
APPROXIMATE NASH EQUILIBRIUM II
100/110
and the trajectories x̂ i ,j become independent replicas of x̂ i solution
of
d x̂
i = g i (x̂ i , v̂ (x̂ i ))dt + σ (x̂ i )dw i
i
i
x̂ (0) = x
0
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
SYSTEM OF HJB-FP EQUATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
APPROXIMATE NASH EQUILIBRIUM FOR LARGE COMM
DIFFERENT POPULATIONS
COALITIONS
APPROXIMATE NASH EQUILIBRIUM I
101/110
The probability density of x̂ i (t ) is mi (t ). We rst prove
J i ,j (v̂ (.)) − J i (v̂ (.), mi (.)) → 0, as M → +∞.
We now focus on player 1, 1 to x the ideas. Suppose he uses a
feedback v , (x ) 6= v̂ , (x ), and the other players use
i ,j (x ) = v̂ i (x ), ∀i ≥ 2, ∀j or ∀i , ∀j ≥ 2. We set v (x ) = v , (x ). Call
v̂
1 1
1 1
1
1 1
this set of controls ṽ (.). By abuse of notation, we also write
ṽ
(.) = (v 1 (.), v̂ 2 (.), · · · v̂ N (.)) = (v 1 (.), v̂ (.))
1
The corresponding trajectories are denoted by y 1,j (t ) solutions of
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
SYSTEM OF HJB-FP EQUATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
APPROXIMATE NASH EQUILIBRIUM FOR LARGE COMM
DIFFERENT POPULATIONS
COALITIONS
APPROXIMATE NASH EQUILIBRIUM II
102/110
dy
y
,
1 1
,
1 1
1
= g 1 (y 1,1 , v 1 (y 1,1 ), v̂ (y 1,1 ))dt + σ (y 1,1 )dw 1,1
(0) = x0
(91)
,
1 1
and y ,j = x̂ ,j for j ≥ 2.
We can then prove that
1
1
J 1,1 (ṽ (.)) ≥ J 1 (v̂ (.), m1 (.)) − O (M )
and this concludes the approximate Nash equilibrium property.
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
SYSTEM OF HJB-FP EQUATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
APPROXIMATE NASH EQUILIBRIUM FOR LARGE COMM
DIFFERENT POPULATIONS
COALITIONS
REFERENCES I
Andersson,D.,Djehiche,B. (2011) A Maximum Principle for
SDEs of Mean Field Type, Applied Mathematics and
Optimization 63, p.341-356
Bensoussan,A., Frehse,J. (2002) Regularity Results for
Springer Applied
Mathematical Sciences, Vol 151, 2002
Bensoussan,A., Frehse,J. (1995) Ergodic Bellman Systems for
Stochastic Games in arbitrary Dimension, Proc. Royal Society,
London, Mathematical and Physical sciences A, 449, p. 65-67
Nonlinear Elliptic Systems and Applications,
Bensoussan,A., Frehse,J. (2002) Smooth Solutions of Systems
of Quasilinear Parabolic Equations, ESAIM: Control,
Optimization and Calculus of Variations , Vol.8, p. 169-193
103/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
SYSTEM OF HJB-FP EQUATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
APPROXIMATE NASH EQUILIBRIUM FOR LARGE COMM
DIFFERENT POPULATIONS
COALITIONS
REFERENCES II
Bensoussan,A., Frehse,J. (2012), Control and Nash Games
with Mean Field eect, Chinese Annals of Mathematics, to be
published
Bensoussan,A., Frehse,J., Vogelgesang, J., (2010) Systems of
Bellman Equations to Stochastic Dierential Games with
Noncompact Coupling, Discrete and Continuous Dynamical
Systems, Series A, 274, p. 1375-1390
Bensoussan,A., Frehse,J., Vogelgesang, J., (2012) Nash and
Stackelberg Dierential Games, Chinese Annals of
Mathematics, Series B
Björk T., Murgoci A., A General Theory of Markovian Time
Inconsistent Stochastic Control Problems, SSRN: 1694759
104/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
SYSTEM OF HJB-FP EQUATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
APPROXIMATE NASH EQUILIBRIUM FOR LARGE COMM
DIFFERENT POPULATIONS
COALITIONS
REFERENCES III
Bensoussan,A.,Sung, K.C.J., Yam, S.C.P. ,Yung, S.P. ,
Linear-Quadratic Mean Field Games, Technical report
Buckdahn,R., Djehiche, B.,Li, J., (2011) A General Stochastic
Maximum Principle for SDEs of Mean-eld Type, Appl Math
Optim, 64: 197-216
Buckdahn,R., Li, J.,Peng, SG., (2009) Mean-eld backward
stochastic dierential equations and related partial dierential
equations, Stochastic Processes and their Applications, 119,
p.3133-3154
Cardaliaguet, P., (2010) Notes on Mean Field Games,
Technical Report
105/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
SYSTEM OF HJB-FP EQUATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
APPROXIMATE NASH EQUILIBRIUM FOR LARGE COMM
DIFFERENT POPULATIONS
COALITIONS
REFERENCES IV
Carmona, R., Delarue, F., (2012), Probabilistic Analysis of
Mean-Field Games, Working Paper
Guéant,O., Lasry,J.-M., Lions,P.-L.(2011) Mean Field Games
and applications, in A,R. Carmona et al.(eds), Paris-Princeton
Lectures on Mathematical Sciences 2010, 205-266
Huang,M., (2010) Large-Population LQG Games Involving a
Major Player: The Nash Certainty Equivalence Principle, SIAM
J. Control, Vol. 48, 5, p.3318-3353.
106/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
SYSTEM OF HJB-FP EQUATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
APPROXIMATE NASH EQUILIBRIUM FOR LARGE COMM
DIFFERENT POPULATIONS
COALITIONS
REFERENCES V
Huang,M.,Malhamé, R.P. ,Caines,P.E. (2006) Large Population
Stochastic Dynamic Games: Closed-Loop MCKean-Vlasov
Systems and the Nash Certainty Equivalence Principle,
Communications in Information and Systems, Vol 6, nº 3, p.
221-252
Huang,M., Caines,P.E., Malhamé, R.P. (2007)
Large-population cost-coupled LQG problems with nonuniform
agents: individual-mass behavior and decentralized ε−Nash
equilibria, IEEE Transactions on Automatic Control, 52 (9),
1560-1571
Huang,M., Caines,P.E., Malhamé, R.P. (2007) An Invariance
Principle in Large Population Stochastic Dynamic Games,
Journal of Systems Science and Complexity, 20 (2), 162-172
107/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
SYSTEM OF HJB-FP EQUATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
APPROXIMATE NASH EQUILIBRIUM FOR LARGE COMM
DIFFERENT POPULATIONS
COALITIONS
REFERENCES VI
Lasry,J.-M., Lions,P.-L. (2006). Jeux à champ moyen I- Le cas
stationnaire, Comptes Rendus de l'Académie des Sciences,
Series I, 343,619-625.
Lasry,J.-M., Lions,P.-L. (2006). Jeux à champ moyen II- Horizn
ni et contrôle optimal, Comptes Rendus de l'Académie des
Sciences, Series I, 343,679-684.
Lasry,J.-M., Lions,P.-L. (2007). Mean Field Games, Japanese
Journal of Mathematics 2(1),229-260.
McKean, H.P., (1966), A class of Markov processes associated
with nonlinear parabolic equations, Proc. Natl. Acad. Sci. USA
56, 1907-1911
108/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
SYSTEM OF HJB-FP EQUATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
APPROXIMATE NASH EQUILIBRIUM FOR LARGE COMM
DIFFERENT POPULATIONS
COALITIONS
REFERENCES VII
Nourian, M., Caines P.E. (2012) e-Nash Mean Field Game
Theory for nonlinear stochastic dynamical systems with major
and minor agents, submitted to SIAM J. Control Optim.
109/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H
INTRODUCTION
GENERAL PRESENTATION
SYSTEM OF HJB-FP EQUATIONS
DISCUSSION OF THE MEAN FIELD TYPE CONTROL PROBLEM
APPROXIMATE NASH EQUILIBRIUM FOR LARGE COMM
DIFFERENT POPULATIONS
COALITIONS
Thanks!
110/110
Alain Bensoussan, Jens Frehse, Phillip Yam
Dierential games, Nash equilibrium, Mean Field, H