Leveraging Correlations in Utility Learning Presenter: Ming Jin ACC

Leveraging Correlations in Utility Learning
Presenter: Ming Jin
ACC 2017
Ioannis C. Konstantakopoulos, Lillian J. Ratliff,
Ming Jin, and Costas J. Spanos
1
People-Building-Grid Nexus
Internet of Things
(IoT)
Smart building
Demand Respond Occupants’
Smart Grid satisfaction
Occupancy Detection
Building “senses”
grid – occupants
actions
Building respects comfort, productivity, wellbeing, satisfaction
Agile operations
in buildings
Occupants take
real time control
2
People-Building-Grid Nexus
Data
Agile
Building
Building
Occupants
Efficient
Algorithms
The M$ question –
How to learn about people’s
preference from their actions?
4
Social Game for energy
efficiency (Allerton 2014)
Histogram of Wild bootstrapping estimators
400
Wild bootstrapping utility estimators
cFGLS utility estimator
350
300
250
Social
Games
Robust Utility Learning (IEEE
Transactions on Control
Systems Technology)
200
150
100
50
0
-0.9
-0.8
-0.7
-0.6
-0.5
-0.4
-0.3
-0.2
-0.1
0
32 utility estimators
Robust Utility Learning
for closing the loop (CPS
Week 2016)
★
★
★
★
Efficient
Learning
Leveraging correlations
in utility learning (ACC
2017)
★
Decision
Modeling
Inverse Modeling via
Mixture of Utilities
(CDC 2016)
5
Leveraging social coalitions can better
understand/influence people’s behaviors
Engaging people in a game improves energy
efficiency and occupant comfort
6
Outline
Motivation
Energy efficiency via gamification
Utility learning framework
Leveraging correlations
Conclusion and on-going works
7
1. Social games 2. Utility learning 3. Leveraging correlations 4. Conclusion
Energy efficiency via gamification Social Game for Building Energy Efficiency: Incentive Design - (Allerton 2014)
8
1. Social games 2. Utility learning 3. Leveraging correlations 4. Conclusion
Social game for building energy efficiency ❑ Social game for changing
occupant energy – related
behaviors.
❑ 20 occupants
❑ Weekly lottery w/ Amazon cards
27.5% energy reduction
from shared lights
Lighting
zones
9
1. Social games 2. Utility learning 3. Leveraging correlations 4. Conclusion
Social game for building energy efficiency Real time control for
shared lights and HVAC
o Points are used to determine probability of winning in lottery
“Use less to gain more”
10
1. Social games 2. Utility learning 3. Leveraging correlations 4. Conclusion
People play Nash equilibrium
11
1. Social games 2. Utility learning 3. Leveraging correlations 4. Conclusion
Nash characterized by KKT conditions
12
1. Social games 2. Utility learning 3. Leveraging correlations 4. Conclusion
Learning with constrained Ordinary
Least Squares (cOLS)
From the KKT conditions of Nash, one can formulate cOLS:
where
is “spherical” noise.
However, this estimator often performs “poorly”, due to this
strong assumption on “spherical noise structure”.
Besides, data is often expensive and limited!
13
People often vote as a group..
We can design utility with player coalition
based on the correlation matrix.
14
1. Social games 2. Utility learning 3. Leveraging correlations 4. Conclusion
Leveraging correlations among players
❑ Estimate correlations
between occupants
Contributions
❑ Boost performance using
OLS
❑ Reduce computational
complexity
❑ Transfer to online learning
15
1. Social games 2. Utility learning 3. Leveraging correlations 4. Conclusion
Gradient Boosting to estimate noise structure
Original fitting
residuals
3. Residuals
update
o AIC criterion
o Fixed
iterations
o Prevent
overfitting
1. Residual
fitting
2. Update with
shrinkage
parameter ν in
(0,1]
Robust Utility Learning with Applications to Social Games in Smart Buildings
(Submitted to IEEE Transactions on Control Systems Technology)
16
1. Social games 2. Utility learning 3. Leveraging correlations 4. Conclusion
Utility function based on correlation
Correlation utility
Coalition utility
Happiness Metric:
Coalition utility per player:
17
1. Social games 2. Utility learning 3. Leveraging correlations 4. Conclusion
Experimental
Results
18
1. Social games 2. Utility learning 3. Leveraging correlations 4. Conclusion
Gradient boosting gives unbiased estimator
Asymptotic bias estimation via bootstrapping
cFGLS estimator
almost unbiased
19
1. Social games 2. Utility learning 3. Leveraging correlations 4. Conclusion
Leveraging correlation improves
forecasting
No coalitions
20
1. Social games 2. Utility learning 3. Leveraging correlations 4. Conclusion
Happiness metric indicates collusion
between users 8 and 14
No coalitions
21
1. Social games 2. Utility learning 3. Leveraging correlations 4. Conclusion
Conclusion
Unknown noise structure
Limited sample size
Correlated players’ actions
22
1. Social games 2. Utility learning 3. Leveraging correlations 4. Conclusion
NEW GAMES, MORE GAMES !!!
Shared lighting experiment
at Sutardja Dai Hall with 200
occupants (4th and 7th floor)
Shared lighting – personal
desk electrical equipment
experiment at CREATE Tower
(11th floor) in Singapore with
50 occupants
Personal room lighting at
Graduate Hall dorms in Eco
Campus in Singapore with
100++ occupants
23
Acknowledgement
❑ Singapore-Berkeley Building Efficiency and Sustainability in the Tropics
(SinBerBEST) Program
24