Optimisation theory - University of London International Programmes

Optimisation theory
M. Baltovic
MT3042, 2790042
2011
Undergraduate study in
Economics, Management,
Finance and the Social Sciences
This is an extract from a subject guide for an undergraduate course offered as part of the
University of London International Programmes in Economics, Management, Finance and
the Social Sciences. Materials for these programmes are developed by academics at the
London School of Economics and Political Science (LSE).
For more information, see: www.londoninternational.ac.uk
This guide was prepared for the University of London International Programmes by:
Mark Baltovic, Department of Mathematics, London School of Economics and
Political Science.
This is one of a series of subject guides published by the University. We regret that due to
pressure of work the author is unable to enter into any correspondence relating to, or arising
from, the guide. If you have any comments on this subject guide, favourable or unfavourable,
please use the form at the back of this guide.
University of London International Programmes
Publications Office
Stewart House
32 Russell Square
London WC1B 5DN
United Kingdom
Website: www.londoninternational.ac.uk
Published by: University of London
© University of London 2010
Reprinted with minor revisions 2011
The University of London asserts copyright over all material in this subject guide except where
otherwise indicated. All rights reserved. No part of this work may be reproduced in any form,
or by any means, without permission in writing from the publisher.
We make every effort to contact copyright holders. If you think we have inadvertently used
your copyright material, please let us know.
Contents
Contents
1 Introduction
1
1.1
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
1
1.2
Aims and objectives . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
1
1.3
Learning outcomes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
2
1.4
How to use this subject guide . . . . . . . . . . . . . . . . . . . . . . . .
2
1.5
Structure of the subject guide . . . . . . . . . . . . . . . . . . . . . . . .
3
1.6
Essential reading . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3
1.7
Further reading . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
4
1.8
Online study resources . . . . . . . . . . . . . . . . . . . . . . . . . . . .
4
1.8.1
The VLE . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
5
1.8.2
Making use of the Online Library . . . . . . . . . . . . . . . . . .
5
Examination advice . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
6
1.10 Syllabus . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
7
1.11 Acknowledgments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
7
1.12 List of abbreviations used in this subject guide . . . . . . . . . . . . . . .
7
1.9
2 Mathematical preliminaries
9
Essential reading . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
9
Further reading . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
9
Aims and objectives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
9
2.1
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
9
2.2
Vector calculus . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
10
2.2.1
Inner products . . . . . . . . . . . . . . . . . . . . . . . . . . . .
10
2.3
Analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
11
2.4
Basic set theory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
12
2.4.1
Subsets of R . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
12
2.4.2
Bounded subsets of Rn . . . . . . . . . . . . . . . . . . . . . . . .
14
2.4.3
Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
14
Topology . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
15
2.5.1
15
2.5
Open and closed sets . . . . . . . . . . . . . . . . . . . . . . . . .
i
Contents
2.5.2
Compactness . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
17
Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
18
2.6.1
Continuity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
18
2.6.2
Differentiability . . . . . . . . . . . . . . . . . . . . . . . . . . . .
19
Learning outcomes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
20
2.6
3 Weierstrass’ Theorem
Essential reading . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
21
Aims and objectives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
21
3.1
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
21
3.2
The Theorem of Weierstrass . . . . . . . . . . . . . . . . . . . . . . . . .
22
3.3
Proving the theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
23
3.3.1
Continuity and compactness . . . . . . . . . . . . . . . . . . . . .
23
3.3.2
Compactness and boundedness . . . . . . . . . . . . . . . . . . .
26
Worked examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
27
Learning outcomes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
29
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
29
Solutions to exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
30
3.4
4 Unconstrained optimisation
39
Essential reading . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
39
Aims and objectives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
39
4.1
The basic optimisation problem in Rn . . . . . . . . . . . . . . . . . . . .
39
4.2
First-order conditions for optima . . . . . . . . . . . . . . . . . . . . . .
41
4.3
Definite and semidefinite matrices . . . . . . . . . . . . . . . . . . . . . .
42
4.4
Second-order conditions for optima . . . . . . . . . . . . . . . . . . . . .
43
4.5
Worked example
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
43
Learning outcomes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
45
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
46
Solutions to exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
47
5 Optimisation under equality constraints
ii
21
53
Essential reading . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
53
Aims and objectives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
53
5.1
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
53
5.2
Statement of the problem . . . . . . . . . . . . . . . . . . . . . . . . . .
53
5.2.1
55
The standard form . . . . . . . . . . . . . . . . . . . . . . . . . .
Contents
5.3
The Theorem of Lagrange . . . . . . . . . . . . . . . . . . . . . . . . . .
56
5.3.1
Necessary versus sufficient conditions . . . . . . . . . . . . . . . .
57
5.3.2
Local versus global optimal points . . . . . . . . . . . . . . . . . .
58
5.4
Proof of Lagrange’s Theorem . . . . . . . . . . . . . . . . . . . . . . . .
58
5.5
The constraint qualification . . . . . . . . . . . . . . . . . . . . . . . . .
61
5.6
The Lagrangean multipliers . . . . . . . . . . . . . . . . . . . . . . . . .
63
5.7
The cookbook approach . . . . . . . . . . . . . . . . . . . . . . . . . . .
64
5.7.1
The Lagrangean . . . . . . . . . . . . . . . . . . . . . . . . . . . .
65
Worked examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
66
Learning outcomes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
71
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
71
Solutions to exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
73
5.8
6 Optimisation under inequality constraints
83
Essential reading . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
83
Aims and objectives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
83
6.1
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
83
6.2
Statement of the problem . . . . . . . . . . . . . . . . . . . . . . . . . .
84
6.2.1
The standard form . . . . . . . . . . . . . . . . . . . . . . . . . .
84
6.2.2
Effective constraints . . . . . . . . . . . . . . . . . . . . . . . . .
84
The Theorem of Kuhn and Tucker . . . . . . . . . . . . . . . . . . . . . .
85
6.3.1
Necessary versus sufficient conditions . . . . . . . . . . . . . . . .
86
6.4
Proof of the Kuhn-Tucker Theorem . . . . . . . . . . . . . . . . . . . . .
86
6.5
The mixed constraint problem . . . . . . . . . . . . . . . . . . . . . . . .
88
6.6
The constraint qualification . . . . . . . . . . . . . . . . . . . . . . . . .
88
6.7
The Kuhn-Tucker multipliers . . . . . . . . . . . . . . . . . . . . . . . . .
90
6.8
The cookbook approach . . . . . . . . . . . . . . . . . . . . . . . . . . .
91
6.8.1
The Lagrangean . . . . . . . . . . . . . . . . . . . . . . . . . . . .
92
Worked examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
93
Learning outcomes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
98
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
98
Solutions to exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
100
6.3
6.9
7 Finite Horizon Dynamic Programming
127
Essential reading . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
127
Aims and objectives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
127
iii
Contents
7.1
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
127
7.2
Statement of the problem . . . . . . . . . . . . . . . . . . . . . . . . . .
131
7.3
Strategies . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
132
7.4
Value functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
134
7.5
The method of backwards induction . . . . . . . . . . . . . . . . . . . . .
135
7.6
The Bellman equations . . . . . . . . . . . . . . . . . . . . . . . . . . . .
137
7.6.1
Suitable conditions for the Bellman equations . . . . . . . . . . .
140
7.7
A cookbook approach . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
141
7.8
Worked examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
142
Learning outcomes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
148
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
149
Solutions to exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
152
8 Infinite Horizon Dynamic Programming
169
Essential reading . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
169
Aims and objectives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
169
8.1
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
169
8.2
Statement of the problem . . . . . . . . . . . . . . . . . . . . . . . . . .
170
8.3
Strategies . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
171
8.4
Existence of an optimal strategy . . . . . . . . . . . . . . . . . . . . . . .
173
8.5
Worked example
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
175
Learning outcomes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
178
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
179
Solutions to exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
182
A Examination advice
191
General remarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
191
Showing your working . . . . . . . . . . . . . . . . . . . . . . . . . . . .
191
Covering the syllabus . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
191
Expectations of the examination paper . . . . . . . . . . . . . . . . . . .
191
Examination technique . . . . . . . . . . . . . . . . . . . . . . . . . . . .
192
B Sample examination paper 1
193
C A guide to answering Sample examination paper 1
199
D Sample examination paper 2
215
iv
1
Chapter 1
Introduction
1.1
Introduction
Optimisation theory is a Level 3 course (also known as a ‘300 course’) offered on the
Economics, Management, Finance and the Social Sciences (EMFSS) suite of
programmes. It is based on a similar course at LSE where it is offered as a second or
third year course in the Department of Mathematics.
Optimisation problems occur a lot in finance, economics, and operations research, as
well as a lot of applied mathematics courses. Because of this, students feel quite familiar
with the nature of the problems, which is sometimes an obstacle. Many students tend to
see optimisation problems as exercises in calculus and algebra, whereas a great deal of
the work involved in the problems seen here is the application of a theory, and not just
a method.
One of the obstacles of this course then is that students may have extensively used the
methods presented here without questioning their validity, accepting the fact that they
will always yield a solution to a given optimisation question. Here, we will not only ask
ourselves when this could fail, but explore the reasons why as well. Instances of failure
prompt us to ask why the theory has failed us, and what can be done to obtain a
solution, and you should expect to be given optimisation problems which offer no
solution.
So this course is not just about learning and applying a method, it is also about
understanding why and when the theory works and in what situations it will fail. You
will also get to see and practise how we can begin to translate the real world into
variables and quantifiable relationships, and hence use mathematics to model, and then
solve, real problems.
I hope that you enjoy studying this course.
1.2
Aims and objectives
This half course is designed to:
enable you to obtain a rigorous mathematical background to optimisation
techniques used in areas such as economics and finance
enable you to understand the connections between the several aspects of continuous
optimisation, and about the suitability and limitations of optimisation methods for
different purposes.
1
1
1. Introduction
1.3
Learning outcomes
At the end of this half course and having completed the Essential reading and activities
you should:
have knowledge and understanding of important definitions, concepts and results,
and how to apply these in different situations
have knowledge of basic techniques and methodologies in the topics covered
have basic understanding of the theoretical aspects of the concepts and
methodologies covered
be able to understand new situations and definitions, including combinations with
elements from different areas covered in the course, investigate their properties, and
relate them to existing knowledge
be able to think critically and with sufficient mathematical rigour
be able to express arguments clearly and precisely.
1.4
How to use this subject guide
This subject guide is not supposed to be a textbook to the course. It is here to help you
interpret the syllabus and give practical applications of the theory. You should read each
chapter to find out what you are expected to know for each area of the syllabus, and
use the questions provided to test your understanding. If you experience any difficulties
with the text, each chapter recommends relevant readings from the set textbook.
Unlike many of the courses available on the International Programmes there is only one
set textbook which you must read for this course; much of the information you need to
learn and understand is contained in examples and activities within the subject guide
itself.
It is important that you appreciate that different topics are not self-contained. There is
a degree of overlap between them as each chapter introduces a new topic in the light of
topics you will have seen in earlier chapters.
In terms of studying this course, the chapters of this subject guide are designed as
self-contained topics of study, but for examination purposes you need to have an
understanding of the subject as a whole, so it is important that you get a good
understanding (including being able to identify your weaknesses) of a topic before
progressing from one chapter to the next. In particular, Chapters 2, 3 and 4 provide
background material for the entire course, while Chapters 5 and 6 and Chapters 7 and 8
are two pairs of subjects which clearly exhibit overlap in nature and content. Thus you
are encouraged to see the common points between equality-constrained and
inequality-constrained optimisation problems, as well as those between finite and
infinite horizon dynamical programming problems.
Every chapter which introduces new material has exercises, with solutions, for you to
test your understanding. A good way to proceed is to read through an exercise and see
if you can answer it without help. If not, try to first use the solutions as suggestions for
your own answer. If this is not possible, then work through the solutions as you would
the text, making sure you question why each step is done. Remember, a solution you do
2
1.5. Structure of the subject guide
not understand is of no use to you. While this course attempts to systematise the
theory into a readily applicable recipe or ‘cookbook’ procedure, you must still
understand the theory to successfully practise the methods here.
At the end of each chapter you will find a checklist of your learning outcomes which is a
list of the main points that you should understand, once you have covered the material
in the chapter and the associated readings. If you still have problems with these
concepts, then you should refer back to Sundaram or other texts for further guidance.
1.5
Structure of the subject guide
Chapter 2 surveys the mathematical preliminaries that students should be familiar
with.
Chapter 3 introduces the Theorem of Weierstrass, which is a key first step in
solving the optimisation problems of the subsequent chapters.
Chapter 4 considers unconstrained optimisation of differentiable functions and
introduces the theorems for first-order and second-order conditions for local optima.
Chapter 5 introduces constrained optimisation, beginning with equality-constrained
optimisation problems and the Theorem of Lagrange, which we then use to develop
a cookbook approach to the solution of such problems.
Chapter 6 continues with the material seen in Chapter 5 and introduces
inequality-constrained optimisation problems and the Theorem of Kuhn and
Tucker. This is then used to develop a cookbook approach to the solution of such
problems.
In Chapter 7 we introduce time into our optimisation problem. Contrasted with the
single-period, single-decision problems of the previous chapters, our problems now
require a sequence of decisions to be made, with earlier decisions affecting what is
optimal at later stages. Necessary properties for a solution are introduced in the
Bellman equations. With these, we develop a program for solving these problems.
Chapter 8 extends the ideas of Chapter 7 a little further and considers what
happens when the the time span becomes infinite. Many of the ideas in this part of
the course are quite technical, and we prefer to give a brief overview of the ideas in
favour of a more practical approach to solving such problems.
1.6
R
Essential reading
You should purchase:
Sundaram, R.K. A First Course in Optimisation Theory. (Cambridge: Cambridge
University Press, 1996) [ISBN 9780521497701].
Each chapter of the subject guide commences by identifying the appropriate chapters
from this textbook.
It should be noted that this subject guide builds on previous knowledge and
understanding you will have gained in studying for the prerequisite courses if you are
studying this course as part of a BSc degree.
Detailed reading references in this subject guide refer to the editions of the set
3
1
1
1. Introduction
textbooks listed above. New editions of one or more of these textbooks may have been
published by the time you study this course. You can use a more recent edition of any
of the books; use the detailed chapter and section headings and the index to identify
relevant readings. Also check the VLE regularly for updated guidance on readings.
1.7
Further reading
Please note that as long as you read the Essential reading you are then free to read
around the subject area in any text, paper or online resource. To help you read
extensively, you have free access to the virtual learning environment (VLE) and
University of London Online Library (see below).
R
For the pure mathematics background, a good book to refer to is:
Rudin, W. Principles of Mathematical Analysis (International Series in Pure &
Applied Mathematics). (New York; London: McGraw-Hill Publishing Co., 1976)
[ISBN 0070856133].
However, I recommend that you find a textbook that you are comfortable with, as most
will provide the necessary topics needed for this chapter. Different authors have
different writing styles, and you should take some time in finding one whose voice suits
your needs. Of the authors listed in Sundaram’s text,ones which have been useful to my
students are :
R
R
R
R
Bartle, R.G. and D.R. Sherbert Introduction to Real Analysis. (New York: Wiley,
1999) [ISBN 0471321486].
Hewitt, E. and K. Stromberg Real and Abstract Analysis. (New York:
Springer-Verlag, 1965) [ISBN 0387901388].
Royden, H.L. Real Analysis. (New York: Macmillan, 1988) [ISBN 0024041513].
I would also recommend:
Bryant, V. Yet Another Introduction to Analysis. (Cambridge: Cambridge
University Press, 1990) [ISBN 052138835X].
A useful test is to try and identify a particular concept you had difficulty in
understanding and using this as a litmus test for various texts. Finally, do not expect
the answers to all your questions to be found in one textbook. In any course you are
advised to read extensively.
Having said that, there are a number of different textbooks on optimisation theory and,
more generally, on mathematical economics (which tends to cover equality-constrained
and inequality-constrained optimisation problems, and sometimes a little dynamic
programming). You should be aware that some texts will develop procedures which can
be different from the ones here, though they share the same name.
1.8
Online study resources
In addition to the subject guide and the Essential reading, it is crucial that you take
advantage of the study resources that are available online for this course, including the
4
1.8. Online study resources
VLE and the Online Library.
You can access the VLE, the Online Library and your University of London email
account via the Student Portal at:
http://my.londoninternational.ac.uk
You should have received your login details for the Student Portal with your official
offer, which was emailed to the address that you gave on your application form. You
have probably already logged in to the Student Portal in order to register! As soon as
you registered, you will automatically have been granted access to the VLE, Online
Library and your fully functional University of London email account.
If you forget your login details at any point, please email [email protected]
quoting your student number.
1.8.1
The VLE
The VLE, which complements this subject guide, has been designed to enhance your
learning experience, providing additional support and a sense of community. It forms an
important part of your study experience with the University of London and you should
access it regularly.
The VLE provides a range of resources for EMFSS courses:
Self-testing activities: Doing these allows you to test your own understanding of
subject material.
Electronic study materials: The printed materials that you receive from the
University of London are available to download, including updated reading lists
and references.
Past examination papers and Examiners’ commentaries: These provide advice on
how each examination question might best be answered.
A student discussion forum: This is an open space for you to discuss interests and
experiences, seek support from your peers, work collaboratively to solve problems
and discuss subject material.
Videos: There are recorded academic introductions to the subject, interviews and
debates and, for some courses, audio-visual tutorials and conclusions.
Recorded lectures: For some courses, where appropriate, the sessions from previous
years’ Study Weekends have been recorded and made available.
Study skills: Expert advice on preparing for examinations and developing your
digital literacy skills.
Feedback forms.
Some of these resources are available for certain courses only, but we are expanding our
provision all the time and you should check the VLE regularly for updates.
1.8.2
Making use of the Online Library
The Online Library contains a huge array of journal articles and other resources to help
you read widely and extensively.
To access the majority of resources via the Online Library you will either need to use
5
1
1
1. Introduction
your University of London Student Portal login details, or you will be required to
register and use an Athens login:
http://tinyurl.com/ollathens
The easiest way to locate relevant content and journal articles in the Online Library is
to use the Summon search engine.
If you are having trouble finding an article listed in a reading list, try removing any
punctuation from the title, such as single quotation marks, question marks and colons.
For further advice, please see the online help pages:
http://www.external.shl.lon.ac.uk/summon/about.php
1.9
Examination advice
Important: the information and advice given here are based on the examination
structure used at the time this guide was written. Please note that subject guides may
be used for several years. Because of this we strongly advise you to always check both
the current Regulations for relevant information about the examination, and the VLE
where you should be advised of any forthcoming changes. You should also carefully
check the rubric/instructions on the paper you actually sit and follow those instructions.
The examination paper for this course is two hours in duration and you are expected to
answer four questions, from a choice of five. The Examiner attempts to ensure that all
of the topics covered in the syllabus and subject guide are examined. Some questions
could cover more than one topic from the syllabus since the different topics are not
self-contained. A Sample examination paper appears as an appendix to this subject
guide, along with a sample Examiners’ commentary.
The Examiners’ commentaries contain valuable information about how to approach the
examination and so you are strongly advised to read them carefully. Past examination
papers and the associated reports are valuable resources when preparing for the
examination.
In general, it is advisable to read through all the questions in the paper first before
attempting to answer any questions. You should also allow yourself equal time for all
questions which you attempt since they will have equal marks attached.
You are advised to attempt all the exercises at the end of the chapters in order to gain
sufficient mastery of the topics for the examination. In particular, the exercises provided
in Chapters 5–8 are indicative of the level (if not the style) of the examination questions
for this course. A fully worked out optimisation question as presented in one of these
chapters is indicative of the content of an examination question in these topics.
However, please note that the solutions given in these chapters give far more
explanation than is required in an examination answer. For guidance on the length and
content expected in an answer to an examination question, please refer to Appendix B.
Remember, it is important to check the VLE for:
up-to-date information on examination and assessment arrangements for this course
where available, past examination papers and Examiners’ commentaries for the
course which give advice on how each question might best be answered.
6
1.10. Syllabus
1.10
Syllabus
Prerequisite: if taken as part of a BSc degree, then you must have passed 116
Abstract mathematics before this half course may be attempted. You are also
strongly encouraged to take 41 Advanced mathematical analysis.
This course aims to bring together several parts of the wide area of mathematical
optimisation, as encountered in many applied fields. The course concentrates on
continuous optimisation, and in this sense extends the theory studied in standard
calculus courses. The emphasis here will be on the mathematical ideas and theory used
in continuous optimisation. This course covers the following topics:
Introduction and review of relevant parts from real analysis, with emphasis on
higher dimensions.
Weierstrass’ Theorem on continuous functions on compact set.
Review with added rigour of unconstrained optimisation of differentiable functions.
Lagrange’s Theorem on equality-constrained optimisation.
The Kuhn-Tucker Theorem on inequality-constrained optimisation.
Finite and infinite horizon dynamic programming.
1.11
Acknowledgments
I would like to acknowledge the work of Prof. Jan van den Heuvel, Prof. Graham
Brightwell and Prof. Steve Alpern who lectured this course at the LSE before I taught
it. Between them they laid much of the foundation of this course, and hence this subject
guide would not exist without their efforts.
1.12
List of abbreviations used in this subject guide
ECOP
Equality-constrained optimisation problem
FHDP
Finite Horizon (Markovian) Dynamical Programming Problem
FOC
First-order conditions
ICOP
Inequality-constrained optimisation problem
MCOP
Mixed-constraint optimisation problem
SDP
Stationary Discounted Programming Problem
SOC
Second-order conditions
7
1
1
1. Introduction
8
Chapter 2
Mathematical preliminaries
2
Essential reading
R
Sundaram, L.R. A First Course in Optimisation Theory. (Cambridge University
Press, 1996)Chapter 3.
Further reading
R
Anthony, M. Subject guide 41 Advanced mathematical analysis.
Aims and objectives
In this chapter we will review the basic concepts that are used in the theorems (and
proofs) of the later chapters. A lot of these will have been seen in 116 Abstract
mathematics, while much of the rest builds upon these ideas. Students who are also
studying 41 Advanced mathematical analysis will recognise all the topics here.
All of the material here is covered more extensively in Chapter 3 of Sundaram, and
students who would like some insight into the definitions and results we simply state
here should look there first. More detailed discussions can be found in the references
given in the previous chapter.
2.1
Introduction
You are expected to have seen 116 Abstract mathematics, and may also be studying
41 Advanced mathematical analysis. What follows is a brief review of the subject
matter we assume you are familiar with.
Here we will introduce the basic definitions and results that we will use without
question in this subject guide. They are therefore examinable only to the extent they
are used in the statement of results and their proofs. Learning activities have been
provided to highlight some aspects of these concepts which are interesting or which have
proven tricky for students. While no answers have been provided, these questions are
tackled in almost any undergraduate textbook on real analysis.
9
2. Mathematical preliminaries
2.2
2
Vector calculus
A word on notation. We will not distinguish between scalars and vectors in this
course. Thus x can be used to refer to either a real number or to a point in Rn and the
you are expected to understand from the context which it is. Because we will need to
talk about sequences of vectors, we will modify our vector notation and use
superscripts to indicate vector indices. Thus a point y ∈ Rn is a n-tuple of
co-ordinates y i , i = 1, . . . , n, which is written as:
y = y1, . . . , yn .
Notice that we write vectors in rows but they are to be considered as column vectors. If
we wish to refer to y as a row vector, we will denote it y 0 .
2.2.1
Inner products
The inner, or dot, product of two vectors x, y ∈ Rn will be denoted x · y instead
of the
√
more cumbersome hx, yi notation. The norm of a vector x, is then kxk = x · x. We can
define the distance d(x, y) between two vectors x, y ∈ Rn as the norm of their difference:
d(x, y) = kx − yk. We now have enough to state a very important result for vector
analysis:
Theorem 2.1 (The Triangle Inequality) For any vectors x, y, z ∈ Rn we have
d(x, z) ≤ d(x, y) + d(y, z).
Equivalently, for any vectors u, v ∈ Rn , we have
ku + vk ≤ kuk + kvk .
Activity 2.1 It should not be beyond you to prove the Triangle Inequality above,
and the equivalence of the two forms. Once you have done this, use the result to
prove the following inequality: for any x, y, z ∈ Rn , we have
|d(x, y) − d(y, z)| ≤ d(x, z).
Equivalently: for any x, y ∈ Rn we have
|kxk − kyk| ≤ kx − yk .
Finally, a useful result you should have seen in either an analysis course or a linear
algebra course:
Theorem 2.2 (The Cauchy-Schwartz Inequality) For any x, y ∈ Rn we have
|x · y| ≤ kxk kyk .
10
2.3. Analysis
2.3
Analysis
Because we will often be working in Rn , the notation we will use throughout this course
is slightly different from that of 41 Advanced mathematical analysis. Subscripts
will once again be used to denote indices in a sequence; however, vector coordinate
indices will now be indicated by superscripts. Furthermore, vectors will be denoted by
the traditional parentheses, while sequences are denoted by braces.
A sequence of real numbers is denoted {xk }, where xk ∈ R, ∀k ∈ Z
We can also denote a sequence of n-dimensional vectors with {xk }; however, we
now have xk ∈ Rn , i.e. xk = (x1k , . . . , xnk ), ∀k ∈ Z.
Definition 2.1 (Bounded sequences) A sequence {xk } is
1. bounded above if and only if there exists an M such that xk ≤ M for all k ∈ N.
2. bounded below if and only if there exists a L such that xk ≥ L for all k ∈ N.
If {xk } is bounded both above and below, then it is said to be bounded.
Definition 2.2 (Convergence of a sequence) A sequence {xk } of points xk in Rn
converges to a point x (as k tends to ∞) if and only if: for any ε > 0 there exists an
integer K such that kxk − xk < ε for all k > K.
In this case, we write xk −→ x, where the point x is said to be the limit of the sequence
{xk }.
We can approach this definition another way by first defining a null sequence:
Definition 2.3 (Null sequences) A sequence of real numbers {xk } is a null
sequence if and only if: for any ε > 0 there exists an integer K such that |xk | < ε for
all k > K.
That is to say a null sequence is a sequence of real numbers which converges to 0 in the
sense of Definition 2.2. With this definition, Activity 2.2 should be a trivial task to
perform.
Activity 2.2 Prove that a sequence of points {xk } in Rn converges to a point x, i.e.
xk −→ x if and only if the sequence {xk − x} is a null sequence, i.e. xk − x −→ 0.
In the real number line, we have the particular example of a non-convergent sequence: a
sequence which diverges to ±∞.
Definition 2.4 (Divergence to infinity) A sequence of real numbers {xk } diverges
to +∞ if and only if: for all M ∈ R there exists an integer K such that |xk | > M for all
k > K.
However, there is no concept of this for general Rn spaces.
We list some of the basic results of sequences you should already know; their proof is
left as an activity.
11
2
2. Mathematical preliminaries
Theorem 2.3 A sequence in Rn has at most one limit.
2
Theorem 2.4 A sequence {xk } in Rn converges to x if and only if xik −→ xi , for all
i = 1, . . . , n.
Activity 2.3 Prove Theorems 2.3 and 2.4.
One last property of sequences which should be familiar to you is:
Theorem 2.5 Suppose a sequence {xk } of real numbers is monotone
non-decreasing (so that x1 ≤ x2 ≤ . . .) and there exists some number M ∈ R such
that xk ≤ M for all k ∈ N. Then the sequence has a limit x, where x ≤ M .
That is to say, an increasing sequence that is bounded above has a limit. Similarly, a
decreasing sequence that is bounded below has a limit.
2.4
2.4.1
Basic set theory
Subsets of R
Definition 2.5 (Upper and lower bounds) Suppose we have a non-empty set A ⊆ R.
An upper bound of A is a real number u such that u ≥ a for all a ∈ A.
A lower bound of A is a real number l such that l ≤ a for all a ∈ A.
If a set has an upper bound, we say it is bounded above, and similarly for bounded
below. If a set is both bounded above and below, then we simply say that the set is
bounded.
Activity 2.4 Prove that a non-empty set A ⊆ R is bounded if and only if there
exists an M > 0 such that |a| < M for all a ∈ A.
Definition 2.6 (Supremum and infimum) Suppose we have a non-empty set A ⊆ R.
If A has at least one upper bound, then there exists a least upper bound, which is
called the supremum of A, and is denoted sup(A).
If A has at least one lower bound, then there exists a greatest lower bound, which
is called the infimum of A, and is denoted inf(A).
Definition 2.7 (Maxima and minima) Suppose we have a non-empty set A ⊆ R.
If A has an element which is greater than or equal to all elements a ∈ A, then this
element is called the maximum of A, and is denoted max A. Thus max A ≥ a for
all a ∈ A.
12
2.4. Basic set theory
If A has an element which is less than or equal to all elements a ∈ A, then this
element is called the minimum of A, and is denoted min A. Thus min A ≤ a for all
a ∈ A.
The definitions of supremum and infimum are not easy to use when answering
questions, so it is helpful to identify certain properties of them.
Property 2.1 If A is non-empty subset of R that is bounded above, then the
supremum of A is the unique point σ such that:
1. for all x > σ, x ∈
/ A;
2. for all x < σ there exists an τ ∈ A such that x < τ < σ.
A further property, which is often taken as an axiom for the real numbers, is
Property 2.2 If non-empty subset of A ⊆ R has an upper (lower) bound, then its
supremum (infimum) is well defined.
If we follow the convention that:
if a set A admits no upper bounds, then we set sup A = +∞, and
if a set A admits no lower bounds, then we set inf A = −∞,
then we obtain the result that any non-empty subset of R has an infimum and a
supremum.
From Property 2.2 we can deduce that every Cauchy sequence of real numbers has a
limit. It is more useful to state this property in the following form:
Property 2.3 Let A ⊆ R be a non-empty set with an upper bound. Then there exists
a sequence {ak } in A such that ak −→ sup(A).
Finally, we note that while the supremum (infimum) of a bounded subset of R always
exists, the maximum (minimum) does not have to, though when it does, they are equal.
Property 2.4 Let A ⊆ R be a non-empty set.
If sup(A) ∈ A, then max(A) exists and max(A) = sup(A).
If max(A) exists, then max(A) = sup(A).
A similar property holds for infima and minima.
Activity 2.5 Prove Properties 2.1-2.4 above.
Activity 2.6 Determine which of the following sets are bounded, giving brief
reasons for your answer. Furthermore, indicate where possible the: infimum,
supremum, minimum and maximum of each set.
1. A = {a ∈ Z | a < 4}.
13
2
2. Mathematical preliminaries
2. B = {b ∈ N | b < 4}.
3. C = {c ∈ R | c2 < 4}.
2
4. D = {d ∈ R \ Q | d ∈ [0, 1]}.
5. E = {e ∈ R | d2 − 2d − 2 ≤ 0}.
Activity 2.7 Suppose S, A ⊆ R are non-empty sets such that S ⊆ A. Show that
sup A ≥ sup S and inf A ≤ inf S.
2.4.2
Bounded subsets of Rn
Since we do not have an ordering on Rn if n > 1, we must be cautious in the way we
define boundedness in this setting. In fact, we will generalise the result you should have
proved in §2.4.
Definition 2.8 (Bounded in Rn ) A set S ⊆ Rn is bounded if and only if there exists
an M > 0 such that kxk < M for all x ∈ S.
Anticipating Definition 2.12, we see that Definition 2.8 is equivalent to saying that the
set S is a subset of some open ball B(0, M ).
2.4.3
Functions
Before we move on to a topological discussion of functions and their properties, we first
discuss some set-theoretic aspects of them. In particular, when given a function defined
by its actions on points (in Rn ), what do we mean by the image of a set of points under
this function? Or the preimage of a set of points?
Definition 2.9 (The image of a set) Suppose we are given f : X −→ Y and A ⊆ X,
then the image of the set A under f is:
f (A) = { y ∈ Y | y = f (x), x ∈ A} .
Definition 2.10 (The preimage of set) Suppose we are given f : X −→ Y and
B ⊆ Y , then the preimage of the set B under f is:
f −1 (B) = {x ∈ X | f (x) ∈ B} .
Definition 2.10 should not be confused with the much more restrictive notion of the
inverse of the function f . In particular, note that the preimage is always defined for any
set and any function, although not all functions may allow inverses on their domain.
Activity 2.8 Consider the functions
f : [0, 1] −→ R,
14
f (x) = x2 ,
2.5. Topology
and
g : [−1, 1] −→ R,
g(x) = x2 .
First, recall that a function is more than just a mapping of an input value to an
output value: a domain for the mapping must also be specified. Thus, while f and g
may share the same functional form, they are essentially different functions.
Determine the following sets where possible. If this is not possible, give a brief
indication why.
i) The preimage of [0, 1] under f .
ii) The preimage g −1 ([0, 1]).
iii) The inverse of 1 under f .
iv) The inverse of 1 under g.
v) f −1 ({1}).
vi) The preimage of {1} under g.
As Activity 2.8 indicates, preimages can behave in apparently strange ways if you are
not careful. Activity 2.9 explores some of the properties of preimages, and is
recommended if you have not worked with the definition before.
Activity 2.9 Consider the function f : Rn −→ Rm . Show that for any subsets
A1 , A2 ⊆ Rn and B1 , B2 ⊆ Rm we have:
i) f −1 (B1 ∪ B2 ) = f −1 (B1 ) ∪ f −1 (B2 );
ii) f −1 (B1 ∩ B2 ) = f −1 (B1 ) ∩ f −1 (B2 );
iii) f −1 (Rm /B1 ) = Rn /f −1 (B1 );
iv) A1 ⊆ f −1 (f (A1 ));
v) f (f −1 (B1 )) ⊆ B1 .
Finally, we can use the image set of a function to define boundedness of a function
Definition 2.11 (Bounded function) A function f : X −→ Rm is bounded (above
and below) if and only if f (X) is a bounded set.
2.5
2.5.1
Topology
Open and closed sets
We define the general notion of an open set in Rn in terms of open balls, which are in
fact a special class of open sets.
15
2
2. Mathematical preliminaries
Definition 2.12 (Open balls) For any x ∈ Rn and r > 0, the open ball B(x, r) with
centre x and radius r is the set
2
B(x, r) = {x ∈ Rn | kx − yk < r} .
Definition 2.13 (Open sets) A set U ⊆ Rn is open if and only if for all x ∈ U there
exists an r > 0 such that the open ball B(x, r) ⊂ U .
Rather than appeal to the definition throughout the course, we will usually rely on the
observation you will see in the following activity.
Activity 2.10 Suppose a region S ⊆ Rn is such that:
S = {x ∈ Rn | f (x) < a} ,
where f : Rn −→ R is continuous, and a is an arbitrary real number.
Show that S is open.
Definition 2.14 (Closed sets) A set C ⊆ Rn is closed if and only if for all sequences
{xk } of points xk ∈ C such that xk −→ x, then x ∈ C also.
We may also define closed sets in terms of open sets using the following theorem.
Theorem 2.6 A set C ⊆ Rn is closed if and only if its complement in Rn , Rn /C, is
open.
As with open sets above, the observation in the following activity is usually sufficient
for problems in this course.
Activity 2.11 Suppose a region S ⊆ Rn is such that:
S = {x ∈ Rn | g(x) = a} ,
where g : Rn −→ R is continuous and a is an arbitrary real number.
Show that S is closed.
Show that the result still holds if the equality g(x) = a is replaced with a weak
inequality g(x) ≤ a.
The following results will prove useful for future work.
Theorem 2.7 We consider open subsets of Rn .
1. The intersection of a finite collection of open sets is itself open.
2. The union of an arbitrary collection of open sets is itself open.
Similar results hold for closed sets.
16
2.5. Topology
Theorem 2.8 (Closed set operations) We consider closed subsets of Rn .
1. The intersection of an arbitrary collection of closed sets is itself closed.
2
2. The union of a finite collection of closed sets is itself closed.
Note the skewed symmetry between these two results, which can be explained by De
Morgan’s laws of set operations.
We end this subsection with two definitions which will be important when we introduce
calculus to determine optima.
Definition 2.15 (Interior points of a set) Consider a set S ⊆ Rn . The point x ∈ S is
called an interior point of S if and only if there exists some ε > 0 such that
B(x, ε) ⊂ S. That is, if there exists an open ball centred on x which is contained
entirely within S.
Definition 2.16 (Interior of a set) Consider a set S ⊆ Rn . The collection of all
interior points of S is called the interior of S and is denoted int S.
2.5.2
Compactness
Although it requires a more abstract context beyond the scope of this course for this to
be obvious, compactness of a set is essentially a statement about finiteness. Its most
important role in this course is for the Weierstrass Theorem which is an existence result
for solutions to our optimisation problems. While it should be clear to you that a
function over a finite set of points always has a maximum and a minimum, for more
general settings this may not be the case. This is where compactness and continuity
come into play and we will see this in Chapter 3.
Definition 2.17 (Compactness) A set X ⊆ Rn is compact if and only if, for any
sequence {xk } of points in X, there exists a convergent subsequence {xkn } ⊆ {xk }
whose limit lies in X.
Try to see what this definition states and doesn’t state. From the point of view of
proving a given set X is compact solely using Definition 2.17, it is not enough that you
can find a convergent sequence or, indeed, a sequence which contains a convergent
subsequence: compactness means that you can find a convergent subsequence of any
sequence given to you in X. It is therefore quite a strong requirement of a set so we
should not be surprised that most sets fail this condition. It is by virtue of this stringent
definition that compact sets prove to be so useful in the existence result of Weierstrass.
Before we proceed, there is one very useful characterisation of compact sets in Rn .
Theorem 2.9 (Compactness in Rn ) A set X ⊆ Rn is compact if and only if it is
closed and bounded.
Activity 2.12 Prove Theorem 2.9. Remember, this is an ‘if and only if’ proof, so
you must prove two things.
1. If X ⊆ Rn is compact then it is closed and bounded.
17
2. Mathematical preliminaries
2. If X ⊆ Rn is closed and bounded then it is compact.
2
You will need to be thoroughly familiar with Definition 2.14 and Definition 2.17 to
be able to do this activity.
2.6
Functions
2.6.1
Continuity
Not only do we need workable definitions of continuity, but we also need to have a
collection of continuous functions from which to build up new ones - that is, from which
we can easily ascertain the continuity of a given function. Thus polynomials,
trigonometric functions, exponential and logarithmic functions are continuous.
Furthermore, we have the composition rules of continuous functions to fall back on.
Definition 2.18 (Continuity at a point) A function f : X −→ Rm , X ⊆ Rn , is
continuous at a point a ∈ X if and only if, for any ε > 0 there exists a δ > 0 such
that, for all x ∈ X:
kx − ak < δ
⇒
kf (x) − f (a)k < ε.
An equivalent definition of continuity that is useful for us is:
Definition 2.19 (Continuity at a point) A function f : X −→ Rm , X ⊆ Rn , is
continuous at a point a ∈ X if and only if for all sequences {xk } in X such that
xk −→ x as k −→ ∞, we have f (xk ) −→ f (x).
Definition 2.20 (Continuity on a set) A function f : X −→ Rm , X ⊆ Rn , is a
continuous on a set A ⊆ X if and only if it is continuous at every point a ∈ A.
In particular, if f is continuous at every point of its domain, we then simply say that f
is continuous.
Continuity is the lowest level of ‘nice behaviour’ we like in functions in the sense that
they give ‘nice’ results when applied to other ‘nice’ objects. A very useful example of
this is when we combine continuous functions with convergent sequences (that is, ‘nicely
behaved’ sequences).
Theorem 2.10 Suppose f : X −→ Y is continuous (on X) and {xn } is a convergent
sequence in X with limit x ∈ X. Then the sequence {f (xn )} in Y , defined by applying
the function to each point of the sequence {xn }, is also a convergent sequence in Y with
limit f (x) ∈ Y .
This result can be usefully remembered as ‘the limit of the image is the image of the
limit’. Notationally, it proves that we can take the limiting process through a
continuous function, that is:
lim f (xn ) = f lim xn .
x−→∞
18
x−→∞
2.6. Functions
In Chapter 3 we will see another ‘nice’ result for continuous functions and compact sets
(that is, ‘nicely behaved’ sets).
2.6.2
2
Differentiability
We will avoid discussing derivatives at boundary points and restrict our definitions to
purely interior points. Hence, in this subsection, we will restrict our functions to open
subsets U of Rn .
Definition 2.21 (Differentiability at a point) A function f : U −→ Rm , where
U ⊆ Rn is open, is differentiable at a point a ∈ X if and only if there exists an
m × n matrix A such that for any ε > 0 there exists a δ > 0 such that, for all x ∈ U :
kx − ak < δ
⇒
kf (x) − (f (a)k
< ε.
kx − ak
If A exists, then it is called the derivative of f at a and denoted Df (a).
In the special case where n = m = 1, we will also use the notation f 0 (a).
Definition 2.22 (Differentiability on a set) A function f : U −→ Rm , where U ⊆ Rn
is open, is differentiable on a set A ⊂ U , A open, if and only if f is differentiable at
all points a ∈ A.
If f is differentiable at all points of its domain, U , then f is unambiguously said to be
differentiable.
Theorem 2.11 If f : Rn −→ Rm is differentiable then f is continuous.
Activity 2.13 It is important to note that the converse of Theorem 2.11 does not
hold, and continuous functions are not necessarily differentiable. Consider the
function f : R −→ R defined by f (x) = |x|. Prove using the definitions that
1. f is continuous (on R), but
2. f is not differentiable at 0.
Since the derivative of a function f : Rn −→ Rm is itself a function Df : Rn −→ Rn+m ,
we can discuss its continuity and differentiability.
Definition 2.23 (Continuously differentiable) If f : U −→ Rm is differentiable, and
Df : U −→ Rn+m is continuous, then f is said to be continuously differentiable on
U or simply C 1 .
We have necessary and sufficient conditions to determine whether or not a given
function f : Rn −→ R is a C 1 function; however, to quote this we must review one more
notion.
Definition 2.24 (Partial derivative) Suppose we are given the function f : U −→ R,
where U ⊆ Rn . Denote the i-th standard basis vector by ei , so that eji = 1 if j = i and
eji = 0 if j 6= i. The i-th partial derivative of f at a point a is then the limit, where
19
2. Mathematical preliminaries
it exists:
f (a + tej ) − f (a)
.
t−→0
t
lim
2
If this limit exists, we denote it:
∂f (a)
∂xi
or
∂
f (a).
∂xi
We then have the following:
Theorem 2.12 Given a function f : S −→ R, where S ⊆ Rn is open, then the function
f is C 1 on S if and only if all partial derivatives of f exist and are continuous for all
s ∈ S.
If f is C 1 , then we can write:
Df (s) =
∂f (s)
∂f (x)
,...,
∂x1
∂xn
,
∀s ∈ S.
However, note that this is not how we define the derivative Df (a) of a function f at a
point a.
While a function may be continuously differentiable (and hence continuous) it is not
necessarily the case that the derivative is a continuous function itself.
Activity 2.14 Let f : R −→ R be given by

x = 0,
0,
f (x) =
x2 sin 1 , x 6= 0.
x2
As might be expected from its definition, problems arise around the origin. However,
the function is substantially more well-behaved than you might at first think.
1. Prove, using the definition, that f is continuous at x = 0.
2. Prove that f is differentiable at x = 0.
3. Compare the behaviour of f 0 (0) and limx→0 f 0 (x).
Learning outcomes
This chapter is simply a revision of the key ideas that underpin the ideas in this course.
After working through this chapter you should know the definitions of:
convergent sequences
suprema and infima, maxima and minima
bounded, open, closed, compact subsets of Rn
images and preimages of functions f : Rn −→ Rm
continuous and differentiable functions.
20
Chapter 3
Weierstrass’ Theorem
3
Essential reading
R
Sundaram, L.R. A First Course in Optimisation Theory. (Cambridge University
Press, 1996)Chapter 3.
Aims and objectives
In this chapter we will:
introduce and prove the Theorem of Weierstrass
learn how to use it to prove the existence of solutions to optimisation problems
see what happens when it is not applicable.
3.1
Introduction
While the content and proof of Weierstrass’ Theorem should not be taxing, we will
consider it exclusively in this chapter since it is an extremely important result. First
and foremost, it is an existence result: where applicable, it guarantees that our efforts
will not be fruitless and that there is indeed a solution to the problem under
consideration. Its importance to our work here can be seen by the simple fact that it is
the very first step in our cookbook procedures to constrained optimisation: before you
begin the lengthy procedure of finding (and verifying) a solution, you should get into
the habit of checking that one exists in the first place.
Secondly, it is the first major result of the course. As such, let us take the time to see
the sort of thinking a mathematical result requires: not only in proving it, but also in
reading and understanding it.
After introducing the theorem in §3.2 we will go on to detail how it is read, and thus
how it must be applied. We will use the proof as an opportunity to review how a
student should go about proving a theorem in general.
The essential content of the Weierstrass theorem is simply ‘nice sets will always yield
maxima and minima for nicely behaved functions‘. In set theory, ‘nice sets’ are compact
sets, while from topology, ‘nicely-behaved functions’ are continuous functions. Although
real life applications of these ideas may not readily provide compact sets or continuous
functions on them, it is often more useful to make such approximations to use the full
machinery of mathematics.
21
3. Weierstrass’ Theorem
3.2
The Theorem of Weierstrass
Theorem 3.1 (Weierstrass’ Theorem)
3
Let D ⊆ Rn be a non-empty compact set, and f : D → R be continuous. Then f
has both a maximum and a minimum on D.
First note that mathematical statements should contain all necessary information and
no superfluous details. Thus, there are two suppositions that must hold for the
conclusion to be valid:
1. the set D must be compact; and
2. the function f must be continuous.
Let us consider what happens if we relax these requirements. From the definition of
compactness, Definition 2.17, we see that this means that D fails to be either closed or
bounded. Of course, this is a logical ‘or’, so both conditions can fail at the same time,
but it is conceptually simpler to consider these failures on their own.
Activity 3.1 (Relaxing the requirement of compactness)
1. Suppose a set X ⊆ R is unbounded. Show that there exists a continuous
function f : X −→ R which does not attain a maximum or minimum on X.
2. Suppose a set X ⊆ R is not closed. Show that there exists a continuous function
f : X −→ R which does not attain a maximum or minimum on X.
Thus, to successfully apply Weierstrass’ theorem, you must check that the domain is
both closed and bounded. Fortunately, Chapter 1 of Sundaram has a variety of results
to make this easy, so there is no excuse for you not doing so. Indeed, by determining
that a particular optimisation problem has no solution, you can then avoid the
somewhat arduous task of following our cookbook procedures.
Finally, not that this is simply a sufficient condition (the implication goes one way)
rather than a necessary one (which would require an ‘if and only if’ statement). Thus, it
should be possible, for example, to find a function which is not continuous on a compact
set but nonetheless attains its maximum or minimum. In fact, the following activity
asks you to test your understanding of what the theorem does and does not say.
Activity 3.2 (Understanding the Weierstrass theorem)
Give examples of sets and functions with the following properties:
1. an open set D1 ⊆ R and a continuous function f : D1 −→ R such that f has
both a maximum and a minimum on D1 ;
22
3.3. Proving the theorem
2. a compact set D2 ⊆ R and two functions g + h : D2 −→ R such that neither g
nor h have either a maximum or minimum on D2 but the sum g + h does;
3. a set D3 ⊆ R and a continuous function f : D3 −→ R such that f attains a
maximum but not a minimum on D3 ;
4. a set D4 ⊆ R which is not compact and a function f : D4 −→ R which is not
continuous, but which attains both a maximum and a minimum on D4 .
3.3
Proving the theorem
We will prove two results which, together, give Weierstrass’ Theorem.
3.3.1
Continuity and compactness
Lemma 3.2 If f : D −→ R is continuous and D is compact, then f (D) is also
compact.
Writing our premises in full, we have:
Premise 1: f : D −→ R.
Premise 2: f is continuous, so that if {xn } is a sequence in D such that xn −→ x
then f (xn ) −→ f (x).
Premise 3: D is compact, so that if {yn } is an arbitrary sequence in D then there
exists a subsequence {ynk } ⊆ {yn } such that
1. ynk is convergent, i.e. there exists a y such that ynk −→ y as nk −→ ∞.
2. y ∈ D.
With these premises, we must show that f (D) is a compact set. Specifically, according
to the definition, we must show that f (D) satisfies the following:
If {zn } is an arbitrary sequence in f (D) then there exists a subsequence
{znk } ⊆ {zn } such that:
i) znk is convergent, i.e. there exists a z such that znk −→ z; and
ii) z ∈ f (D).
So, before we begin, what are we not supposed to be proving here? That is, in what
way could we possibly misread the definition? A common mistake that students make is
to jump to the end of the definition, which seems the meatiest. There is more going on
here, so one naturally looks to this first. On the principle that the last thing we read is
the first thing on our minds, let us work back through the definition and uncover some
frequent errors.
1. We are not asked to simply show the existence of a subsequence {znk }.
What sequence {zn } would your sequence {znk } be a subsequence of exactly?
23
3
3. Weierstrass’ Theorem
2. We are not asked to show the existence of a convergent sequence.
3
In fact, every non-empty set admits a convergent sequence: a constant sequence. So
there is nothing special in this. Nor are we saying that we can find a sequence
which contains a convergent subsequence or, indeed, a divergent sequence which
contains a convergent subsequence. These are all rather trivial in spaces with more
than two points.
If these two misreadings sound forced to you, they are actually quite common. Please be
sure to read the entire definition and realise that mathematicians are very economical
when it comes to statements of definitions and theorems, neither missing important
details nor, conversely, adding extraneous ones.
Let us consider a more subtle misreading of the definition.
3. We are not asked to find a sequence which contains a convergent subsequence.
This is perhaps closer to the what the definition is saying than the previous two,
but it is still wrong. The error here is to confuse when we must show the existence
of at least one object exhibiting a certain property, and when we must show the
property holds for all objects. Here, it is not enough to show that we can find at
least one sequence with a convergent subsequence (just take a constant sequence).
We must prove the far stronger claim that given any sequence in, here, f (D) we
can find a convergent subsequence.
With that in mind, let us now consider what the definition is asking us to prove. First,
it is a statement which starts off with:
‘If {zn } is an arbitrary sequence in f (D), then . . . ’
Thus, whatever we must prove, it must be about a sequence {zn }. Put another way, if
in our proof we start working with this arbitrary {zn }, we cannot make any
assumptions about the nature of the sequence. Some things you should certainly
find yourself not saying without thinking very long and hard about include:
1. Suppose zn ≥ 0 for all n ∈ N.
2. Suppose zn −→ z as n −→ ∞.
3. Consider the sequence defined by yn =
√
zn for all n ∈ N.
This list is by no means exhaustive, but it does illustrate the sorts of errors that crop
up entirely too often. At any point, if you need to rely upon a supposed property of the
zn , you must ask yourselves what would happen to your proof in its absence: would
your proof still be valid?
In order to discourage this sort of behaviour, you will generally find textbook proofs fix
arbitrary quantities at the beginning of the proof. We will start off, then, by doing this.
Let {zn } in f (D) be given.
Once this is done, we are no longer able to adjust or prescribe its properties, it is
written in stone and whatever deficiencies it might have for our proof, we will have to
work around this. Given this arbitrary sequence, we see that the definition requires us
to prove two things.
24
3.3. Proving the theorem
We will prove that {zn } contains a convergent subsequence
At this point, we are not concerned with any properties of this subsequence, merely
that it exists.
What tools have we got to prove this? Looking around at the premise of Lemma 3.2, we
have little more information about f (D). In particular, we have no existence statements
about convergent subsequences in this set. However, we do have an existence statement
about convergent sequences for the compact set D, so our first task would be to relate
our sequence {zn } in f (D) to the set D. Specifically, from Premise 3.3.1, we see that we
need to relate it to, at the very least, a sequence in D.
However, by assumption, we have for all n ∈ N, zn ∈ f (D), the image set of D under f .
Thus, for all n ∈ N there must exist a yn ∈ D such that zn = f (yn ). This then defines a
sequence {yn } in D.
Since D is compact, by Premise 3.3.1, we know that there exists a convergent
subsequence {ynk } ⊆ {yn }. Of course, this isn’t enough. First, it is not a convergent
sequence of our original sequence {zn }, and it doesn’t necessarily live in f (D). We can,
though, relate it to a sequence in f (D). Since ynk ∈ D for all k ∈ N, then by the
definition of the image of a set, Definition 2.9, we see that f (ynk ) ∈ f (D) for all k ∈ N.
The sequence {f (ynk )} is a convergent sequence. Why? It is the image of a convergent
sequence under the continuous function f (Premise 3.3.1), so that, by Theorem 2.10, it
is a convergent sequence itself.
The sequence {f (ynk )} is a subsequence of {zn }. If zn = f (yn ) for all n ∈ N, then since
the ynk form a subset of the yn , their image points must form a subset of the zn .
Thus, we have shown that, given an arbitrary sequence {zn } in f (D), then it has a
convergent subsequence {znk }.
We will prove that the limit of this convergent subsequence lies in f (D)
Once again, we have little information about f (D) to conclude anything just yet. If we
consider what results we have regarding the limits of convergent sequences and when we
can expect them to exist in certain sets, we should recall that a closed set is such that
any convergent sequence has its limit in the set. Unfortunately, we do not yet know if
f (D) is closed: in essence, we are trying to prove that.
However, we do know that D is closed. So any convergent sequence in D has limit in D.
Can we relate our convergent sequence {znk } in f (D) to one in D? Well, clearly we can
since it was constructed as the image of a convergent sequence in D. That is, the
subsequence {ynk } is a convergent sequence in D, say with limit y. So y ∈ D. By
continuity of f , Theorem 2.10 tells us that:
lim znk = lim f (ynk ) = f
k−→∞
k−→∞
lim ynk = f (y) ∈ f (D).
k−→∞
And in fact, we are done. We have been given an arbitrary sequence in f (D), and shown
that we can find a convergent subsequence whose limit lies in f (D). Since we have not
made any assumptions about the behaviour or nature of the arbitrary sequence, we can
conclude that this must be true for any sequence in f (D), and therefore we have shown
that this set satisfies the requirements of Definition 2.17 for compactness.
That is, f (D) is compact.
25
3
3. Weierstrass’ Theorem
This was a very detailed look at the proof for compactness. The aim of this was not just
to prove the result, but to get you to think about what is needed when proving results
in general. Notice the heavy reliance on definitions and existing results. There is no
hope of doing anything powerful in mathematics if you do not learn to use the
definitions and build results upon earlier results.
3
Having said that, the above proof has far more exposition than is needed for a straight
proof. Hence, the following activity will test what you have understood of this
subsection so far.
Activity 3.3 Concisely state the proof of Lemma 3.2.
3.3.2
Compactness and boundedness
There is just one more result to conclude Weierstrass’ Theorem.
Lemma 3.3 If D ⊂ R is a non-empty compact set, then D has a maximum and a
minimum.
We will prove that a maximum exists. Here we will not go into as much detail as we did
for the first result, but we will still try to raise as many questions about the process.
First, we are talking about compact sets on the one hand, and maxima and minima on
the other. The former is defined in terms of sequences and convergence, while the latter
are defined in terms of bounds. We therefore need a common ground to discuss these.
Fortunately, we have precisely that in our alternative characterisation of real compact
sets, Theorem 2.9. D is a non-empty bounded subset of R. In particular, from Property
2.2 of suprema, we know that D has a supremum. This will ultimately give us our
maximum.
So far we have used the fact that D is bounded, and that it is non-empty, but we have
one remaining property we have not yet used. D is closed. Now, if this were not
important, then we would not have specified it in the statement of the lemma (no
extraneous detail, remember), so we must need it somewhere. Remember, a set being
closed is a statement about limits of convergent sequences. A set having a maximum is
a statement, essentially about suprema: specifically that the set contains its supremum.
Combining these two statements suggests that we could try and construct a sequence in
D with limit sup D.
From our properties of suprema, we see that Property 2.1 is the most promising. This
says that we can always find points in D as close to, and not equal to, sup D. So let us
use this to construct a sequence {xn } in the following way. For each n ∈ N, consider the
interval:
1
Dn = sup D − , sup D .
n
From Property 2.1, the intersection Dk ∩ D must be non-empty for all n ∈ N. If it were
not, say for n0 , then we would have found an upper bound sup D − n10 < sup D, which
contradicts the definition of sup D as the lowest upper bound.
26
3.4. Worked examples
For each n ∈ N pick a point xn ∈ Dn ∩ D. This forms a sequence {xn } in D. Moreover:
1
1
xn ∈ sup D − , sup D
⇒
|xn − sup D| < .
n
n
Thus, as n −→ ∞ we see that xn −→ sup D.
So {xn } is a convergent sequence in D with limit sup D. Since D is closed, we must have
that sup D ∈ D, but then by Property 2.4 of suprema, this implies that D has a
maximum.
The proof for a minimum is very similar to this.
Activity 3.4
3.4
Complete the proof of Lemma 3.3 by showing that D has a minimum.
Worked examples
Example 3.1
Suppose we wish to determine whether the function f (x, y) = x2 − y 2 has a
maximum and a minimum on the unit circle, i..e on the set:
C = (x, y) ∈ R2 x2 + y 2 = 1 .
To apply the Weierstrass Theorem, we require C to be a compact set and f to be a
continuous function.
Is C compact?
Rather than use Definition 2.17 to test for compactness, we opt to use the simpler
real space characterisation found in Theorem 2.9. That is, we must ask ourselves:
Is C closed?
Since membership of the set C is defined purely in terms of one equality, the
result of Activity 2.11 tells us that C is indeed closed.
Is C bounded?
We will appeal to the result of Activity 2.4 to determine boundedness of the set
C. There we see that we must find some upper bound M on the magnitude of
elements in C, that is, for any x ∈ C, we have kxk < M . However, looking at
the membership requirement to be in the set C:
(x, y) ∈ C
⇔
x2 + y 2 = 1
⇔
k(x, y)k = 1.
So taking M > 1 suffices to show that C is bounded.
Thus, the set C ⊆ R2 is closed and bounded, and hence compact.
Is C non-empty?
This is easily seen by giving a simple example of a point in C, say, (1, 0).
Is the function f : C −→ R continuous?
27
3
3. Weierstrass’ Theorem
f (x) is a second-order polynomial in two variables, and hence is continuous.
Thus, f is a continuous function acting on the compact set C and the Weierstrass
Theorem tells us that a maximum and minimum for this function must exist in C.
3
Example 3.2 Suppose the function f : R+ −→ R is a continuous function such
that:
1. f (0) = 1, and
2. limx−→∞ f (x) = 0
Show that f must have a maximum on R+ .
Note that the domain of f , R+ = [0, ∞) is not bounded and hence not compact.
Thus, we cannot immediately apply Weierstrass’ Theorem. However, rather than
dismiss this result entirely, we will show that we can decompose our domain into a
compact set and a set that is, in some sense, negligible in that it does not contain
any candidates for a maximum. We will do this by picking a suitable value of the
function f to split the set into two.
First, note that limx−→∞ f (x) = 0 has a very precise meaning. Specifically, for any
ε > 0, there exists an N > 0 such that for all x > N we have:
ε > |f (x) − 0| = |f (x)| .
Thus, past a finite point N the behaviour of f can be considered negligible. For the
remainder of the domain, R+ /(N, ∞) = [0, N ], we are left with a closed and
bounded interval of R. This is compact, while, by assumption, f is continuous.
Hence, by Weierstrass’ Theorem, f has a maximum on [0, N ].
Since we wish to find a maximum on R+ , and this is the only maximum whose
existence we can guarantee, we would like to show that the maximum on [0, N ] is
indeed a maximum on R+ . That is, even if f has a maximum on (N, ∞), it should
be less than or equal to that on [0, N ].
Since we know f (0) = 1, an easy way to ensure (N, ∞) is inconsequential to our
analysis, is to ensure that any value f takes on it is no greater than f (0). We can do
this by picking ε = f (0) > 0. From above, this defines a critical value N past which
the function f (x) is no greater than 1 in magnitude (and hence no greater than 1 in
value). Weierstrass’ Theorem asserts the existence of an x? such that f (x? ) is a
maximum of f on [0, N ] and for which we have:
f (x? ) ≥ f (0) = 1 > f (x),
∀x ∈ (N, ∞).
Thus, f has a maximum at x? not only on [0, N ] but on R+ .
Sadly, we simply do not have enough to ensure the existence of a minimum. For
example, consider f (x) = exp (−x2 ). This is strictly positive for all x ∈ R+ and has
limit 0 as x −→ ∞. However, it never reaches this limit, simply getting smaller and
smaller, so that a minimum does not exist.
28
3.4. Learning outcomes
Learning outcomes
At the end of this chapter and the relevant reading you should be able to:
state and prove the Theorem of Weierstrass
apply Weierstrass’ Theorem to examples
show the impossibility of maxima and minima in certain situations.
3
Exercises
The exercises here are intended to help consolidate the examinable material that you
have seen in this chapter. Thus, whatever you are asked to do or are tested upon in these
exercises should be expected in an exam. However, these exercises are not indicative of
the style of examination questions. Put simply, these exercises indicate what we might
ask you to do in an exam, though not necessarily how we will ask you to do so. For
that, you should refer to the sample examination questions in Appendices B and D.
Similarly, the solutions provided are generally of a much higher level of detail than is
expected of you in an examination, and you are referred to Appendix C for guidance.
Exercise 3.1
Suppose the set D ⊂ Rn contains a finite number of points. Show that every function
f : D −→ R has a maximum or minimum on D. Is Weierstrass’ Theorem applicable
here?
Exercise 3.2
In Example 3.2 what can you say about the existence of a minimum? Justify your
answer, hence:
if you believe that a minimum must exist, then give a proof
if you believe that a minimum may not exist, then give a counter-example.
Exercise 3.3
Suppose the function f : R −→ R is a continuous function such that limx−→±∞ f (x) = 0.
i) Show that f must have a minimum or a maximum on R.
ii) Give an example that shows that it is not true in general that f has both a
maximum and a minimum on R.
Exercise 3.4
Suppose a monopolist faces:
a downward sloping inverse-demand curve p(x) such that p(0) < ∞ and p(x) ≥ 0
for all x ≥ 0, and
29
3. Weierstrass’ Theorem
a cost curve c(x) such that c(x) ≥ 0 for all x ≥ 0 and c(0) = 0.
Assume that both functions p, c : R+ −→ R are continuous.
The monopolist must maximise profit:
π(x) = p(x)x − c(x),
3
subject to the non-negativity constraint x ≥ 0.
i) Prove that a solution exists to the firm’s maximisation problem if there exists an
x? > 0 such that p (x? ) = 0.
ii) Prove that a solution exists to the firm’s maximisation problem if there exists an
x? > 0 such that c(x) ≥ p(x)x for all x ≥ x0 .
iii) Consider the case where c(x) −→ ∞ as x −→ ∞ and the demand curve is
infinitely elastic, (that is, when p : R −→ {p̄}, for some p̄ ∈ R). Does a solution
exist in this case? If not, give a counter-example.
Exercise 3.5
Consider the expenditure minimisation problem of the consumer. Suppose, in an n-good
market we have a vector of prices p = (p1 , . . . , pn ). The consumer has a utility function
u : Rn+ −→ R and wishes to minimise their expenditure subject to attaining some fixed
minimum level of utility ū. Thus, we have the problem:
Minimise p · x subject to u(x) ≥ ū.
i) If u : Rn+ −→ R is continuous and pi > 0 for all i = 1, . . . , n, show that the
minimisation problem has a solution.
ii) What happens if either the function u fails to be continuous or at least one of the
prices is zero?
Exercise 3.6
Show that the function f : R2+ −→ R given by:
f (x, y) =
yx2
x4 + y 4 + 1
has a maximum on its domain
Solutions to exercises
Solution to exercise 3.1
We have to prove a statement about every possible function one can define on the set
D, so you certainly should not be starting your proof with something along the lines of
30
3.4. Solutions to exercises
‘Let f be the function . . . ’. The way in which we shall proceed is to let an arbitrary
function f : D −→ R be given, and prove the result for this f . Since we will not have
assumed anything about the behaviour of f , we can conclude that our result holds for
all real-valued functions defined on D.
So, let f : D −→ R be given, and write D in of the form {x1 , . . . , xk }. Applying f to all
the points in D yields the image f (D) = {f (x1 ), . . . , f (xk )}, i.e. another finite set. Thus
f (D) contains its maximum and minimum, and thus f attains its maximum and
minimum on D.
In fact, the result also follows from Weierstrass’ Theorem. Any finite set is
automatically compact, and we can show that every function defined on a finite set is
continuous. To do so, let a sequence {xk } in D such that xk −→ x be given. We must
show that f (xk ) −→ f (x).
However, as there are finitely many points in D, we claim that the convergent sequence
{xk } must be eventually constant. Since D is finite, then so must the set:
{kx − yk | x, y ∈ D, x 6= y } .
Thus, it must have a well-defined minimum, which we shall denote δ > 0. From the
definition of convergence for {xk }, we then have a K > 0 such that for all k > K, we
have kxk − xk < δ. This can only be the case if xk = x for all k > K, and hence we have
that the sequence {xk } is eventually constant.
The sequence {f (xk )} must also be eventually constant and hence trivially satisfy the
definition of convergence.
So f is continuous. Since f was an arbitrary function, we conclude that all functions
defined on D are continuous.
Finally, if all functions are continuous, the Weierstrass’ Theorem applies to all functions
defined on a finite space D.
Solution to exercise 3.2
Sadly, we simply do not have enough to ensure the existence of a minimum. For
example, consider f (x) = exp (−x2 ). This has initial value f (0) = 1, is strictly positive
for all x ∈ R+ and has limit 0 as x −→ ∞. However, it never reaches this limit, simply
getting smaller and smaller, so that a minimum does not exist.
Solution to exercise 3.3
i) First, there is some ambiguity in the statement as to what will occur. Your answer
should indicate under what conditions you expect a maximum will occur, and
under what conditions you expect a minimum to occur.
Secondly, Weierstrass’ Theorem is not immediately applicable since R is not
compact. However, as we did in the previous exercise, we can again attempt to
decompose the domain into a compact subset and a set which is negligible to our
analysis. Recall that to do this we picked a value of f to split the domain into two.
So, pick an arbitrary x̄ and consider f (x̄). We need to show that either a
maximum occurs or a minimum occurs. If the function happens to be f (x) = 0 for
31
3
3. Weierstrass’ Theorem
all x ∈ R, then we have the special case where the maximum and minimum values
are both the same and occur at all points of the domain.
However, if not, suppose we have f (x̄) > 0. Then either:
1. the function has a maximum but no minimum, such as in the case of
f (x) = exp (−x2 ), or
3
2. the function has both a maximum and a minimum, such as in the case of
x
f (x) =
.
1 + x2
Thus, if f (x̄) > 0 for some x̄ ∈ R, we claim that a maximum of f must exist on R.
Similarly, if f (x̄) < 0 for some x̄ ∈ R, we claim that a minimum of f must exist on
R.
We shall now consider the case of a maximum (the proof for a minimum can be
derived easily from this).
Supposing that f (x̄) > 0 for some x̄ ∈ R, we wish to split the domain R into:
R = {x ∈ R | f (x) ≥ f (x̄)} ∪ {x ∈ R | f (x) < f (x̄)}
{z
} |
{z
}
|
Y
Z
where we claim that the set Y is compact, while the set Z is infinite.
f (x̄)
Turning to the limiting behaviour of f , let ε =
> 0. (Why have we not used
2
ε = f (x̄) > 0, as we did in the previous exercise?) Since lim f (x) = 0, there
x−→−∞
exists an x− such that for all x < x− :
|f (x)| <
f (x̄)
2
⇔
−
f (x̄)
f (x̄)
< f (x) <
.
2
2
Similarly, since limx−→+∞ f (x) = 0, there exists an x+ such that for all x > x+ :
|f (x)| <
f (x̄)
2
⇔
−
f (x̄)
f (x̄)
< f (x) <
.
2
2
These two points thus define a compact interval [x− , x+ ]. Furthermore, by our
choice of ε, we must have that x̄ 6= x− , x+ is an element of [x1 , x2 ], which is
therefore non-empty. Thus, by Weierstrass’ Theorem, the continuous function f
has a maximum on [x− , x+ ], say at x? .
Since x? ∈ [x− , x+ ], we must have:
f (x? ) ≥
f (x̄)
> f (x),
2
∀x ∈
/ [x− , x+ ].
So x? is indeed a global maximum for the function f .
2
ii) We have already seen a simple example of such a function: f (x) = exp − x2 .
This has a global maximum at x = 0 with value 1, but no minimum.
32
3.4. Solutions to exercises
Solution to exercise 3.4
Before we start looking at the first part of the exercise, we can already write down a lot
of information about the profit function
π(x) = xp(x) − c(x).
3
So let us do that.
We are given that p(x) is such that:
1. p is continuous,
2. p(0) < ∞,
3. non-increasing, and
4. non-negative, i.e. p(x) ≥ 0, ∀x ∈ R+ .
That is, it starts from a finite point and decreases (in a continuous manner) towards
zero. Furthermore, we also have that c(x) is such that
1. c is continuous,
2. c(0) = 0,
3. c(x) ≥ 0, ∀x ∈ R+ .
Thus c is a continuous function that starts at the origin and always remains
non-negative.
Immediately we see that π(x) is the product and difference of continuous functions on
R+ and thus is itself a continuous function on R+ , which is an unbounded, and hence
non-compact, domain. Furthermore, we note that
π(0) = 0 · p(0) − c(0) = 0 − 0 = 0.
i) Suppose we have an x? > 0 such that p(x? ) = 0. We are asked to apply the
Weierstrass Theorem to obtain the desired result, but our observation above that
the function is defined on a non-compact domain defies us from immediately doing
so. What we would like to do is restrict our problem to a compact sub-domain
without drastically changing the nature of the problem. That is, can we omit a
large part of this unbounded domain?
This is possible if we can show that the maximum, if it is to occur at all, must
occur within in a compact region of the domain.
First, at the point x = x? , we are given that p(x? ) = 0, and we can then deduce
that:
π(x? ) = x? · p(x? ) − c(x? ) ≤ 0.
>0
=0
≥0
Moreover, since we know p is non-increasing and non-negative on its domain, it
must be the case that p(x) = 0 ∀x ≥ 0. Thus π(x) ≤ 0 for all x ≥ x? .
33
3. Weierstrass’ Theorem
From both this observation and the fact that π(0) = 0, we are thus able to restrict
our consideration of π(x) to the closed and bounded interval [0, x? ]. Thus, we
know have:
i π(x) is continuous on R+ , and hence continuous on [0, x? ] ⊆ R+
ii [0, x? ] is non-empty, since x? > 0
iii [0, x? ] is compact.
3
Weierstrass’ Theorem can then be applied to conclude that π must therefore have
a maximum on [0, x? ].
Furthermore, we have that:
max π(x) ≥ π(0) = 0 ≥ π(x),
x∈[0,x? ]
∀x ∈
/ [0, x? ].
Thus, this maximum is a global maximum.
ii) This can be considered in much the same way. Suppose there exists an x0 > 0 such
that c(x) ≥ xp(x) ∀x ≥ x0 . Then
∀x ≥ x0 ,
π(x) = x · p(x) − c(x) ≤ 0,
while
π(0) = 0 ≥ π(x),
∀x ≥ x0 .
As before, we can thus restrict our consideration of π to the sub-domain [0, x0 ].
i [0, x0 ] 6= ∅ since x0 > 0;
ii [0, x0 ] is compact;
iii π is continuous on [0, x0 ].
By Weierstrass, we can conclude that π attains a maximum on [0, x0 ]. Furthermore:
max0 π(x) ≥ π(0) = 0 ≥ π(x),
∀x ≥ x0 .
x∈[0,x ]
So this is indeed a global maximum.
√
iii) Consider the situation where c(x) = x. It should be easy to show that π(x)
cannot attain a maximum on an unbounded set.
Solution to exercise 3.5
i) To guarantee the existence of a minimum, we shall appeal to Weierstrass’
Theorem.
P
First, the function we wish to minimise is the linear function p · x = ni=1 pi xi ,
which should be obviously continuous to you. We are restricting this to the set:
X(ū) = x ∈ Rn+ u(x) ≥ ū = {x ∈ Rn | u(x) ≥ ū, xi ≥ 0, i = 1, . . . , n} .
Since this is defined in terms of weak inequalities, X(ū) is closed. However, it is in
general not bounded, since one can reasonably expect utility functions to be
34
3.4. Solutions to exercises
increasing in the xi , and we simply have a lower bound on utility here. Even if we
may not always have this, our problem has not given enough information to
restrict our utility function in this manner, so we must consider the general case.
As we have done in previous exercises, we shall try to find a compact subset of the
domain which we can guarantee contains the minimum of the problem. Again, this
is done by picking an arbitrary value of the function to be minimised to
decompose X(ū).
So, consider an arbitrary vector x
b ∈ X(ū). Thus, x
bi ≥ 0 for all i = 1, . . . , n, and
u (b
x) ≥ ū. The minimum of f on X(ū) can be no greater than f (b
x) = p · x
b. We
then decompose X(ū) into the union of:
Y = x ∈ Rn+ u(x) ≥ ū, p · x ≤ f (b
x) ,
Z = x ∈ Rn+ u(x) ≥ ū, p · x > f (b
x) .
We claim that Y is compact. Since it is defined by weak inequalities, it is closed.
To show that it is bounded, consider an arbitrary element y ∈ Y . This has
magnitude:
v
u n
uX
yi2 .
kyk = t
i=1
Since y ∈ Y , we have p · y ≤ f (b
x). Non-negativity of the yi and positivity of the pi
then imply, for i = 1, . . . , n:
pi y i ≤
n
X
pi yi ≤ f (b
x)
i=1
⇒
yi ≤
f (b
x)
.
pi
Here we must have pi 6= 0 for all i = 1, . . . , n for this bound to be valid. Thus,
v
v
u n
u n
uX f 2 (b
uX 1
x)
t
t
kyk ≤
=
f
(b
x
)
.
2
p
p2i
i
i=1
i=1
Since, for fixed x
b, the left-hand side is a positive constant, we can choose M to be
equal to this so that we have proven, that there exists an M > 0 such that
kyk ≤ M for all y ∈ Y . That is, Y is a bounded set.
Finally, x
b ∈ Y , so that Y is a non-empty, compact set, and Weierstrass’ Theorem
asserts the existence of a minimum of p · x on Y . Suppose this occurs at x? ∈ Y ,
then:
f (x? ) ≤ f (b
x) < f (x),
∀x ∈ Z.
So this indeed provides a minimum on X(ū).
ii) We have already seen what could go wrong if one of the prices pi were equal to
zero. In that case, our argument that Y is a bounded set fails. However, this is not
enough to indicate that problems do occur, and that a minimum might not exist,
so we should come up with a counter-example.
Suppose that a good had a zero price then our solitary expenditure constraint will
do nothing to restrict our consumption of it. If our utility function is increasing in
this good, then we have a costless way of increasing our utility without bound.
35
3
3. Weierstrass’ Theorem
More concretely, suppose we have the expenditure minimisation problem:
Minimise x2 subject to x1 x2 ≥ 1
so that p1 = 0 and p2 = 1 and X(1) = {(x1 , x2 )|x1 x2 ≥ 1}. For any ε > 0 set
x1 = ε−1 and x2 = ε. This satisfies the utility constraint with strict equality.
Furthermore, the infimum of f on X(1) is 0, but this is never attained. Hence, the
problem has no minimum.
3
If the function fails continuity, then you should already have seen situations where
anything can happen, that is minima may or may not occur, depending on the
particular function.
Solution to exercise 3.6
Since the denominator is always greater than or equal to one, so that the quotient is
always well-defined, and the remaining components are just polynomials, the function f
is continuous.
The domain R2+ is defined in terms of the weak inequalities x1 ≥ 0 and x2 ≥ 0 and is
thus closed. However, it is unbounded, so Weierstrass’ Theorem is not immediately
applicable.
As we have done before, we shall try to find a point around which a compact set can be
reasonably expected to contain a maximum for the function (on the entire domain).
Since the function f (x, y) ≥ 0 for all (x, y) ∈ R+ , picking any vector with zero
components will not help us a great deal. So, failing that, we consider integer
components; the simplest case being the point (1, 1) with value f (1, 1) = 13 .
We wish to decompose the domain R2+ into a compact set, to apply Weierstrass’
Theorem, and an unbounded set which is negligible for our analysis. Since we wish to
maximise the function f (x, y), we want the unbounded set to be those points (x, y) such
that f (x, y) < f (1, 1) . To do this, let us consider what an upper bound for f (x, y)
would be in terms of x and y. First, x, y ≤ max {x, y} so that yx2 ≤ (max {x, y})3 . For
the denominator, we also have:
x4 + y 4 + 1 > x4 + y 4 ≥ (max {x, y})4 .
Thus, for all (x, y) ∈ R2+ , we have:
f (x, y) ≤
1
(max {x, y})3
.
4 =
max {x, y}
(max {x, y})
So, if either x > 3 or y > 3 then we have:
f (x, y) <
1
= f (1, 1).
3
We can therefore decompose R2+ into the union of:
Y = (x, y) ∈ R2+ 0 ≤ x ≤ 4, 0 ≤ y ≤ 4 ,
Z = R2+ /Y.
Since Y is defined in terms of weak inequalities, it is closed. It is also bounded, for
example by the length 6. Finally, (1, 1) ∈ Y , so it is non-empty. So, for the continuous
36
3.4. Solutions to exercises
function f , Weierstrass’ Theorem asserts the existence of a maximum for f on Y , say at
the point x? ∈ Y . However, for any (x, y) ∈ Z, we have:
f (x? ) ≥ f (1, 1) =
1
> f (x, y).
3
So x? is a global maximum for the function f .
3
37
3. Weierstrass’ Theorem
3
38