Mathematical Biology Education: Beyond Calculus

EDUCATIONFORUM
We do not argue against the mathematical
courses included in current undergraduate
biology curricula. But we believe that these
courses should be revised and extended.
Many key computational ideas can be better communicated and absorbed by biology
undergraduates with few prerequisites, in a
way that will make the students excited about
bioinformatics as a scientific discipline and
more creative when they employ bioinformatics methods and ideas in the future. We
feel that the best way to engage biology
undergraduates in bioinformatics is to appeal
to their innate intuition and common sense
and to avoid mathematical formalism as
much as possible. The proposed course may
become a first step toward building the new
computational curriculum for biologists.
References and Notes
1. National Research Council, BIO2010, Transforming
Undergraduate Education of Future Research Biologists
(National Academies Press, Washington, DC, 2003).
2. W. Bialek, D. Botstein, Science 303, 788 (2004).
3. P. A. Pevzner, Bioinformatics 20, 2159 (2004).
4. L. M. Iyer et al., Genome Biol. 2(12), RESEARCH0051.1
(2001).
5. P. Grindrod et al., Math. Today 44, 80 (2008).
6. N. Wingreen, D. Botstein, Nat. Rev. Mol. Cell Biol. 7, 829
(2006).
7. L. J. van ’t Veer et al., Nature 415, 530 (2002).
8. L. J. Gross, Cell Biol. Educ. 3, 85 (summer 2004).
9. R. Brent, Cell Biol. Educ. 3, 88 (summer 2004).
10. We are grateful to all participants of RECOMB Bioinformatics Education conference (La Jolla, 14 and 15 March 2009)
for many comments on various aspects of bioinformatics
education. We are also grateful to V. Bafna, N. Bandeira,
A. Tanay, and G. Tesler for many interesting discussions
and suggestions. The conference was supported by the
Howard Hughes Medical Institute Professors Program.
10.1126/science.1173876
EDUCATION
Mathematical Biology Education:
Beyond Calculus
Training in developing algebraic models
is often overlooked but can be valuable
to biologists and mathematicians.
Raina Robeva1* and Reinhard Laubenbacher2
I
n 2003, the National Research
Council’s BIO2010 report
M
recommended aggressive
curriculum restructuring to
educate the “quantitative biol- Ge
E
ogists” of the future (1). The
number of undergraduate and
L
graduate programs in mathematical and computational
biology has since increased,
and some institutions have added
courses in mathematical biology related to
biomedical research (2, 3). The National
Science Foundation (NSF) and the National
Institutes of Health are funding development
workshops and discussion forums for faculty (4, 5), research-related experiences (6,
7), and specialized research conferences in
mathematical biology for students (8, 9).
This new generation of biologists will
routinely use mathematical models and computational approaches to frame hypotheses,
design experiments, and analyze results. To
accomplish this, a toolbox of diverse mathematical approaches will be needed.
Nowhere is this trend more evident than in
Department of Mathematical Sciences, Sweet Briar College, Sweet Briar, VA 24595, USA. 2Virginia Bioinformatics
Institute, Virginia Tech, Blacksburg, VA 24061, USA.
1
*To whom correspondence should be addressed. E-mail:
[email protected].
542
)
Le
DE and Boolean models of the lac operon mechanism. Each component of the shaded part of the wiring
diagram is a variable in the model, and the compartments outside of the shaded region are parameters.
Directed links represent influences between the variables: A positive influence is indicated by an arrow; a
negative influence is depicted by a circle.
systems biology. At the molecular level, this
involves understanding a complex network of
interacting molecular species that incorporates gene regulation, protein-protein interactions, and metabolism. Two types of models have been used successfully to organize
insights of molecular biology and to capture
network structure and dynamics: (i) discreteand continuous-time models built from difference equations or differential equations
(DE) models, which focus on the kinetics
of biochemical reactions; and (ii) discretetime algebraic models built from functions
of finite-state variables (in particular Boolean
networks), which focus on the logic of the
network variables’ interconnections.
Algebraic models were introduced in
1969 to study dynamic properties of gene
regulatory networks (10). They have proven
useful in cases where network dynamics
are determined by the logic of interactions
rather than finely tuned kinetics, which often
are not known. Published algebraic models
include the metabolic network in Escherichia coli (11) and the abscisic acid signaling
pathway (12).
The use of algebraic methods is extending beyond systems biology. Methods
from algebraic geometry have been used
in evolutionary biology to develop new
approaches to sequence alignment (13), and
new modeling of viral capsid assembly has
been developed using geometric constraint
theory (14). Algorithms based on algebraic
combinatorics have been used to study RNA
secondary structures (15).
25 JULY 2009 VOL 325 SCIENCE www.sciencemag.org
Published by AAAS
Downloaded from www.sciencemag.org on October 5, 2009
tional ideas (hierarchical clustering, pattern classification, and feature selection) are
introduced as part of the “story” to promote
ease of understanding by students. Such
examples may also instill in students the
real-life impact of computational methods.
This research, for example, led to the first
cancer diagnostic chip to be approved by
the U.S. Food and Drug Administration; it is
currently used to determine which patients
may benefit from additional chemotherapy.
The question of whether similar innovative courses can be implemented at the
undergraduate level at many universities
remains subject to debate (8, 9). Nevertheless, such courses are pioneering steps
toward developing a new computational
biology curriculum.
EDUCATIONFORUM
DE Versus Algebraic Models
Many institutions now offer DE-focused
courses that include problems from the life
sciences, rather than just the traditional
linkages with physics and engineering. For
example, in bio-calculus and precalculuslevel courses, differential and difference
equations are used to model system dynamics. Textbooks emphasize development of
DE models for various biological systems
related to, e.g., population dynamics and the
spread of an epidemic (16, 17). In contrast,
few curricular materials linking biology
with modern algebra have been created. The
courses “Mathematics for Computational
Biology” (18) and “Algebraic Statistics for
Computational Biology” (19) are two such
examples targeting advanced undergraduate mathematics and biology students. A
NSF curriculum development pilot project
to produce algebraic educational modules is
under way (20), as is a project to create an
integrated biology curriculum, incorporating mathematics, statistics, and computational methods beyond calculus (21).
This relative lack of interest and investment in algebraic approaches compared
with DE is due primarily to a lack of awareness of the importance of algebraic models in modern biology and mathematics
research, as well as to a lack of appropriate
educational materials on algebraic models,
rather than to inaccessibility of the essential
underlying mathematics.
Variables in a DE model can span a continuous range of biologically feasible values. Modelers need detailed knowledge of
interactions between variables, for example, specifics of control mechanisms, rates
of production and degradation, or highest
and lowest biologically relevant concentrations. But in an algebraic model, only values from a finite set are allowed. The special
case of a Boolean network allows only two
states, for example, 0 and 1, representing
the absence or presence of gene products in
a model of gene regulation. In contrast to
DE, the information necessary to construct
a Boolean model requires only conceptual
understanding of the causal links of dependency. Thus, continuous models are quanti-
tative while Boolean models are qualitative
in nature.
To illustrate, we use one of the simplest and best-understood mechanisms of
gene regulation: the lactose (lac) operon
that controls the transport and metabolism
of lactose in E. coli (fig. S1). A wiring diagram is constructed to represent major components and causal interactions of the system (see figure, page 542).
The wiring diagram on the left of the figure depicts both DE and Boolean models of
the lac operon. Although it involves only three
variables, the DE model is complex, involving additional parameters reflecting the reaction kinetics. The Boolean model is simpler,
reflecting only basic interactions depicted in
the wiring diagram. Despite its simplicity, the
Boolean model exhibits the same qualitative
behavior as the DE model. Both are capable
of reflecting the key feature of bistability of
the operon (22).
In contrast with the high level of detail
needed for the construction of the DE model,
the Boolean model is relatively intuitive and
was essentially obtained by translating a
prose description of the molecular pathways
depicted in fig. S1 and formalized by the
wiring diagram into logical statements. This
feature of algebraic models makes them particularly appealing as an entry into mathematical modeling.
Constructing algebraic models of biological networks requires only a modest amount
of mathematical background, some of which
is typically included in a college algebra
course. This provides a quick path to mathematical modeling for students (and researchers, for that matter) in the life sciences. For
mathematics students, algebraic models of
biological networks provide a meaningful
way to introduce many of the concepts in the
discrete mathematics curriculum. And they
are easily connected to more advanced topics in abstract algebra (23).
A Call for Change
Algebraic models should be considered
critical for the professional development of
biologists. Mathematics and biology educators must work to determine the best way
of including these in undergraduate curricula. Because the mathematics involved is
more easily accessible to faculty in the life
sciences, relative to DE, this could lead to
increased engagement on their part. Algebraic models will introduce problems from
modern biology into the traditional mathematics courses, bringing life to the primarily theoretical abstract algebra curricula.
Concurrent or subsequent introduction to
DE models in the courses already in place
will only reinforce the conceptual framework and further elevate students’ mathematical sophistication.
References and Notes
1. National Research Council, BIO2010: Transforming
Undergraduate Education for Future Research Biologists
(National Academies Press, Washington, DC, 2003).
2. N. Wingreen, D. Botstein, Nat. Rev. Mol. Cell Biol. 7, 829
(2006).
3. R. Robeva, Methods Enzymol. 454, 305 (2009).
4. Over the Fence: Mathematicians and Biologists Talk
About Bridging the Curricular Divide, Mathematical
Biosciences Institute workshop, Columbus, OH, 1 and 2
June 2007.
5. Computational and Mathematical Biology, Mathematical
Association of America, Professional Enhancement
Programs, 2003, 2004, 2005, 2006, and 2007.
6. Expanding Biomedical Research Opportunities at the
Undergraduate Level, in Recovery Act Limited Competition: NIH Challenge Grants in Health and Science
Research (RC1) (National Institutes of Health Program
Solicitation 12-DK-102, Challenge Grant Applications,
2009); http://grants.nih.gov/grants/funding/challenge_
award/Omnibus.pdf.
7. Interdisciplinary Training for Undergraduates in Biological and Mathematical Sciences, (National Science
Foundation Program Solicitation, NSF 08-510, 2008);
www.nsf.gov/pubs/2008/nsf08510/nsf08510.htm.
8. Undergraduate Conference at the Interface between
Mathematics and Biology, National Institute for Mathematical and Biological Synthesis (NimBioS), Knoxville,
TN, 23 and 24 October 2009.
9. Undergraduate Research Conference in Quantitative
Biology, Appalachian State University, Boone, NC, 2 and
3 November 2007 and 14 and 15 November 2008.
10. S. A. Kauffman, J. Theor. Biol. 22, 437 (1969).
11. A. Samal, S. Jain, BMC Syst. Biol. 2, 21 (2008).
12. R. Zhang et al., Proc. Natl. Acad. Sci. U.S.A. 105, 16308
(2008).
13. L. Pachter, B. Sturmfels, Proc. Natl. Acad. Sci. U.S.A. 101,
16132 (2004).
14. M. Sitharam, M. Agbandje-Mckenna, J. Comput. Biol. 13,
1232 (2006).
15. A. Apostolico et al., Nucleic Acids Res. 37, e29 (2009).
16. C. Neuhauser, Calculus for Biology and Medicine
(Prentice-Hall, Upper Saddle River, NJ, ed. 2, 2003).
17. F. Adler, Modeling the Dynamics of Life: Calculus and
Probability for Life Scientists (Thompson, Belmont, CA,
ed. 2, 2005).
18. “MATH 127—Mathematics for Computational Biology”
(University of California, Berkeley, CA, 2007);
http://math.berkeley.edu/~bernd/math127.html.
19. “MATH, 295—Algebraic Statistics for Computational
Biology” (Duke University, Durham, NC, 2005); www.
math.duke.edu/graduate/courses/fall05/math295.html.
20. R. Robeva et al., National Science Foundation DUE award
0737467, 2008; www.nsf.gov:80/awardsearch/
showAward.do?AwardNumber=0737467.
21. Biology Through Numbers Howard Hughes Medical
Institute Award, 2006; www.hhmi.org/research/
professors/neuhauser.html.
22. E. M. Ozbudak, M. Thattai, H. N. Lim, B. I. Shraiman,
A. van Oudenaarden, Nature 427, 737 (2004).
23. R. Laubenbacher, B. Sturmfels, Am. Math. Month.
arXiv:0712.4248v2, in press.
24. The authors thank R. Davies from Sweet Briar College
for her review of the biological content. Support by the
NSF under the Division of Undergraduate Education; NSF
Course, Curriculum, and Laboratory Improvement (CCLI)
award 0737467 is also gratefully acknowledged.
Supporting Online Material
www.sciencemag.org/cgi/content/full/325/5940/542/DC1
www.sciencemag.org SCIENCE VOL 325 31 JULY 2009
Published by AAAS
Downloaded from www.sciencemag.org on October 5, 2009
Because of the success of DE methods
for modeling biochemical networks, they
have been the primary focus of curriculum
reform efforts, whereas algebraic models
have received less attention. But algebraic
models have many features needed to integrate mathematics and biology into the training of students at all levels; thus, their inclusion in curricula is necessary and timely.
10.1126/science.1176016
543