Assessing Elementary Algebra with STACK

Assessing Elementary Algebra with STACK
Christopher J Sangwin
School of Mathematics, University of Birmingham, B15 2TT
+44 121 414 6197 (phone) +44 121 414 3389 (fax)
April 6, 2006
Abstract
This paper concerns computer aided assessment (CAA) of mathematics in which a computer algebra system (CAS) is used to help assess students’ responses to elementary algebra questions. Using a methodology of documentary analysis, we examine what is taught
in elementary algebra. The STACK CAA system, http://www.stack.bham.ac.uk/,
which uses the CAS Maxima, is taken as a case study with which to test the implementation of the ideas developed in this paper. The general characteristics needed from
a CAS for the application of computer aided assessment, or computer based learning is
discussed. In particular the need for consistently implemented noun forms of elementary
arithmetic operations, together with their traditional verb forms is identified.
Keywords: Computer algebra, computer aided assessment, elementary algebra.
1
Introduction
This paper concerns computer aided assessment (CAA) in which a computer algebra system
(CAS) is used to help assess students’ responses, i.e. answers, to elementary algebra questions.
As a simple illustration, imagine a student interacting with a CAA system and in some way
entering a mathematical expression as an answer to a question. The CAA system uses a CAS
to algebraically subtract the student’s response from that of the teacher and to simplify the
resulting expression. If the result of this algebraic simplification is zero, then the system has
established an algebraic equivalence between the student’s answer and the teacher’s answer.
The key point is that the student’s answer contains mathematical content.
General purpose computer aided assessment systems predominantly use multiple choice
questions, or modifications of these. Such modifications provide a significant range of variations, and include multiple response, and matching questions. What all these have in common
is that they are provided response question (PRQ) types. That is to say, questions in which a
student is provided with a list of potential answers and asked to make a selection, match-up,
rearrange or perform various other kinds interactions. Contemporary CAA systems usually
include numerical entry although these have only very limited use in mathematics. Free text
entry is a feature of many systems although tools to assess these are primitive.
The use of a PRQ is almost always a constraint dictated by the software, and not the
preferred choice of the user. Indeed there are well-aired fundamental problems with provided
response questions which are (i) question distortion, (ii) assessment only of lower order skills,
and (iii) strategic learning. While there is evidence these can be ameliorated to some extent
by carefully designing questions, the problems associated with question distortion are severe
1
in mathematics. This is particularly the case with questions involving an essentially reversible
process, where one direction is significantly more difficult. For example, factor vs. expand,
integrate vs. differentiate and so on. Questions which ask students to solve a problem are also
difficult to set as a PRQ since it is too easy for a strategic student to interpret the question as
asking ‘which of these are solutions?’. Since a student may not actually perform the process
asked but do something else, the question may well not test what the teacher intends. In
fact, we might be quite disappointed if students did not take the mathematically sensible, if
strategic, approach. This would at least demonstrate an understanding of the reversibility of
the process, and their relative difficulties. Many of these problems are fundamental to the
PRQ type and are difficult, if not impossible, to remove. What is pedagogically more honest
is to ask for, and assess, a mathematical response from the student.
1.1
Background to CAS-enabled CAA
Since 2000, a number of specialist computer aided assessment systems have used an existing
CAS to provide tools to assess mathematical expressions as answers. Perhaps the first system
to make a mainstream CAS a central feature was the AiM System, first described by [8], with
subsequent technical developments described in [16]. This system operates using Maple, as
does the Wallis system of [9] and the proprietary system MapleTA. Other systems have access
to a different CAS, such as CABLE [10] which uses AXIOM and the STACK system which
uses the CAS Maxima, see [14]. From private correspondence, the author is also aware of
systems which use Derive and Mathematica in a similar way.
In these systems the CAS supports various ancillary functions, such as the display of
mathematical expressions (via perhaps LATEX) and the input of expressions using the typed
linear syntax supplied by the CAS. Indeed, in all the CAA systems mentioned above the
student uses the syntax of the CAS itself to express their response to a question. The syntax
issue is particularly important for elementary algebra assessment. If a learner is only just
meeting concepts such as gathering like terms, or multiplication of brackets then they cannot
be encumbered by the need to learn syntax. As a result, STACK also provides the liberalised
mathematical syntax described in [13].
However, in all these systems the primary purpose of the CAS is to provide the teacher
with a ready made library of useful mathematical functions. This allows the generation
and manipulation of expressions to create structured random questions. These functions
also allow a student’s answer to be manipulated and tested objectively against a variety
of mathematical criteria beyond the prototype of establishing algebraic equivalence with a
teacher’s expression. It is the ability to make fine grained distinctions in elementary algebra
which is the topic of this paper. Distinguishing between different possible responses is central
to the assessment process, whether in CAA or by more traditional means. Only once this has
been done can various outcomes be assigned, which might include a mark (or score), feedback
and so on. While humans are able to extract information from the question context to guide
the assessment of students’ responses, in CAA explicit instructions are needed. Furthermore,
these decisions need to be made in advance of the question being posed to the student. The
functions are also invaluable for generating feedback based on mathematical properties of the
student’s answer and also for constructing worked solutions.
While a CAS provides access to a large and well supported library of mathematical functions, the mainstream CAS cited above have been designed for the research mathematician.
When attention is paid by the designers of CAS to students, as it often is, the envisaged use
2
is almost always as a sophisticated calculator, see for example [6]. The experiences of the
author in using a variety of the CAA systems mentioned above while teaching undergraduate
students since 2000, suggests that CAA as an application is quite different.
In this paper we address the following questions.
1. How do we recognize and describe the characteristics of a ‘correct answer’ to an elementary mathematics question?
2. To what extent can the CAS Maxima establish these characteristics?
Being able to answer the second research question presupposes an understanding of the distinctions a teacher might make when assessing students’ answers in the traditional way. Hence,
we will begin by looking in some detail at the academic subject to which CAA is being applied.
There are significant differences between various CAS, and a comparison from the point
of view of a research mathematician are given in [17] together with other general comments.
Maxima has been chosen for this paper since it underpins the STACK CAA system currently
being developed by the author. We shall comment on some of the differences between Maxima
and other CAS in the discussion.
1.2
Caveats
One philosophical objection to this research is that mathematical work is not about obtaining
the ‘correct answer’. Indeed, the author does not believe that obtaining an isolated expression
is sufficient. In learning and teaching the method used forms essential evidence for a student’s
understanding of the processes involved. Nevertheless, being able to automatically assess a
logically connected sequence of mathematical statements is predicated upon the ability to
examine the properties of a single expression. Nevertheless, establishing the mathematical
properties of a single expression is a significant advance from the use of PRQ question types
currently used by contemporary CAA. Furthermore, practice is a legitimate and necessary
activity and CAA systems such as STACK have been used most often for this purpose.
In this paper we assume that only the student’s final answer is available to the CAA
system. We have no working, logical connections or method comments from which to draw
inferences. Feedback would be richer is this information were available, and in practice
students do complain at the lack of partial credit. Again, this assumption is in line with
contemporary CAA. In this paper we consider only an assessment function, rather than online
intelligent tutoring. In practice the immediacy of feedback, together with the opportunity
for the student to make repeated attempts, can be used to encourage the student to check
their own incorrect working carefully to identify any errors for themselves. Partial credit and
‘follow through’ marking are contrived devices to recognize and reward students’ achievement
in the context of asynchronous marking of traditional paper based work. With CAA the
importance of accuracy in achieving a correct final answer could be reinforced in a way which
does not unfairly penalize the inevitable and minor mistakes which virtually everyone makes
from time to time.
In many cases a general purpose CAS is not required to manipulate the response of a
student to establish its mathematical properties. For example, algebraic equivalence between
two expressions can be automatically established by choosing a dozen or so incommensurate
random values, substituting these into the respective expressions, evaluating numerically and
comparing the results. While this method does not guarantee equivalence, it does have a
3
firm basis in numerical analysis and has been found to be sufficiently reliable for the kinds
of expressions which occur in learning and teaching. Other tests, whether an expression is
in factored or expanded form for example, would require bespoke libraries of mathematical
functions. Systems which take this approach to CAA are the CALM system of [1], the Metric
system of [12] and the Aplusix system of [11], to cite but three examples from an extensive
field. We consider such libraries of functions to be ‘computer algebra’, even if they are not
traditional general purpose systems or explicitly conceptualized as such by their authors. Our
discussion of which mathematical distinctions a teacher might wish to make is just as relevant
to these systems.
In this paper we do not comment on how a student actually interacts with a CAA system
to express themselves mathematically. We assume that through a process of interactions the
student has communicated a syntactically valid expression, and that the CAS has interpreted
this in the way the student intended. This is not trivial. For example, in many CAS the string
4^1/2 is interpreted as two, not because the CAS takes the positive square root, but because
41
1
2 = 2. Since we tell our students that ‘a half’, 2 , as a rational number is a single entity, they
might expect the division order of precedence in this case to exceed that of exponentiation
so that the input string is interpreted as 4^(1/2) rather than (4^1)/2. One alternative to a
linear syntax is to use an ‘equation editor”, and various handwriting recognition systems are
being developed to facilitate interactions with CAS which could also be used in CAA, such
as that of [4]. The option used by the majority of existing CAA systems is for the student
to adopt a typed linear syntax usually identical or very close to that of the underlying CAS.
The issue of entry of mathematical expressions is an important but logically separate topic
from that we consider here, and is addressed in [13].
Simply being able to make a mathematical distinction does not mean that feedback must,
or should be given. We do not comment on what the teacher should do, rather we only
indicate the distinctions which could be, or are likely to need to be, made. For example,
while it may be perfectly acceptable for an engineering student to write the roots of x2 − x − 1
as x = 1.618 and −0.618, we would naturally expect an exact answer from a mathematics
undergraduate. Exactly what are considered appropriate outcomes are context dependent,
and hence will vary. However, being able to make such distinctions is a necessary precursor
to being able to provide educationally useful feedback.
2
Methodology
We begin with an examination and description of what is taught and assessed in elementary
algebra. There is certainly not an unique way of teaching elementary algebra, however there
is surprising uniformity in the approaches taken by contemporary text books and curricula.
This uniformity is reflected in both the ordering of material, and the activities (i.e. questions
or problems) required of the students. We do not wish to dwell on larger scale curricula design
issues, since our research pertains to automatic assessment of individual questions. Hence,
we have taken [2] as a summary of what might be viewed as the mainstream approach. This
is supplemented by questions from the algebra textbook [3]. While these two works are
broadly in agreement differences do occur since [3] is designed as a revision aid. As a specific
example, expansion of brackets and factorization are taken together which is not usual in a
first encounter with elementary algebra.
The methodology adopted is to work through these two sources systematically, matching
4
questions in [3] with topics and concepts in [2]. For each grouping of elementary algebra
tasks, exemplar questions have been selected from [3] and the following considered.
• What is being asked by the question?
• What mathematical processes does the student employ?
• What criteria are being used to mark an answer? i.e. what are legitimate answers and
how do we recognize them?
• What distinctions are possible for answers which are partially correct?
Following from this theoretical analysis, examples of each of the questions from [3] were
automated using the STACK CAA system. Looking at these questions and implementing
marking schemes with CAS code highlights the issues and focuses the discussion. While the
focus is thus naturally on the CAS Maxima, in Section 4 we shall draw on examples from
Maple and other CAS which have been employed for assessment by the author. While Maxima
underlies STACK the results are presented in such a way as to highlight the essential issues
which are likely to be encountered in mathematical CAA, regardless of which implementation
platform is adopted.
3
Results
In this section we give examples of questions from elementary algebra which illustrate the
issues involved when using CAS to assess answers to them in the CAA context outlined above.
In each case the question numbers refer to [3], where for example 3.2a would be Section 3,
question 2a. Algebra is an extension of arithmetic and so it is natural to begin elementary
algebra with arithmetic questions.
Question 1.1. Express each of the following as a fraction in its simplest form.
a)
20
45
b)
16
36
c) −
42
21
d) · · ·
The mathematical task is to remove the highest common factor from each integer in the
fraction. The student is expected to factor the numerator and denominator, cancel common
factors and if necessary multiply the remaining factors together to fully simplify the fraction.
Essentially this is an algorithmic process. To recognize the answer the teacher is looking for
either a single fraction with particular numbers in the numerator and denominator, or the
special case where the denominator is 1, which should not be written as a fraction. In the case
of a rational number greater than one, it may not be clear to the student if ‘simplest’ means
a single fraction or a mixed fraction. Indeed, [2] separates the cases of Reduction of fractions
to lowest terms from Change of improper fractions to mixed fractions and vice-versa.
3
Question 1.3a. Calculate 45 × 16
.
The mathematical task here can be thought of in two stages. The first is to perform a
multiplication calculation, the second is to simplify the result of this calculation, essentially
replicating the task of Question 1.1. Alternatively the student might consider the question
as presenting a partial factoring of the result and work from there, canceling first before
performing a multiplication.
5
These two questions typify many in which the teacher expects the student to (i) perform
an algorithmic numeric calculation and then (ii) express the result in a particular simplified
exact form.
There are other purely numerical tasks. When asking for a prime factorization of an
integer, the expected form of the answer is a product of powers preferably in ascending
order. When finding the highest common factor or least common multiple of two integers,
the expected answer is an integer.
Algebra is signified with the introduction of indeterminate quantities, e.g. x, into expressions. The task of substituting numbers for letters appears algebraic, but from the point of
view of automatic assessment this immediately reduces to numerical work. The beginning of
algebra proper occurs with the manipulation of algebraic expressions. While [2] begins with
collecting like terms as an explicit task, [3] omits exercises on this topic and moves straight
to manipulation of indices. For example.
Question 2.1. Simplify the following algebraic expressions.
a) x3 × x4
b) y 2 × y 3 × y 5
c) · · ·
The object of the question is to write an algebraic expression in a particular syntactic form.
The task to be performed involves applying the rules of indices, which include an arithmetical
calculation similar to that above. Negative or fractional powers can add complexity, without
fundamentally changing the question from the point of view of automatic assessment.
Surds, and their interaction with arithmetic operations including powers,
√ √ appear
√ at this
stage. First is the simplification of products and quotients of surds (eg 2 3 = 6). With
indeterminant quantities the teacher needs to make some choices about what simplifications
√
are permissible. In the early stages √
of elementary algebra it is usual to assume x2 = x. For
√
example, [2] gives the general rule n am = ( n a)m , and [3] confirms this by giving 5y as the
answer to the following question.
p
Question 2.13e. Simplify: 25y 2 .
Similarly, when examining an answer a decision needs to be taken about whether fractional
indices or surd notation is acceptable or perhaps required, depending on the purpose and
phrasing of the question itself.
The next significant topic is that of expanding algebraic expressions. For example
Question 3.1. Write the following expressions without using brackets:
· · · f ) (am)n
g) (a + m)n h) · · ·
We shall include within the discussion quadratic and more complex manipulations such
as the following.
Question 3.1. Write the following expressions without using brackets and simplify where
possible:
a) (2 + a)(3 + b) b) (x + 1)(x + 2) c) · · ·
Again, the question is asking the student to manipulate an expression and to write it
in a particular syntactic form. In this case as a sum of terms, rather than as a product,
which is to say expanded. Fundamentally the student has to distribute multiplication over
addition. Doing this may entail sub-processes of multiplication of numbers, gathering like
terms (i.e. counting or addition), re-ordering and using rules of indices. Indeed all of the
algebraic processes a student has learned up to this topic may be subsumed, in the sense of
[7], into this task.
6
The primary criteria are that the student’s expression is algebraically equivalent to that
in the question and that it has been written as a sum, where each summand is fully simplified,
perhaps a product of numbers and indeterminant quantities. Numerical calculations should
be performed, fractions rendered to their lowest terms and any rules of indices applied. In
addition, traditionally terms are ‘gathered’ and ‘sorted’. For a single variable polynomial
powers usually ordered with the highest first. Hence, recognizing a fully correct answer
requires the application of a number of quite separate criteria.
The reverse process mathematically to expanding out expressions is that of factoring.
Rather then consider this immediately, the topic of algebraic fractions occurs next. The
numerous cases of operations distinguished by [2] can be summarized by analogy to those
with numerical fractions. For example, canceling terms, adding, multiplication and division
and so on. In all these cases the answer is (i) a single fraction algebraically equivalent to that
in the question, and (ii) fully simplified. From the point of view of computer aided assessment
there is little technical variation. Again, a number of separate criteria are applied.
1
2
Question 4.7a. Simplify x+2
+ x+3
.
So far we have dealt only with the manipulations of expressions. Either we have performed
some calculation or manipulated an algebraic expression to render it into a particular syntactic
form. Next we shall deal with mathematical equations. Both [2] and [3] distinguish two types:
solving linear equations and the transposition of formulae.
Question 6.15. Solve the equation 2(y + 1) = −8.
nE
.
Question 7.4. Make n the subject of the formula J = nL+m
In the former the answer is a number, whereas in the latter we would expect an algebraic
expression. The student could state the answer either on its own, or in the form of an equation,
such as y = −5. We have already commented on how a teacher recognizes these forms, and
so there is little new from the point of view of computer aided assessment.
Another important topic is solving simple simultaneous equations. Since we would expect
a list of equations, e.g. y = 1, x = 4, recognizing answers to such questions is a small
incremental step from linear equations.
The next topic of importance is factoring algebraic expressions. Essentially this involves
changing a representation where the first operation is addition, to one in which the first
operation is multiplication and work naturally starts with simple linear factors. For example
Question 3.5a. Factorise 5x + 15y.
Again, such questions ask the student to re-write an algebraic expression in a particular
syntactic form. Much like when expanding out, we need to confirm both algebraic equivalence with the expression given in the question and the form of the answer. Here we make
comparisons in the context of commutative addition and multiplication operations, although
the student is expected to perform any calculations.
Work on factoring linear expressions moves onto factoring quadratic expressions. The
processes used by the student may not be more sophisticated that ‘spotting’ the correct
answer, or even a ‘guess and check’ method. Traditional algebra books contain large numbers
of practice questions, subdivided into a variety of cases. From the point of view of automatic
marking there is little difference between these.
Question 3.7b. Factorise x2 + 6x − 7.
Once factoring of quadratics has been taught, algebra opens up in a number of possible
directions. Hereafter there is little if any uniformity between textbook authors in the order of
topics. The most important topic is solving quadratic equations. Of course, solving quadratics
7
can be, and often is, taught by three methods: (i) factoring; (ii) completing the square; or
(iii) using the formula.
Question 8.1. Solve the following equation by factorisation x2 − 3x + 2 = 0.
Here the answer consists of one or two solutions, either integers or real numbers involving
quadratic surds. Hence, recognizing solutions relies on the ability to recognize numbers.
Solving higher order polynomial equations naturally follows, although from the point of view
of recognizing answers using computer aided assessment there is almost no difference from
the quadratic case.
Question 10.4. Solve the equation x4 − 2x2 + 1 = 0.
Question 10.5. Factorise x4 − 7x3 + 3x2 + 21x + 20 given that (x + 2)2 is a factor.
At this stage the ability to factor quadratics allows one to revisit algebraic fractions of
more complex types, such as the following.
x+2
Question 4.17. Simplify x2 +3x+2
.
Surds may also be revisited with a view to rationalization, to remove surd terms from the
denominators of fractions. Again, a particular syntactic form is required.
Partial fractions may perhaps also be introduced around this time.
x−3
Question 11.4.a. Express the following in partial fractions x2 −2x+1
.
It is the final form of the answer which is important here. The extent to which the
numerator of the fraction needs to be ‘simplified’ to satisfy the teacher, is of course, a matter
of taste.
3.1
A question taxonomy: syntax; calculations; properties
The above analysis of elementary algebra questions has highlighted three broad categories of
questions.
The first category are questions which ask the student to manipulate the written syntax
of a given algebraic expression. Essentially, the mathematical object remains algebraically
equivalent to that in the question and the student is expected to manipulate this. Forms of
numbers include prime-factored, rational numbers can be in single fraction or mixed fraction,
with all rational parts in lowest terms. Fractional indices can be converted to surds, or vice
versa. A teacher needs to take a view on whether decimals are acceptable when they are
exact (eg 0.5 for 12 ) or approximations (eg 0.333 for 13 ). The accuracy of any acceptable
approximations also needs to be considered. Any particular requirements for the form of
numbers applies to those occurring in algebraic expressions. Important algebraic forms are
gathered (including rules of indices), sorted, expanded, factored over the integers, reals (and for
completeness the complex numbers). Forms for algebraic fractions mirror those for rational
numbers: lowest terms, single fraction or mixed fraction. In addition we have partial fraction
form and rationalization of surds. A special form unique to quadratics is the completed
square. Rearranging an equation may be included within the category of syntax questions.
For example, solving a linear equation is essentially a syntax issue since the process involves
manipulating the equation by performing identical operations to both sides of the equal sign.
Here the student may be expected to give the answer in a form such as x =?.
The second category are questions which asked the student to perform a calculation. Here
a functional characteristic of the elementary arithmetic operations are used by the students.
For example, adding two rational numbers is a calculation. However, reducing the result of
this calculation to its lowest terms is itself a syntax question applied to the intermediate step
in this calculation. Calculation can and does occur when manipulating algebraic expressions
8
to render them into a particular syntactic form. However, the primary or explicit purpose
of the question is not to assess the ability to calculate. This illustrates the particular serial
nature of learning mathematics, where earlier topics become prerequisites for later ones.
The third category asks the student to establish some property of a number or algebraic
expression. Finding the highest common factor of a pair of integers, or the set of prime factors
is included here. Also, questions which ask the student to identify part of an expression. For
example, selecting the denominator from a fraction or selecting the coefficient from the highest
order term in a polynomial. More typically we might require the student to find the roots of
an equation. In such a question the form of the answer may have no obvious relationship to
that of the question. In the case of solving a quadratic, for example, the question contains
an equation, but the answer is a list or set of solutions which may be numbers or equations
of the form x = 1.
This taxonomy has been found to be useful when implementing questions in the STACK
computer aided assessment system, particularly when deciding on feedback and issues of
partial credit. For example, in syntax questions it may be appropriate to condone calculations
which have not been fully performed: gathering like terms in a complex expansion for example.
Similarly there is a loose hierarchy of simplifications and, as we shall see in the next section,
in a CAS all of these are performed in a single step. When providing feedback it is useful to
be able to decompress these processes somewhat. In a calculation, one might condone a lack
of full simplification. When asking for properties it may be permissable to condone answers
which are not fully simplified, or in which all calculations are not fully performed. Hence,
the author has found these distinctions to be quite useful when deciding which automatic
simplification regimes to apply to the answer of a particular question when implementing
them in CAA.
While we are discussing taxonomies for describing algebraic tasks, it is worth noting that
in reading through textbooks as part of this study, the word ‘simplify’ is often synonymous
with ‘do what I’ve just shown you in the previous worked example’. It is even possible to
find occurrences where the instruction to perform both directions of a reversible process is
‘simplify’ in different sections of a particular book. These are rare in elementary algebra, but
are more common in trigonometry or work with logarithms, which are not included here. By
adopting more explicit instructions the students can begin to build a vocabulary for describing
mathematical objects, and the syntactic forms they take.
4
Discussion
In the previous section we have mapped out the considerations which need to be made when
assessing answers to elementary algebra questions. In this section we comment in detail on
the ability of the CAS Maxima to implement these, and briefly compare Maxima to other
mainstream general purpose CAS.
Fundamentally the task in computer aided assessment is to establish the properties of
an expression provided by the student. The prototype property is when this expression
is the same as an algebraic expression supplied by the teacher. Either we compare the
student’s expression with the ‘correct’ answer, or establish that it is the same as something
not quite correct in order to be able to provide formative feedback. From the strict computer
science point of view, ‘same’ can have only one meaning: two objects have identical internal
representations in the data structure of the CAS. As a result of this requirement all CAS
9
perform simplifications, which is the process by which a representation is transformed into
a canonical form to enable such comparisons to take place. However as we have already
established, from a pedagogic standpoint there are multiple senses of ‘same’. For example,
there is algebraic equivalence and that two expressions have the same syntactic forms. So, for
example, we might wish to say that a student’s expression is correctly in completed square
form, the numbers occurring in the expression are correct but are themselves in the wrong
form, e.g. decimal approximations rather than fully simplified rational form.
The default behaviour of Maxima, like all general purpose CAS, is to automatically perform simplification and render expressions into a canonical form. This includes performing
all numerical calculations and simplifying all rational numbers to their lowest terms whenever
they appear in expressions. The evenness of cosine is used to simplify cos(−x) to cos(x). For
this application such behaviour is quite inappropriate, since we need to deal with what the
student actually gave, rather than the canonical form of this expression.
In Maxima, simplification can be switched off using the command simp:false, where
in Maxima’s language the colon is used to signify the assignment of a value to a variable.
Maxima is unusual in its ability to suppress automatic simplification completely, and few
other CAS have this ability, making is difficult or impossible when using them to distinguish
between potential responses to elementary algebra questions.
We begin the specific and detailed discussion with Maxima’s representation of numbers.
The following remarks might appear to be trivial wrinkles, and indeed they do not bother
the intelligent user employing CAS as a sophisticated calculator. However, they need to be
resolved for the application of CAA to avoid incorrect marking of student’s work.
Firstly, in Maxima, the unary minus is a prefix operation. Hence, a sum such as 3 − 4 is
represented internally1 as
((MPLUS) 3 ((MMINUS) 4))
literally as 3 + (−4). Similarly, a division such as ab is represented internally essentially as
a × b−1 . Both design decisions result in fewer operations, and remove potential duplication
of operations, significantly reducing the technical complexity of the internal simplifier.
These design decisions have implications when trying to recognize numbers in the absence
4
of simplification. For example, −4
3 and − 3 have quite different internal representations.
4
Hence, without any simplification −4
3 and − 3 are not equal, where here we take equality
to be that of the internal data structure representing the mathematical objects. Complex
numbers are similarly not single entities a + ib, but rather an expression involving addition
and multiplication. Floating point numbers are considered to be a different data type.
Maxima is not a strongly typed language which has distinct advantages over say Maple or
Axiom for CAA. For this application a CAS should tolerate mixed data types within a single
algebraic expression. For example, it is common for students to write expressions containing
both floating point and rational coefficients, such as x3 + 0.5, even if discouraged to do so.
Many CAS have a data structure which only allows a single type of coefficient, and coerces
all to a single type. The example above would be coerced to 0.33333x + 0.5 in Maple, Axiom
and some other systems. Maxima is unusual in allowing mixed types in expressions and when
a coercion is unavoidable the type of the result can be controlled. The default is to coerce to
float, although floats can be coerced to rational numbers using the command ratsimp().
We assume from this point onwards that tests can be applied that flag up the following
issues when they occur. Outcomes can then be assigned by the teacher as appropriate.
1
The command ?print(ex) can be used to view Maxima’s internal representation of the expression ex.
10
1. Whether arithmetic calculations with numbers have been performed by the student.
Hence, no arithmetic ‘simplification’ is required by the CAS.
2. Whether rational numbers are expressed in the lowest terms.
3. Whether floating point representations are used and whether these are exact.
With surd/radical
terms design decisions need to be made. Firstly, automatic simpli√
fication of x2√can result in three different results. If we assume the
√ variable is real and
2
non-negative, x = x, and if √
we assume only that x is real then x2 = |x|. When we
make no assumptions about x, x2 should√remain un-simplified, as it does in Maple. The
default behaviour of Maxima is to simplify
x2 = |x| which can be overridden by a command
√
2
such as assume(x>0), in which case x is simplified to x. For elementry algebra the second
option is more useful, although both are necessary for learning and teaching. Within Maxima
even finer degrees of control over the simplifier can be achieved via the variable radexpand.
The second design decision is simply a matter of how to display expressions involving radical
√
signs. Whether a traditional
symbol is used or fractional powers given instead can be
controlled with the Maxima variable sqrtdispflag. Flexibility such as this is invaluable for
this application.
It is questions which ask the student to expand an expression which present the first
significant challenge. For example, we would like to turn off any automatic simplification so
that a distinction between
x2 + 3x + 2,
and x2 + 2x + x + 2
can be drawn. In this latter expression terms have not been gathered, which automatic
simplification by the CAS would perform. However switching off all simplification does not
suffice. By default in Maxima variables in an algebraic expression are reverse lexicographically
ordered by the simplifier so that a+b would be simplified to b + a. Comparing a + b with b + a,
literally
if a+b=b+a then true else false;
results in true. However, if simplification is switched off this comparison results in false.
As a result, if we assume that we have simp:false, then an automatic comparison procedure
will reject perfectly good answers, such as 1+ x which when treated as a polynomial is usually
represented as x + 1.
To automatically recognize answers we would like some of the algebraic properties, but
not the functional properties of the arithmetic operations to be active. That is to say, we
would like to assume that addition and multiplication are commutative and associative and
perform our comparison in that context. However we do not want to actually perform any
additions or multiplications.
Currently all mainstream CAS treat basic arithmetic operations as ‘verb’ forms, rather
than ‘noun’ forms. For example, 1 + 2 is a calculation to do rather than representing an
addition. The work of [5] develops this distinction via the notion of a procept to capture the
dual process/concept nature of basic arithmetic operations. They comment on the ambiguities
in using the same symbol, e.g. +, for both as follows.
By using the notation ambiguously to represent either process or product, whichever
is convenient at the time, the mathematician manages to encompass both — neatly
side-stepping a possible object/process dichotomy. [5]
11
The supremacy of the verb over noun forms is quite natural when one remembers the origins
of CAS as a sophisticated calculator. However this does not accurately reflect mathematical
usage and furthermore, for our application, we need a much finer degree of control than is
currently available.
The approach taken in STACK is to define parallel noun forms of addition and multiplication operators, and define these as commutative and associative operators. These are
indistinguishable from the usual forms to the student, but their internal behavior is quite
different. The provision of such operations gives the teacher a much needed and finer degree
of control.
Interestingly, many computer algebra systems acknowledge the need for noun forms of
calculus operations. For example, in Maple a capital initial letter is used to denote an inert
differentiation or integration operation. e.g. Diff(y,x). Such a mechanism is necessary
when entering a differential equation for example. After all, here we certainly do not want to
perform the differentiation with respect to x, say, on a symbol y. Unfortunately, the provision
of matching noun and verb forms does not appear to the author to have been implemented
with a consistency which would allow them to be extended to the elementary arithmetic
operations in a natural way. Indeed, their presence seems to be the result of evolution to
allow, for example, differential equations to be expressed, rather than conscious planning
from the proceptual point of view.
Practical experience also suggests that selective simplification is invaluable. One particular
example is identity simplification in which x + 0 is simplified to x and 1 × x to x. Similarly
at this elementary level 0 × x should simplify to 0, and we assume that x1 = x and x0 = 1.
In elementary algebra there are a large number of forms of algebraic expressions. As part
of the CAA system we need functions which will establish that a particular expression is of
this form. Ideally, where an expression is not of a correct form some feedback should be
generated for the student. These functions are not available in general purpose CAS, and as
an example of those developed for the STACK system we consider in detail how to establish
that an algebraic expression is factored over the integers.
Recognizing that an expression is factored is quite different from using a CAS to factor an
expression over, say, the integers. Hence, it is not sufficient to compare a student’s answer with
the expression obtained from the factor function provided by a CAS, even if this comparison
is made in the context of commutative multiplication and addition. To illustrate this in more
detail consider the following factored forms of x2 − 4x + 4, to which a CAA system would
need to respond.
(x − 2)2 ,
(2 − x)2 ,
(x − 2)(x − 2),
(2 − x)(2 − x),
x 2
and 4 1 −
.
2
(1)
We assume here that simplification is off, so that repeated terms in a product are not automatically converted by the CAS to powers. There are additional complexities with even
powers which do occur even when asking students to factor a quadratic expression, as can be
seen in the example above.
The most effective approach is an elementary one: an expression is factored if it is the
product of terms, or powers of terms, each of which are irreducible over a particular field.
Hence it is necessary to pick apart the structure of the expression to establish this. We might
also establish algebraic equivalence of the student’s and teacher’s expressions, however that
is logically quite separate from deciding that an expression is factored.
12
Figure 1: Feedback from the STACK CAA system
Such a function might also return feedback, perhaps highlighting which terms in a product
could be further factored as required. For example, a student may well partially factor an
expression, specifically they may not pull out an integer factor as in
(3x − 6)(x + 5)
vs
3(x − 2)(x + 5).
(2)
One could well argue that both of these forms are legitimate factored quadratics, where the
focus is on the quadratic nature, not the linear integer component. If the teacher wished to
automatically provide tailored feedback to the first expression, CAS tools need to be adapted
for this purpose. This has been done in STACK, and is illustrated in Figure 1. While it is
not claimed that this feedback is optimal, it illustrates what is possible automatically.
The problems of recognizing when an expression is in either partial fraction or rationalized
surd form are very similar to that of recognizing a factored expression. Here again we need
functions which pick apart an expression. Similar issues occur with even powers, and with
pulling out integer factors. The unary minus also causes significant problems, where we might
have expressions such as
1
−1
1
,
, or
.
−
1−x
1−x
x−1
5
Other mathematical topics and associate CAS issues
This paper has concentrated exclusively on algebraic considerations. However trigonometry is
another important topic encountered in elementary mathematics. All mainstream computer
algebra systems assume radian angular measure. This design decision has implications for input, processing and display of mathematical expressions involving elementary trigonometrical
functions. In terms of automatic simplifications, most mainstream CAS understand that sine
is an odd function, and cosine is an even function. Hence, they simplify expressions such as
cos(−x) to cos(x). For CAA the ability to control such simplifications would be advantageous.
A similar issue is related to the design decisions for logarithmic and exponential functions.
While issues relating to input syntax have been dealt with in [15], briefly, all mainstream
computer algebra systems assume log(x) refers to the natural logarithm, and no system by
default accepts the input string e^x as the exponential function. Maxima and Maple both
use exp(x).
The topic of sets is often included at an elementary level. Since duplicate elements are
removed from a set automatically by the computer algebra system there is an implicit notion
of when two elements are considered to be the same. As we have seen this is not quite so
13
simple as it first appears. The ability to impose a various notions of sameness when removing
duplicate elements from a set would be advantageous for this application.
6
Conclusions
This paper seeks to explain the difficulties of using a general purpose computer algebra system
to assess elementary algebra questions. These issues are mostly associated with very simple
mathematics, and in particular when making very fine grained distinctions in elementary
algebra.
If designers of CAA choose not to adapt an existing CAS, but rather write a bespoke
library of mathematical functions for this application then all the issues raised in this paper
will still need to be resolved. Certainly for more advanced work CAS has proved to be
powerful, reliable and useful. Hence, the benefit gained from using an existing CAS far
outweighs the difficulties described in this paper. Maxima’s weak concept of data type; the
ability to suppress all simplification; and the ease with which new operations can be included
make it particularly suited for this application.
The STACK CAA system has been built upon the basis of the principles described in this
paper and a number of tests and functions developed to implement the kinds of questions
detailed in Section 3. Given the complexities of the subject, it is probably the case that
writing reliable questions is a job for an expert. Certainly with the inherent complexities any
question author needs to understand what choices are available to them, and what decisions
have already been made. STACK has been designed very much with the teacher in mind, and
details of the question authoring process are given in [14] where colleagues are the ‘neglected
learners’ referred to in the title.
This system has been used successfully with groups of approximately two hundred undergraduate mathematics students with weekly assignments since September 2005. Ongoing
work will ensure the full implementation noun forms of arithmetic operations in Maxima,
and mechanisms to control simplification to facilitate assessment of all areas of elementary
algebra.
References
[1] H. Ashton, C. E. Beevers, A. A. Koraninski, and M. A. Youngson. Incorporating partial
credit in computer aided assessment of mathematics in secondary education. British
Journal of Educational Technology, 2005.
[2] T. Barnard. A Pocket Map of Algebraic Manipulation. The Mathematical Association,
1999.
[3] A. Croft. An Algebra Refresher. LTSN Maths, Stats & OR Network, The University of
Birmingham, Birmingham, B15 2TT, 2002.
[4] M. Fujimoto and M. Suzuki. Infty Editor — A Mathematics Typesetting Tool with
a Handwriting Interface and a Graphical Front-end to OpenXM Servers, volume 1335
of Computer Algebra – Algorithms, Implementations and Applications, pages 217–226.
RIMS Kokyuroku, 2003.
14
[5] E. M. Gray and D. O. Tall. Duality, ambiguity and flexibility: A proceptual view of
simple arithmetic. Journal for Research in Mathematics Education, 26(2):115–141, 1994.
[6] D. Guin, K. Ruthven, and L. Trouche. The Didactical Chellenge of Symbolic Calculators: Turning a Computational Device into a Mathematical Instrument, volume 36 of
Mathematics Education Library. Springer, 2005.
[7] D. Hewitt. Mathematical fluency: the nature of practice and the role of subordination.
For the learning of mathematics, 16(2):28–35, 1996.
[8] S. Klai, T. Kolokolnikov, and N. Van den Bergh. Using Maple and the web to grade
mathematics tests. In Proceedings of the International Workshop on Advanced Learning
Technologies, Palmerston North, New Zealand, 4–6 December, 2000.
[9] M. Mavrikis and A. Maciocia. Wallis: a web-based ILE for science and engineering
students studying mathematics. In Workshop of Advanced Technology for Mathematics
Education in the 11th International Conference on Artificial Intelligence in Education,
pages 505–512, Sydney, Australia, 2003.
[10] L. Naismith and C. J. Sangwin. Computer algebra based assessment of mathematics
online. In Proceedings of the 8th CAA Conference 2004, 6th and 7th July, The University
of Loughborough, UK, 2004.
[11] J. F. Nicaud, D. Bouhineau, and H. Chaachoua. Mixing microworlds and CAS features
in building computer systems that help students learn algebra. International Journal of
Computers for Mathematical Learning, 9(2):169–211, 2004.
[12] P Ramsden.
Fresh Questions, Free Expressions:
METRICs Web-based
Self-test Exercises.
Maths Stats and OR Network online CAA series
http://ltsn.mathstore.ac.uk/articles/maths-caa-series/, June 2004.
[13] P. Ramsden and C. J. Sangwin. A liberalised mathematical syntax for computer-aided assessment. In Proceedings of the International Mathematica Symposium, Perth, Australia,
2005.
[14] C. J. Sangwin and M. J. Grove. STACK: addressing the needs of the “neglected learners”.
In Proceedings of the WebAlt Conference, Eindhoven, 2006.
[15] C. J. Sangwin and P. Ramsden. Linear syntax for communicating elementary mathematics. (Submitted), 2006.
[16] N. Strickland. Alice interactive mathematics. MSOR Connections, 2(1):27–30, 2002.
http://ltsn.mathstore.ac.uk/newsletter/feb2002/pdf/aim.pdf (viewed December 2002).
[17] M. Wester. Computer Algebra Systems: a Practical Guide. Wiley, 1999.
15

Download Report

Assessing Elementary Algebra with STACK

Paperzz.com

Your Paperzz