Active-Set Reduced-Space Methods with Nonlinear Elimination for

Downloaded 08/04/16 to 109.171.137.210. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
SIAM J. SCI. COMPUT.
Vol. 38, No. 4, pp. B593–B618
c 2016 Society for Industrial and Applied Mathematics
ACTIVE-SET REDUCED-SPACE METHODS WITH NONLINEAR
ELIMINATION FOR TWO-PHASE FLOW PROBLEMS IN
POROUS MEDIA∗
HAIJIAN YANG† , CHAO YANG‡ , AND SHUYU SUN§
Abstract. Fully implicit methods are drawing more attention in scientific and engineering
applications due to the allowance of large time steps in extreme-scale simulations. When using a
fully implicit method to solve two-phase flow problems in porous media, one major challenge is the
solution of the resultant nonlinear system at each time step. To solve such nonlinear systems, traditional nonlinear iterative methods, such as the class of the Newton methods, often fail to achieve
the desired convergent rate due to the high nonlinearity of the system and/or the violation of the
boundedness requirement of the saturation. In the paper, we reformulate the two-phase model as
a variational inequality that naturally ensures the physical feasibility of the saturation variable.
The variational inequality is then solved by an active-set reduced-space method with a nonlinear
elimination preconditioner to remove the high nonlinear components that often causes the failure of
the nonlinear iteration for convergence. To validate the effectiveness of the proposed method, we
compare it with the classical implicit pressure-explicit saturation method for two-phase flow problems with strong heterogeneity. The numerical results show that our nonlinear solver overcomes the
often severe limits on the time step associated with existing methods, results in superior convergence performance, and achieves reduction in the total computing time by more than one order of
magnitude.
Key words. two-phase flow, variational inequality, active-set reduced-space methods, nonlinear
preconditioners, nonlinear elimination, parallel computing
AMS subject classifications. 76S05, 68W10
DOI. 10.1137/15M1041882
1. Introduction. Modeling and simulation of two-phase flow in porous media
have been a major effort in reservoir engineering [37]. In addition to oil reservoir
management, understanding and modeling of two-phase flow are crucial also to many
environment issues. For example, one of the most attractive and practical solutions
to reducing the CO2 emission problem is to inject and store CO2 in subsurface geological formations [43], such as depleted reservoirs and deep saline aquifers. The large
capacity of subsurface storage provides several advantages over other possible alternatives of carbon sequestration. For subsurface carbon sequestration, there are at
least four major mechanisms to trap the injected CO2 for long-term storage, namely,
structural (stratigraphic) trapping, residual fluid trapping, solubility trapping, and
mineral trapping. Some of these mechanisms may coexist and interplay simultaneously during the long span of storage process. A deep understanding of these trapping
mechanisms and exploring possibly new trapping mechanisms require accurate mod∗ Submitted to the journal’s Computational Methods in Science and Engineering section September 29, 2015; accepted for publication (in revised form) May 17, 2016; published electronically July
26, 2016. The research was supported in part by NSF China grants 11571100, 91530323 and 91530103.
http://www.siam.org/journals/sisc/38-4/M104188.html
† College of Mathematics and Econometrics, Hunan University, Changsha, Hunan, 410082, China
([email protected]).
‡ Corresponding author. Institute of Software, Chinese Academy of Sciences, Beijing 100190,
China, and State Key Laboratory of Computer Science, Chinese Academy of Sciences, Beijing 100190,
China ([email protected]).
§ Division of Physical Sciences and Engineering (PSE), King Abdullah University of Science and
Technology (KAUST), Thuwal 23955-6900, Saudi Arabia ([email protected]). This author
was supported by KAUST through the grant BAS/1/1351-01-01.
B593
Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.
Downloaded 08/04/16 to 109.171.137.210. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
B594
HAIJIAN YANG, CHAO YANG, AND SHUYU SUN
eling and simulation of two-phase and multiphase flow in subsurface formations. This
paper is restricted to the discussion of two-phase flow, but the applied methodology
is applicable to full compositional multiphase flow.
One basic requirement for accurate modeling and simulation of multiphase flow is
to have the predicted saturation sit within a physically meaningful range, i.e., between
zero and one. The saturation can be spatially and temporally varied within this
range, but most algorithms used in practice do not guarantee the computed saturation
will stay within this range. This is especially so when the applied computational
method manifests numerical oscillation and nonphysical undershoot or overshoot [33].
A commonly used fix to this problem is to simply apply a cut-off operator to the
computed saturation at each time step, i.e., to set the saturation to be zero whenever
it becomes negative, and to set it to one whenever it becomes larger than one. This
treatment, however, has a serious unpleasant consequence, which is the loss of mass
conservation. Global and local mass conservation are desired properties for reservoir
flow and other subsurface flow simulation. The majority of algorithms used in reservoir
simulation have the property of local mass conservation, which of course implies also
global conservation. Unfortunately, the applied cut-off operators not only destroy the
local mass conservation but also damage the global mass conservation, which seriously
ruins the numerical accuracy and physical interpretability of the simulation results.
It is safe to state that one major challenge in two-phase flow simulation is the
nonlinearity of the process. Two main nonlinearities of two-phase flow are capillarity
and relative permeability. Capillary pressure is one of main driving forces in fluid
(oil, gas, and water) flow and transport in subsurface [37]. The capillarity effect can
be the leading mechanism in oil recovery from fractured oil reservoirs. Capillarity is
caused by surface tension between immiscible (or partially miscible) fluids. Relative
permeability is a dimensionless measure of the effective permeability of a fluid phase,
and it also plays an important role in enhanced oil recovery. In commonly used
multiphase flow models, both the capillary pressure and relative permeability are
nonlinear functions of saturation. They can be defined by interpolating experimental
data or by using a closed-form formula, and they usually require the saturation to
stay within a physically meaningful range, because otherwise the capillary pressure
and relative permeability become undefined. As a result, it is important to ensure the
saturation within a physically meaningful range, even from the viewpoint of treating
capillarity and relative permeability.
A popular approach for solving two-phase flow problems is the implicit pressureexplicit saturation (IMPES) method [11, 12, 19, 59], in which the nonlinear problem
is decoupled into two subproblems, namely, a linear equation for the potential component that needs to be solved implicitly and a nonlinear equation for the saturation
variable that can be explicitly updated. However, the IMPES method often requires
use of a small time step size to maintain a physical meaningful solution because of the
stability restriction [13]. The situation becomes worse at the extreme scale due to the
dependency between the time step size and the spatial resolution. Moreover, the semiimplicit nature of the algorithm may introduce a large splitting error due to the decoupling of the original problem [53]. Fully implicit methods [14, 36, 42, 51, 53, 54, 57],
on the other hand, can relax the stability requirement on the time step size and therefore provide a consistent and robust way to solve the nonlinear problem in long-term
and extreme-scale simulations. In particular, for problems with various time scales,
fully implicit methods with a suitable time adaptivity strategy [44, 58] can lead to
substantial reduction the total computing time, which greatly expands the scope of
application of the fully implicit algorithm.
Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.
Downloaded 08/04/16 to 109.171.137.210. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
NONLINEAR ELIMINATION FOR TWO-PHASE FLOW
B595
When a fully implicit method is applied, one needs to solve a large-scale nonlinear
algebraic system at each time step. Efforts have been made in employing the classical
Newton method and its many variants to solve the two-phase flow problem [14, 36, 42].
However, numerical challenges arising from the boundedness of the solution still exist
and may cause the Newton methods fail to converge. In this work, we introduce a
variational inequality formulation of the saturation equilibrium with a box inequality
constraint, which naturally satisfies the boundedness requirement. The two-phase
variational inequality is then formulated as a nonlinear complementarity problem
[21] which is solved by an active-set reduced-space method [3, 4, 22]. The active-set
reduced-space method consists of two major steps: (a) in the first phase, an index
set with respect to the computational domain is decomposed into active and inactive
parts, based on a criterion specifying a certain active set method; and (b) in the
second phase, a reduced linear system associated with the inactive set is solved. We
would like to point out that the class of active-set reduced-space method has proven
to be very efficient in a variety of applications [23, 28, 34, 35, 56], but very little work
has been done in the two-phase flow problem as far as we know.
For problems with high nonlinearities, preconditioning techniques on the nonlinear level, such as the additive Schwarz preconditioned inexact Newton (ASPIN) methods [9, 25, 39], the nonlinear restricted Schwarz preconditioners [10, 16], the nonlinear
dual-domain decomposition methods [48], the nonlinear balancing domain decomposition by constraints methods [30], the nonlinear elimination methods [26, 27, 29],
and the composite nonlinear algebraic methods [6], have received increasing attention in recent years. In particular, some efforts have been made in applying the
ASPIN method to solve the two-phase flow problems [51, 53]. Although the numerical results show that it is superior to the classical linear additive Schwarz method,
the success of the ASPIN method depends strongly on an effective nonlinear subdomain solver, which limits the robustness of the algorithm, especially for the highly
nonlinear models of two-phase flow problems. In this study, we present a nonlinear
elimination algorithm as a nonlinear preconditioner to accelerate the convergence of
the active-set reduced-space method. In the algorithm, the nonlinear elimination is
carried out in a field-split manner to implicitly eliminate the solution components
with high magnitude that often triggers the divergence of the nonlinear solver. We
show by numerical experiments that the proposed algorithm is robust and efficient for
a number of two-phase flow examples with strongly nonlinear relative permeabilities
and spatially varied capillary pressure functions.
The paper is organized as follows. In section 2, we present a detailed description
of the active-set reduced-space method with nonlinear elimination. In section 3, a
variational inequality based model of two-phase flow problems is introduced and then
the proposed algorithm is applied to solve the resultant nonlinear systems. We carry
out numerical experiments in section 4 to examine the efficiency and effectiveness of
the proposed algorithm. The paper is concluded in section 5.
2. Active-set reduced-space method with nonlinear elimination. The
proposed active-set reduced-space method with nonlinear elimination consists of two
major ingredients: (a) a global update based on an active-set reduced-space method as
an outer nonlinear solve and (b) a subspace correction based on a field-split nonlinear
elimination as a nonlinear preconditioner.
2.1. Active-set reduced-space methods. We first review the class of activeset reduced-space methods (see, e.g., [2, 45]) for the numerical solution of large spare
nonlinear variational inequality system of equations with lower and upper bounds,
Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.
Downloaded 08/04/16 to 109.171.137.210. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
B596
HAIJIAN YANG, CHAO YANG, AND SHUYU SUN
i.e., for a given nonlinear function F : Rn

 Xi = φi
Xi = ψi
(2.1)

Xi ∈ [φi , ψi ]
→ Rn , compute a vector X ∈ Rn , such that
and Fi (X) > 0,
and Fi (X) < 0,
and Fi (X) = 0,
where F = (F1 , . . . , Fn )T , Fi = Fi (X1 , . . . , Xn ), X = (X1 , . . . , Xn )T , and the inequality holds componentwisely by φ = (φ1 , . . . , φn )T and ψ = (ψ1 , . . . , ψn )T , i.e., only one
of these three equations holds at a time. Let S = {1, 2, . . . , n} be an index set with
each index corresponding to an unknown component Xi and a nonlinear residual
component Fi . Two special cases are the system of nonlinear equations, F (X) = 0,
obtained by taking φi = −∞ and ψi = +∞ for each i ∈ S, and the mixed nonlinear
problem, where for some i in the subset of S, φi = −∞ and ψi = +∞. In the study, we
propose a class of active-set reduced-space methods for solving (2.1), which consists
of the following steps: (a) in the first phase, the index set is decomposed into active
and inactive parts, based on a criterion specifying a certain active set method; (b) in
the second phase, a reduced linear system associated with the inactive set is solved,
and then a line search method is applied to improve the efficiency of the iterative
process.
The active and inactive sets used within the reduced-space method are defined as

 Aφ (X) := {i ∈ S | Xi = φi and Fi (X) > 0} ,
Aψ (X) := {i ∈ S | Xi = ψi and Fi (X) < 0} ,

I(X) := S\ (Aφ (X) ∪ Aψ (X)) ,
where Aφ (X) and Aψ (X) denote the active constraints at X with respect to the lower
bound φ and the upper bound ψ, respectively, and I(X) the inactive constraints. In
fact, the active sets denote the variables where the lower and upper bounds are active
and the function value can be ignored, i.e.,
φi if i ∈ Aφ (X),
Xi =
ψi if i ∈ Aψ (X).
The inactive set contains the remainder of the variables associated with a reduced
linear system.
Starting from an initial guess X 0 ∈ Rn , suppose X k is the current approximate
solution; then a new approximate solution X k+1 can be computed through
X k+1 = π X k + λk dk ,
where dk is a search direction, λk is the step length for line search, and the operator π
is a projection to cut off the undershoot or overshoot of the solution from the interval
[φ, ψ], i.e., to set the component of the solution to be the lower bound whenever it
becomes less than φ, and to set it to ψ whenever it becomes larger than the upper
bound. To be more precise, at every iteration of the reduced-space method, the
search direction dk includes three mutually disjoint subvectors dAkφ , dAkψ , and dI k ,
with the first two corresponding to the active sets and the third to the inactive set.
In the computation, we first set dAkφ and dAkψ to 0, and then dI k is calculated by
approximately solving the linear system equation
(2.2)
∇F (X k ) I k ,I k dI k = −FI k (X k ),
Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.
NONLINEAR ELIMINATION FOR TWO-PHASE FLOW
B597
Downloaded 08/04/16 to 109.171.137.210. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
Algorithm 1. Active-set reduced-space method (RS).
Step 1 Choose an initial guess X 0 ∈ [φ, ψ]
and set k = 0.
Step 2 While kFΘ (X k )k > εr kFΘ (X 0 )k and kFΘ (X k )k > εa do
• Define the active and inactive sets

 Aφ (X) := {i ∈ S | Xi = φi and Fi (X) > 0},
Aψ (X) := {i ∈ S | Xi = ψi and Fi (X) < 0},

I(X) := S\ (Aφ (X) ∪ Aψ (X)) .
• Set dAkφ = 0 and dAkψ = 0, and approximately solve the linear system
∇F (X k )
I k ,I k
dI k = −FI k (X k )
to calculate a direction
dIk .
• Set X k+1 = π X k + λk dk , where λk ∈ (0, 1] is determined to satisfy
FΘ π X k + λk dk k2 6 (1 − αλk )kFΘ (X k ) .
2
• Set k = k + 1.
End While
where ∇F (X k ) I k ,I k is a submatrix of ∇F (X k ) and FI k (X k ) is a subvector of
F (X k ), both based on the same index set I k . The step size λk is chosen to be the
optimal value for residual reduction with the constraint of
(2.3)
kFΘ π X k + λk dk k2 6 (1 − αλk )kFΘ (X k )k2 ,
where α is employed to ensure that the reduction of kF (X k )k2 is sufficient, and FΘ (X)
is a restriction operator of F (X) defined componentwise as
Fi (X)
if φi < Xi < ψi ,
[FΘ (X)]i =
min{Fi (X), 0} others.
Now the complete active-set reduced-space method (RS) algorithm is described in
Algorithm 1.
In the RS method, we obtain a search direction dk by solving inexactly the reduced
linear system (2.2), then compute the next approximate solution along this search
direction. In (2.3), the damping scalar λk is used to determine the step length that
one should go in the selected search direction, and the parameter α is used to ensure
that the reduction of the residual function kFΘ (X)k2 is sufficient. In the ideal case,
λk = 1 is sufficient for RS, which means that a full step is taken. A near quadratic
convergence may be observed, when the nonlinearities in the system are well-balanced.
However, when the nonlinearity of the system is severely imbalanced, the size of
λk could be very small and the convergence becomes much slower. Observed from
many numerical experiments, the slow convergence or sometimes divergence is often
determined by the variables of equations in the system with the highest nonlinearities
[9, 10, 25]. In our proposed algorithm, a nonlinear elimination step is applied as a
subproblem solver inside the global RS iteration to smooth out the “high nonlinearity.”
Through subspace nonlinear solves, the high nonlinearities are removed, and the fast
convergence can then be restored when the active-set reduced-space method is applied
after the preconditioning.
Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.
Downloaded 08/04/16 to 109.171.137.210. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
B598
HAIJIAN YANG, CHAO YANG, AND SHUYU SUN
2.2. Nonlinear elimination. To define the nonlinear elimination (NE) preconditioner that is based on a field partition, the index set S is decomposed into two
parts Sb and Sg corresponding to different physical or algorithmic components, i.e.,
S = Sb ∪ Sg . For notational convenience, we denote the vector X = (XSb , XSg ), and
the nonlinear system F (X) is rewritten as
FSb (XSb , XSg )
F (X) = F (XSb , XSg ) =
= 0.
FSg (XSb , XSg )
For this partition, we define the subspace corresponding to the set Sb , Vb ⊂ Rn
as
(2.4)
Vb = {v | v = (v1 , . . . , vn )T ∈ Rn , vi = 0, if i 6∈ Sb },
and the corresponding restriction operators Rb , which transfer a vector from Rn to
Vb . Similarly, we define the subspace
(2.5)
Vg = {v | v = (v1 , . . . , vn )T ∈ Rn , vi = 0, if i 6∈ Sg },
and the corresponding operator Rg , which restrict the vector from Rn to Vg . Hence,
using the restriction operator Rb , for any given X ∈ Rn , we have
FSb (X) = Rb (F (X)) .
Moreover, we define Tb (X) : Rn → Vb as the solution of the following subspace
nonlinear system:
(2.6)
FSb (Rg (X) + Tb (X)) = 0.
Using the subspaces mapping function, we introduce a new global function,
Y = G(X) = Rg (X) + Tb (X).
2.3. General framework of RS–NE. The proposed nonlinear elimination preconditioned active-set reduced-space method can be seen as a class of nonlinear rightpreconditioning methods, i.e., the nonlinear elimination method is proposed for the
construction of the inner nonlinear preconditioner and the classical active-set reducedspace method is used for the outer nonlinear solver. A high-level description of the
basic algorithm for a general problem is presented in Algorithm 2.
The basic idea of RS–NE is to approximately eliminate the high nonlinearities
before applying the global nonlinear iteration. Skipping the subspace correction phase,
the proposed method is reduced to the classical RS method. Moreover, the RS method
degrades into the classical inexact Newton method with backtracking [15, 17] for
solving (2.1) when φ = −∞ and ψ = +∞. Hence, for the mixed nonlinear problem,
where for some i in the subset of S, φi = −∞ and ψi = +∞, the proposed method
can be considered as a class of coupled methods, i.e., the inexact Newton method with
backtracking is used to solve the part of nonlinear system without restrictions, while
the RS method is employed to solve the part of nonlinear system with restrictions. In
Algorithm 2, a parameter switch , which is problem-dependent and requires the user to
set, is employed as a switch to determine the timing of turning on or off the subspace
correction phase. In addition, if the line search fails in the process of global nonlinear
iterations, then the RS–NE method reverts to the subspace correction phase.
Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.
NONLINEAR ELIMINATION FOR TWO-PHASE FLOW
B599
Downloaded 08/04/16 to 109.171.137.210. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
Algorithm 2. Active-set reduced-space method with nonlinear elimination (RS–NE).
Step 1 Choose an initial guess X 0 ∈ [φ, ψ]
and set k = 0.
Step 2 While kFΘ (X k )k > εr kFΘ (X 0 )k and kFΘ (X k )k > εa do
Global update phase:
• Define the active and inactive sets

 Aφ (X) := {i ∈ S | Xi = φi and Fi (X) > 0},
Aψ (X) := {i ∈ S | Xi = ψi and Fi (X) < 0},

I(X) := S\ (Aφ (X) ∪ Aψ (X)) .
• Set dAkφ = 0 and dAkψ = 0, and approximately solve the linear system
∇F (X k ) I k ,I k dI k = −FI k (X k )
to calculate a direction dI k .
• Set X k+1 = π X k + λk dk , where λk ∈ (0, 1] is determined to satisfy
kFΘ π X k + λk dk k2 6 (1 − αλk )kFΘ (X k )k2 .
• Set k = k + 1.
Subspace correction phase:
If (line search fails) or (switch == 1), then
• Given X k = (Xbk , Xgk ), find Tb (X k ) by solving the subspace problem
FSb (Rg (X k ) + Tb (X k )) = 0.
• Compute Y k = G(X k ) = Rg (X k ) + Tb (X k ), and update X k = Y k .
End if
End While
It is worth mentioning that the proposed nonlinear elimination method is based
on the nonlinear function F (X), rather than the variational inequality (2.1), to build
the subspace systems (2.4) and (2.5). Moreover, the subspace problem (2.6), which
is a reduced-space nonlinear system, can be solved using any classical nonlinear iterative method, such as the inexact Newton method with backtracking and its many
variations [7, 31, 46, 47], and other composite solvers [6]. A key element of the NE
step is the choice of Sb and Sg to build the subspaces Vg and Vb . Some physics-based
or field-based strategy can be used to determine the subset indices, and the partition
of Sb and Sg for the nonlinear elimination may change with k. It differs from case to
case. We also remark that the use of the nonlinear elimination preconditioner in this
paper is different from the work in [26, 27] for the one- and two-dimensional scalar
full potential equations. In [26, 27], the to-be-eliminated component is determined in
advance based on certain knowledge of the problem, and thus the NE preconditioner
is applied starting from the beginning until the intermediate solution is close to the
desired solution. In our study, a class of nonlinear elimination methods as a right preconditioner is used for the general multicomponent systems, and the to-be-eliminated
component is implicitly removed, based on the the feedback from the intermediate
global nonlinear iterations. Hence, the NE preconditioner is activated only when the
line search fails or the control parameter switch is triggered in case that the global
update is difficult to converge.
Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.
Downloaded 08/04/16 to 109.171.137.210. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
B600
HAIJIAN YANG, CHAO YANG, AND SHUYU SUN
3. RS–NE for two-phase flow in porous media. In this section, we introduce the continuous physical model together with the discretization employed for
two-phase flow problems in porous media, and then present the implementation details
of the RS–NE method for the resultant nonlinear systems.
3.1. Model problem. The flow consisting of two incompressible and immiscible
fluids in porous media can be described by

∂S

φ α + ∇ · vα = qα ,
∂t
(3.1)
α = w, n,
k k

vα = − rα (∇pα + ρα g∇z) ,
µα
where g is the gravity acceleration constant, z is the depth, and φ and k are the
porosity and the absolute permeability of the media. Here the wetting phase of the
flow is denoted by subscript w and the nonwetting phase n. Each phase α has its
own saturation Sα , velocity vα , pressure pα , density ρα , viscosity µα , and relative
permeability krα . Later, we will show that the primary variables to be solved in (3.1)
are Sw and pw . In this paper, the relative permeabilities are given by
krw = Seβ ,
krn = (1 − Se )β ,
where β is a positive integer number and Se is the normalized saturation. A commonly
employed formulation of the normalized saturation is
Se =
Sw − Srw
,
1 − Srw − Srn
where Srw and Srn are the residual saturations for the wetting and nonwetting phases,
respectively.
In (3.1), the first equation is determined by the mass conservation law, and the
second one is the velocity equation following the Darcy’s law. For the two-phase flow,
the saturations of the phases are constrained by
Sw + Sn = 1,
and the pressures of the two phases are related through the capillary pressure function
pc (Sw ) = pn − pw .
Throughout the study, we use the following nonlinear capillary pressure function [24]:
Bc
pc (Sw ) ≡ − √ log Se ,
k
where Bc > 0 is related to the media property.
The heterogeneity of the capillary pressure in permeable media may result in
discontinuity in the saturation, which in turn complicates the numerical simulation.
For example, in the classical fractional flow formulation, contrast in capillary pressure
may lead to discontinuous global pressure that makes the formulation inconsistent. In
this paper, we follow the formulation in [24, 41]. We first introduce the flow potential
Φα for phase α:
Φα = pα + ρα gz.
Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.
NONLINEAR ELIMINATION FOR TWO-PHASE FLOW
B601
Downloaded 08/04/16 to 109.171.137.210. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
The capillary potential is then defined as
(3.2)
Φc ≡ Φn − Φw = pc (Sw ) + (ρn − ρw )gz.
Further, we denote the mobility of phase α as λα = krα /µα . The total velocity can
be rewritten into
v ≡ vw + vn = va + vc
with
(3.3)
va = −λa k∇Φw ,
vc = −λn k∇Φc .
Here the velocity va is associated with a mobility λa = λw + λn that is smoother compared to either λw or λn . The wetting-phase velocity can be recovered as vw = fw va
with fw = λw /λa . Then, the potential equation of the two-phase flow is reformulated as
(3.4)
∇ · (va + vc ) = qw + qn ,
and the wetting-phase saturation equation becomes
(3.5)
φ
∂Sw
+ ∇ · (fw va ) = qw ,
∂t
which holds on the set 0 6 Sw 6 1.
Suppose the boundary of the computational domain Ω is composed of two parts
∂Ω = ΓD + ΓN with ΓD ∩ ΓN = ∅. The boundary conditions associated to (3.4) and
(3.5) are

on ΓD ,
 pw = pD
N
v·n=q
on ΓN ,

N
Sw = S
on ∂Ω.
And the initial condition for the saturation equation is
0
(3.6)
Sw t=0 = Sw
in Ω.
When using a fully implicit method, (3.4) and (3.5) are coupled and a nonlinear
system is required to be solved at each time step. However, if we directly apply the
Newton method or its many variations to solve such nonlinear systems, we face the
numerical challenges arising from physically feasible saturation fractions between 0
and 1. Hence, we take the restriction of the saturation into account and introduce a
variational inequality problem of the wetting-phase saturation equation (3.5). Considering a scaled L2 -inner product and the obstacle potential 0 6 Sw 6 1, we obtain
the following variational inequality for (3.5); see [4, 5, 22] for more details.
0
0
Problem 1. Given Sw (·, 0) = Sw
∈ H 1 (Ω) with 0 6 Sw
6 1, find Sw ∈ H 1 (ΩT )
such that 0 6 Sw 6 1 a.e. in ΩT = Ω × [0, T ] and
(3.7)
D ∂S
E D
E D
E
w
φ
, X − Sw + ∇ · (fw va ) , X − Sw − qw , X − Sw > 0,
∂t
which has to hold for almost all t and all X ∈ H 1 (Ω) with 0 6 X 6 1. Here h·, ·i
denotes the L2 -inner product.
Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.
B602
HAIJIAN YANG, CHAO YANG, AND SHUYU SUN
Downloaded 08/04/16 to 109.171.137.210. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
It is well known that the above problem is equivalent to the following problem:
(3.8)

∂Sw


+ ∇ · (fw va ) − qw > 0
Sw = 0, φ


∂t





∂Sw
Sw = 1, φ
+ ∇ · (fw va ) − qw < 0

∂t







 Sw ∈ [0, 1] , φ ∂Sw + ∇ · (fw va ) − qw = 0
∂t
a.e. in
ΩT ,
a.e. in
ΩT ,
a.e. in
ΩT ,
where all equalities and inequalities have to hold almost everywhere. Equation (3.8) is
known as a box constrained variational inequality or mixed complementarity problem.
For more details, we recommend [18, 20, 21, 45, 49] and references therein.
For the purpose of comparison, an IMPES method is also implemented. The
semi-implicit scheme comprises two split steps: one implicitly solves the potential
equation (3.4) to obtain a new potential of the wetting phase Φw and thus va from
(3.3), and the other explicitly updates the wetting-phase saturation (3.5).
3.2. Discretization. In this study, a cell-centered finite difference (CCFD) is
applied to spatially discretize the optimization problem for the spatial terms; the detailed process of CCFD can be found [32, 40, 42]. The employed CCFD discretization
for the pressure equation in rectangular meshes can also be viewed as the mixed finite element method with Raviart–Thomas basis functions of lowest order equipped
with the trapezoidal quadrature rule. We remark here that the proposed nonlinearly
preconditioned solutions algorithms can be generalized to the case of other spatial
discretizations.
Let Ω be the computational domain covered with Nx × Ny mesh cells. Then each
mesh point pi,j = (xi , yj ) is centered at the position xi = i × hx and yj = j × hy with
i = 1, . . . , Nx , j = 1, . . . , Ny , hx = 1/Nx , and hy = 1/Ny . For a given time-stepping
sequence 0 = t(0) < t(1) < t(2) < · · · , define the time step size ∆t(l) = t(l+1) − t(l) and
use superscript (l) to denote the discretized evaluation at time level t = t(l) . Then,
the discretization of the saturation equation in (3.5) is
(3.9)
(l)
(l+1)
− Sw
Sw
∗
(l+1) (l+1)
(l+1) (l+1)
(l+1)
FSw Sw
+
∇
·
f
v
, pw
− qw
= 0,
=φ
h
w
a
∆t(l)
where ∇∗h · represent CCFD discretization of the gradient operator by using the up(l+1)
wind scheme according to the velocity va
. Hence, the discretization of variational
inequality (3.8) is given as
(3.10)

(l+1)

Sw
=0







(l+1)
Sw
=1







(l+1)
 Sw
∈ [0, 1]
(l+1) (l+1)
and FSw Sw , pw
> 0,
(l+1) (l+1)
and FSw Sw , pw
< 0,
(l+1) (l+1)
and FSw Sw , pw
= 0.
Similarly, we employ a CCFD scheme for (3.4); then the discretization of the
potential equation is
Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.
Downloaded 08/04/16 to 109.171.137.210. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
NONLINEAR ELIMINATION FOR TWO-PHASE FLOW
B603
(3.11)
(l+1) (l+1)
(l+1)
= −∇h · λ(l+1)
Fpw Sw
, pw
∇h Φ(l+1)
− ∇h · λ(l+1)
∇h Φ(l+1)
− qw
− qn(l+1)
a
w
n
c
= 0,
where ∇h · and ∇h represent CCFD discretizations of the divergence and gradient
operators. We remark here that the CCFD scheme conserves the mass both globally
and locally because of the consistent discretization of the local differential operator
across all mesh cell interfaces.
3.3. The application of RS–NE to two-phase flow. In this subsection, we
introduce the implementation details of the RS–NE method for two-phase flow. In the
implementation, we use the fully coupled ordering to build up the nonlinear system
(2.1), by which we mean that different physical variables defined at the same grid
point are always put together throughout the calculations. At each grid point, we
arrange the unknowns in the order of (pw )ij and (Sw )ij , and then all grid points
numbered in the natural ordering. That is, the unknowns are ordered in the order of
(3.12) X = (pw )11 , (Sw )11 , . . . , (pw )ij , (Sw )ij , . . . , (pw )Nx Ny , (Sw )Nx Ny ∈ Rn ,
where n = 2Nx Ny . The corresponding functions are ordered in the order of
(3.13)
F (X) = (Fpw )11 , (FSw )11 , . . . , (Fpw )ij , (FSw )ij , . . . , (Fpw )Nx Ny , (FSw )Nx Ny ∈ Rn .
The lower bound in (2.1) is given as
φ = (−∞, 0, . . . , −∞, 0, . . . , −∞, 0) ∈ Rn ,
and the upper bound is
ψ = (+∞, 1, . . . , +∞, 1, . . . , +∞, 1) ∈ Rn .
In the NE step, we use a field-based strategy to determine Sb and Sg and propose
the inexact Newton method with backtracking for solving the subspace problem (2.6).
In the following, we take the wetting-phase saturation approach (i.e., removing the
Sw -component) as an example to build the NE step. In the approach, we consider all
the wetting-phase saturation Sw as bad components. Following the order in (3.12),
the index set Sb with respect to the component Sw is all the even numbers of the set
S, and Sg = S\Sb . Also, we can define the corresponding subspaces Vb and Vg in
(2.4) and (2.5), respectively. Then,
Xi
if i ∈ Sg ,
Yi =
(Tb (X))i if i ∈ Sb ,
where Tb is the solution of subspace nonlinear system (2.6) with the inexact Newton
method with backtracking. To active the NE step, we need to set the parameter
switch = 1 when necessary. The strategy we use here is as follows. We first take
several (controlled by a threshold Nswitch , unless the line search fails) global nonlinear
iterations using the RS method, then solve the subspace problem by the inexact
Newton method with backtracking, and this process continues until convergence.
Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.
Downloaded 08/04/16 to 109.171.137.210. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
B604
HAIJIAN YANG, CHAO YANG, AND SHUYU SUN
A major advantage of fully implicit schemes is that the time step size 4t is
no longer constrained by the CFL condition, which is often required by explicit or
semi-implicit methods. Meanwhile, for the simulation of two-phase flow, the evolution
of the system usually admits various time scales and the calculation often lasts for a
long time. Hence, we can use an adaptive time step control to enhance the temporal
accuracy and reduce the total computing time. In the study, analogous to the switched
evolution/relaxation strategy in [44], we start with a relatively small time step size
4t(0) and update its value via 4t(l+1) = β (l) 4t(l) with β (l) being the scaling factor
of the adjacent time step size
η !!
kF (X (l) )k2
1
(l)
, min ρ,
,
(3.14)
β = max
ρ
kF (X (l+1) )k2
where kF X (l+1) k2 is the Euclidean norm of F X (l+1) . In the formulation, ρ ∈
(0, +∞) is a safeguard to avoid excessive change of the time step size between any
two immediate time steps, and η ∈ (0, 1] is used to control the adjustment of the time
step size.
4. Numerical experiments. In this section, we report some results of numerical experiments to examine the performance of both traditional and the proposed
algorithms listed as follows:
• IMPES denotes the implicit pressure-explicit saturation method.
• INB denotes the inexact Newton method with backtracking.
• RS denotes the classical active-set reduced-space method.
• RS–NE denotes the active-set reduced-space method with nonlinear elimination.
• Adaptive RS–NE denotes the nonlinear elimination preconditioned active-set
reduced-space method with adaptive time stepping.
All these algorithms are implemented based on the Portable, Extensible Toolkits for
Scientific computation (PETSc) [1] library. We test the algorithms on a Dell PowerEdge C6100 supercomputer that contains two hex-core 2.8-GHz Intel Westmere
processors and 24 GB local memory in each node and equipped with a nonblocking
QDR Infiniband high performance network. It is worth mentioning that all algorithms except the IMPES method are employed in conjunction with the fully implicit
method. There are several nonlinear and linear iterative procedures in the proposed
algorithms, and each requires a proper stopping condition. In the RS method, an
absolute (relative) tolerance of 10−3 (10−8 ) is utilized for the global nonlinear iteration, and the absolute (relative) tolerance is set to 10−10 (10−6 ) for the subspace
nonlinear iteration. In nonlinear iterations, we use the standard cubic backtracking
algorithm [15, 17] with α = 10−4 to pick the step length λ. In the IMPES method,
the time step subdivision for saturation is 5. All linear systems are solved by the
Schwarz preconditioned GMRES method [8, 50, 52, 55], with relative and absolute
tolerances of 10−3 and 10−6 , respectively. The subdomain problems in the Schwarz
preconditioner are solved with the sparse LU factorization. Moreover, in the adaptive
RS–NE method we use ρ = 2.5 and η = 0.75 to control the time step size 4t, if not
explicitly stated.
In the study, we use five test cases for two-phase fluid flows in a horizontal layer
consisting of regular heterogeneous permeability [24, 32, 53]. In all cases, with the
porous media being horizontal, the effect of gravity is neglected, and the void of the
medium is initially fully saturated with oil, i.e., the initial conditions are pw = 0 and
Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.
NONLINEAR ELIMINATION FOR TWO-PHASE FLOW
B605
Downloaded 08/04/16 to 109.171.137.210. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
Table 1
Parameters for all test cases.
Parameters
Computational domain (m)
φ
k (md)
µw (cp)
µn (cp)
β
Bc (bar md1/2 )
Srw
Srn
Injection rate (PV/year)
Case-1
300 × 150
0.2
1 or 100
1
0.45
2
70
0
0
0.15
Case-2
300 × 150
0.2
1, 50 or 100
1
0.25
2
30
0
0
0.11
Case-3
300 × 150
0.2
1 or 100
1
0.5
2
100
0
0
0.2
Case-4
500 × 270
0.2
1 or 100
1
0.45
2
50
0
0
0.11
Case-5
100 × 100
0.2
0.03-400
1
0.2
2
25
0
0
0.1
Sw = Sm , where Sm = 10−4 is the minimum of saturation. We assume the top and
bottom boundaries of the reservoir to be impermeable, i.e., the normal component
of the Darcy velocity on these boundaries vanishes. Then the system is flooded by
w
water from left to right, i.e., we set ∂p
∂n = 0, Sw = 1 − Sm at the left boundary
∂Sw
and set pw = 0, ∂n = 0 at the right boundary. There is no injection/extraction
to the interior of the domain. Table 1 summarizes the main features of each test
case, including the domain size, permeability, and porosity data. The purposes of
the tests include (a) the verification of the numerical accuracy of the fully implicit
method; (b) a comparison of the performance of the IMPES method and the RS–NE
method; (c) a comparison of different fully implicit solvers, including the classical
INB and RS methods and the RS–NE method with adaptive time stepping; (d) a
comparison of different parameters for RS–NE with adaptive time stepping; (e) the
parallel performance of the fully implicit method.
4.1. Numerical validation. We first validate the discretization scheme and
the fully implicit solver by running Case-1. As shown in Figure 1, the computational
domain for Case-1 is 300 meters long by 150 meters wide and includes two kinds of
permeabilities (1 md and 100 md) in subdomains, respectively, i.e.,
(
100 md 70 6 x 6 150 and 50 6 y 6 100,
k=
1 md
otherwise.
The saturation is calculated until 0.5 pore volume injection (PVI), and the wetting
phase is injected with the flow rate 0.15 PV/year from the left boundary, i.e., the
simulation time is ended at t = 0.5/0.15 year. In this test, we use a fixed time step
size 4t = 0.1 year (i.e., 36.5 days) for the RS–NE method and use 4t = 5 × 10−4
year (i.e., 0.1825 day) for the IMPES method. Numerical results of both the IMPES
method and the fully implicit method on a 100×50 mesh are shown in Figure 1, which
verifies the consistency between the semi-implicit and the fully implicit methods. We
remark that the IPMES method cannot obtain the smooth results when the time step
size 4t is larger than 6 × 10−4 year, which is consistent with [32].
We further run Case-2 to verify the robustness of the fully implicit method. Compared to the previous test case with two different permeabilities, in Case-2 there are
three subdomains with the permeabilities of 1 md, 50 md, and 100 md, as shown
in the top panel of Figure 2. We run the test on a 100 × 50 mesh using the fully
implicit and semi-implicit methods. In the simulation, we again use a fixed time step
size 4t = 0.1 year for the fully implicit method and use 4t = 5 × 10−4 year for the
IMPES method. The wetting phase is injected with the flow rate 0.11 PV/year from
Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.
Downloaded 08/04/16 to 109.171.137.210. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
B606
HAIJIAN YANG, CHAO YANG, AND SHUYU SUN
Fig. 1. Case-1: Heterogeneous permeabilities and wetting-phase saturation profiles at 0.5 PVI.
The mesh is 100 × 50. The top-left panel shows the distribution of the heterogeneous permeabilities;
the top-right panel shows the contour plot of the wetting-phase saturation by using RS–NE with
4t = 0.1 year; the bottom panel shows the contour plot of the wetting-phase saturation by using
IMPES with 4t = 5 × 10−4 year.
Fig. 2. Case-2: Heterogeneous permeabilities and wetting-phase saturation profiles at 0.6 PVI.
The mesh is 100 × 50. The top-left panel shows the distribution of the heterogeneous permeabilities;
the top-right panel shows the contour plot of the wetting-phase saturation by using RS–NE with
4t = 0.1 year; the bottom panel shows the contour plot of the wetting-phase saturation by using
IMPES with 4t = 5 × 10−4 year.
the left boundary. The wetting-phase saturation profiles as 0.5 PVI computed by the
two methods are presented in the bottom panel of Figure 2. The results illustrate
that the new method can yet compute a reasonable solution with a much larger time
step size, when compared with the IMPES approach.
Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.
Downloaded 08/04/16 to 109.171.137.210. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
NONLINEAR ELIMINATION FOR TWO-PHASE FLOW
B607
Fig. 3. Case-3: Heterogeneous permeabilities and wetting-phase saturation profiles by using RS–
NE. The mesh is 100 × 50. The top-left panel is for the heterogeneous permeabilities; the top-right
panel is for wetting-phase saturation profiles at 0.2 PVI; the bottom-left panel is for wetting-phase
saturation profiles at 0.4 PVI; the bottom-right panel is for wetting-phase saturation profiles at 0.5
PVI.
Next, we compare the RS–NE method with the IMPES method for Case-3 and
Case-4 in terms of time step size and total compute time. In RS–NE, we consider its
two different versions, i.e., the standard and adaptive RS–NE methods. In Case-3,
the tested medium consists of three subdomains with the different permeabilities of 1
md or 100 md, i.e.,


70 6 x 6 100 and 0 6 y 6 100,
1 md,
k = 1 md,
150 6 x 6 180 and 50 6 y 6 150,


100 md otherwise.
The configurations are shown in the top-left panel of Figure 3. The porosity of
this medium φ is 0.2. The relative permeabilities are quadratic function of water
saturation, i.e., β = 2. The capillary pressure parameter Bc is taken as 100 bar.
The injection rate is 0.2 PV/year. Other relevant parameters are provided in Table
1. We run the test on a 100 × 50 mesh using RS with a fixed time step size 4t =
0.02 year. In Figure 3, we show the contour plots of the calculated wetting-phase
saturation at t = 0.2, 0.4, and 0.5 PVI. We can see that the fully implicit approach
successfully resolves the evolution of the wetting-phase saturations, and we find that
the resemblance between the simulated results and the published results in [32] is
remarkable.
In this case, the domain is highly heterogeneous and the capillary pressure functions differ on the different subdomains, leading to the increase in nonlinearity of
the problem that affects the size of the time step. In Table 2, we compare the fully
implicit method with the semi-implicit method in terms of time step sizes and the
computing time. The time step size of IMPES is selected as 4t = 2.5×10−4 such that
it satisfies the requirement of stability. In the adaptive RS–NE method, we set the
initial time step size to be 4t = 0.01 and then adaptively control it by using (3.14).
Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.
Downloaded 08/04/16 to 109.171.137.210. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
B608
HAIJIAN YANG, CHAO YANG, AND SHUYU SUN
Table 2
A comparison of RS–NE and IMPES for Case-3. The mesh is 100 × 50 on four processors.
The simulation is finished at 0.5 PVI. In the adaptive RS–NE method, we set the initial time step
size to be 4t = 0.01.
Number of time steps
Average time step size (years)
Average nonlinear iteration
Execution time (second)
IMPES
104
2.5 × 10−4
−−
137.4
RS–NE
125
0.02
6.4
54.7
Adaptive RS–NE
42
0.061
12.4
30.9
In the standard RS–NE method, the time step size is fixed to 4t = 0.02. We would
like to emphasize that the IPMES method and the standard RS–NE method cannot
obtain smoothed solutions when the time step size becomes larger. It is clear from
Table 2 that the required number of time steps of RS–NE is much smaller than that
of IMPES. Moreover, we can take a larger time step size for the RS–NE method when
an adaptivity mechanism is used to control the time step size, and in this case, the
execution time can be further reduced, although the averaged number of nonlinear
iterations increases. This indicates the new method has better stability and efficiency
than the IMPES method.
In the following, we compare the RS–NE method with the IMPES method for
Case-4. In Case-4, as shown in Figure 4, the domain is also composed of layers of
alternate permeabilities (1 md and 100 md), i.e.,
(
1 md,
30 + 60i ≤ y ≤ 60 + 60i, i = 0, 1, 2, 3,
k=
100 md otherwise.
The wetting phase is uniformly injected across the left-hand side of the layered domain,
which is initially saturated with the nonwetting phase. The production is across
the opposite right-hand side. The injection rate is 0.11 PV/year. Other relevant
parameters are again given in Table 1. In Figure 4, we show the contour plots of
the calculated wetting-phase saturation for the test case at different PVI. As shown
in the figures, the flow behavior with homogeneous capillary pressure is essentially
influenced by the low permeability field and causes a great delay of displacement of
the layer with the smaller permeability value.
We then compare the performance in terms of time step sizes and the computing
time in Table 3. In the adaptive RS–NE method, we again set the initial time step size
to be 4t = 0.01 year and then adaptively control it by using (3.14). In the IMPES
method, the time step size is changed to 4t = 5 × 10−4 year. In the standard RS–NE
method, the time step size is changed to 4t = 0.03 year. From Table 3, it is clear
that the time step size of the fully implicit method is far larger than that of IMPES,
and the total computing time of the fully implicit method is also much smaller than
that of the semi-implicit method. We remark that even with large time step sizes,
the fully implicit method with adaptive time stepping has a similar accuracy as the
semi-implicit method and the fully implicit method with a fixed, small time step size.
From the above tests, we conclude that the RS–NE method, especially the adaptive time-stepping approach, is faster and more robust than the IMPES method in
terms of time step sizes and the computing time. We would like to emphasize that,
with refined spatial resolution, the advantage of the fully implicit method with adaptive time stepping becomes increasingly pronounced, as seen in the weak scaling tests
to be discussed in the last subsection.
Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.
Downloaded 08/04/16 to 109.171.137.210. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
NONLINEAR ELIMINATION FOR TWO-PHASE FLOW
B609
Fig. 4. Case-4: Heterogeneous permeabilities and wetting-phase saturation profiles at different
times. The mesh is 200×100. The top-left panel is for the heterogeneous permeabilities; the top-right
panel is for wetting-phase saturation profiles at 0.2 PVI; the bottom-left panel is for wetting-phase
saturation profiles at 0.4 PVI; the bottom-right panel is for wetting-phase saturation profiles at 0.6
PVI.
Table 3
A comparison of RS–NE and IMPES for Case-4. The mesh is 200 × 100 on four processors.
The simulation is 0.6 PVI. In the adaptive RS–NE method, we set the initial time step size to be
4t = 0.01.
Number of time steps
Average time step size (years)
Average nonlinear iteration
Execution time (second)
IMPES
10909
5 × 10−4
−−
925.1
RS–NE
182
0.03
5.7
630.3
Adaptive RS–NE
75
0.0735
10.4
397.9
Finally, we test the adaptive RS–NE method with a more difficult test case, in
which the permeabilities of the porous medium are random. The configuration of the
test case is listed as Case-5 in Table 1. In the test, the permeabilities of the porous
medium are generated by a geostatistical model using the open source code MRST
(MATLAB Reservoir Simulation Toolbox) [38], ranging from 0.03 to 400 md. The
heterogeneity of the permeabilities often causes unexpected early water breakthrough
and is challenging for the numerical techniques. In the test we set the initial time
step to 4t = 0.02 and run the simulation on a 100 × 100 mesh. The contour plots of
wetting-phase saturation profiles by using the adaptive RS–NE method at different
times are shown in Figure 5. We can see that the proposed approach successfully
resolves the rapid and abrupt evolution of the wetting-phase saturation while keeping
it within the physically meaningful range.
4.2. Comparison of fully implicit methods. In this subsection, we restrict
ourselves to the class of fully implicit methods for Case-3 and Case-4. In the tests,
we first study some parameters that impact the performance of the adaptive RS–NE
method. These parameters include the threshold Nswitch defined in subsection 3.3
for the nonlinear iterations and the adaptivity parameters ρ and η for controlling the
Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.
Downloaded 08/04/16 to 109.171.137.210. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
B610
HAIJIAN YANG, CHAO YANG, AND SHUYU SUN
Fig. 5. Case-5: Heterogeneous permeabilities (log(k), where k has a unit of md) and wettingphase saturation profiles by using the adaptive RS–NE method. The mesh is 100 × 100. The top-left
panel is for the heterogeneous permeabilities; the top-right panel is for wetting-phase saturation
profiles at 0.1 PVI; the bottom-left panel is for wetting-phase saturation profiles at 0.2 PVI; the
bottom-right panel is for wetting-phase saturation profiles at 0.3 PVI.
time step size 4t. We then compare in detail the classical INB and RS methods with
the adaptive RS–NE method.
4.2.1. The choice of parameters in adaptive RS–NE. A threshold Nswitch
is used to determine the moment of freezing the global nonlinear problems and turning
to the nonlinear elimination step, in order to balance the costs and accuracy of solving
the global and subspace nonlinear problems. So in this test we focus on the performance of the adaptive RS–NE method with different thresholds Nswitch for the first
and second time steps, as shown in Table 4. The table presents the total computing
time, the number of global nonlinear iterations, and the number of subspace nonlinear
iterations and, in parentheses, the average number of iterations for solving the global
and subspace Jacobian systems, respectively. The tests are done on a fixed 400 × 200
mesh using eight processors. We set the initial time step size to be 4t = 2.5 × 10−3
for Case-3 and 4t = 5 × 10−3 for Case-4, respectively. It is observed from the table
that (a) when Nswitch is small, such as two, the proposed adaptive RS–NE method
works poorly or even fails to converge; (b) when we increase Nswitch , the number of
the global linear iterations becomes larger, which may degrade the performance of the
RS–NE method. Based on the above observations, we find that the RS–NE method
with the threshold Nswitch around five is appropriate in terms of the total computing
time and the robustness and stability of the solver.
In the adaptive RS–NE method, we adaptively control the time step size 4t by
using the strategy (3.14). In the simulation, ρ ∈ (0, +∞) is a safeguard to avoid
excessive change of the time step size between any two immediate time steps, and
η ∈ (0, 1] is used to control the adjustment of the time step size. Hence, the parameters ρ and η are used to determine the maximum allowable time step size in the
simulation. In Tables 5 and 6, we show the effect of the parameters for Case-3 and
Case-4. It is observed from the results that the average time step size will increase
Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.
B611
Downloaded 08/04/16 to 109.171.137.210. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
NONLINEAR ELIMINATION FOR TWO-PHASE FLOW
Table 4
A comparison of the threshold Nswitch for Case-3 and Case-4. In the table, “Nonlinear”
denotes the number of inexact nonlinear iterations per time step, “Linear” denotes the average
number of GMRES iterations per nonlinear iteration, and “Time” denotes the total computing time
in seconds. “Global” (“Local”) denotes the performance of RS–NE in the global (subspace) iteration
and “1st-step” (“2nd-step”) denotes the performance of RS–NE at the first (second) time step. The
“∗∗” means divergence of the nonlinear iteration.
Case
Nswitch
Case-3
2
5
10
15
2
5
10
15
Case-4
1st-step: Global (Local)
Nonlinear
Linear
Time
∗∗
15 (8)
30.3 (1.0)
29.6
19 (7)
31.2 (1.2)
31.3
26 (6)
34.7 (1.3)
48.6
34 (101)
24.9 (1.0)
134.9
14 (8)
41.5 (1.0)
36.8
19 (6)
42.8 (1.0)
39.8
18 (2)
46.7 (1.0)
42.7
2nd-step: Global (Local)
Nonlinear
Linear
Time
∗∗
15 (12)
26.0 (1.6)
29.1
16 (10)
27.6 (1.7)
29.4
18 (8)
27.7 (1.8)
34.1
18 (34)
22.5 (1.0)
55.8
17 (18)
24.1 (1.0)
36.9
16 (10)
26.4 (1.0)
29.6
17 (8)
28.8 (1.0)
32.5
Table 5
A comparison of ρ and η for Case-3. The initial time step size is 4t = 0.01. 100 × 50 mesh.
η = 0.75
Number of time steps
Average time step size (years)
Average nonlinear iteration
Execution time (second)
ρ = 2.5
Number of time steps
Average time step size (years)
Average nonlinear iteration
Execution time (second)
ρ = 3.5
30
0.0845
17.5
29.8
η = 1.0
31
0.0814
17.5
30.2
ρ = 3.0
35
0.0729
14.8
30.5
η = 0.9
35
0.0726
14.9
29.8
ρ = 2.5
42
0.0610
12.4
30.9
η = 0.7
44
0.0574
11.8
31.2
ρ = 2.0
52
0.0490
10.5
33.7
η = 0.6
50
0.0511
10.7
33.3
ρ = 1.5
68
0.0368
8.3
36.7
η = 0.5
56
0.0453
11.0
38.0
Table 6
A comparison of ρ and η for Case-4. The initial time step size is 4t = 0.01. 200 × 100 mesh.
η = 0.75
Number of time steps
Average time step size (years)
Average nonlinear iteration
Execution time (second)
ρ = 2.5
Number of time steps
Average time step size (years)
Average nonlinear iteration
Execution time (second)
ρ = 3.5
53
0.1031
14.6
433.0
η = 1.0
52
0.1055
15.1
370.9
ρ = 3.0
62
0.0883
12.4
390.4
η = 0.9
60
0.0913
13.3
387.6
ρ = 2.5
75
0.735
10.4
397.8
η = 0.7
80
0.0683
9.3
397.7
ρ = 2.0
93
0.587
9.2
527.9
η = 0.6
93
0.0592
8.9
438.6
ρ = 1.5
125
0.0439
7.9
522.4
η = 0.5
107
0.0512
7.8
465.3
and fewer time steps are required when we use larger ρ and η, in general resulting
in the reduction of the total computing time; meanwhile it leads to the number of
average nonlinear iterations decreases, hence the cost for solving nonlinear problems
may increase. Observed from the tables, we find that ρ ∈ (1.5, 3.5) and η ∈ (0.5, 1]
are appropriate to guarantee the convergence of the proposed method.
4.2.2. A comparison of INB and RS. The price to pay for using implicit
methods for the fully coupled problem is to solve a nonlinear system (3.13) at each
time step. When solving such nonlinear systems using the Newton method or its
many variations, however, we face the numerical challenges arising from physically
feasible saturation fractions between 0 and 1. As shown in Figure 6, the INB method
Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.
HAIJIAN YANG, CHAO YANG, AND SHUYU SUN
4
4
10
2
10
2
Residual
Residual
10
10
−1
−1
10
10
RS
INB
−3
0
RS
INB
−3
10
10
2
4
6
8
10
12
Nonlinear iteration
14
16
18
20
0
2
4
6
8
10
12
Nonlinear iteration
14
16
18
20
Fig. 6. Nonlinear residual history for Case-3 (left) and Case-4 (right) at the first time step
using the INB and RS methods. The time step size is 4t = 0.01.
4
4
10
10
2
2
10
Residual
Residual
Downloaded 08/04/16 to 109.171.137.210. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
B612
−1
10
10
−1
RS
10
RS−NE−Saturation
−3
−3
10
0
10
2
4
6
RS
RS−NE−Saturation
RS−NE−Pressure
8
10
12
Nonliear iteration
14
16
18
20
0
RS−NE−Pressure
2
4
6
8
10
12
Nonlinear iteration
14
16
18
20
Fig. 7. Nonlinear residual history for Case-3 (left) and Case-4 (right) at the first time step.
In the adaptive RS–NE method, the initial time step size is 4t = 0.01.
fails after eight Newton iterations, which is caused by the floating point exception
from the restriction of the saturation variable. Hence, in the study we use the class
of active-set reduced-space methods to fix this issue. However, along with the simulation moving onward, the nonlinear system becomes higher nonlinearity arising from
heterogeneous permeability of high contrast, strong nonlinearities of relative permeability, and spatially varied capillary pressure functions. As a result, the nonlinear
system is extremely difficult to solve, and the classical RS method is not convergent
when it attempts to do the first nonlinear iteration. The divergence of the nonlinear
iteration for the RS method is caused by the failure of linear search. Hence, we apply
the NE–RS method to resolve the issue.
4.2.3. A comparison of RS and adaptive RS–NE. The RS–NE method
consists of two major ingredients: a subspace correction and a global update. In the
subspace correction step, the effective choice of the components to be eliminated plays
an important role in the success of the whole algorithm.The RS–NE method may be
not convergent when suitable components are not eliminated properly. Hence, we
investigate the influence of the choices on the components to be eliminated. In the
test, we carry out two different strategies: the pressure approach (i.e., removing the
pw -component) and the wetting-phase saturation approach (i.e., removing the Sw component). In Figure 7, we show the results with the choices of removing components
for Case-3 and Case-4 at the first time step. We can see that the pressure approach
cannot make any progress after several global nonlinear iterations are finished and then
we turn to the first subspace correction. This divergence of the nonlinear iteration
for the pressure approach is caused by the failure of linear search. On the other
hand, by using the second strategy, the RS–NE method can make a profit from the
subspace correction, and the number of nonlinear iterations in the global step is
smaller than that for the classical RS method. We would like to point out that the
Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.
B613
NONLINEAR ELIMINATION FOR TWO-PHASE FLOW
3
10
4
2
10
−4
Value
Residual
10
−1
−10
10
10
RS
Gnorm
Lambda
RS−NE
−3
10
−16
0
1
2
3
4
5
6
7
8
Nonlinear iteration
9
10
10
11
5
10
15
20
Line search
25
30
Fig. 8. Case-3: A comparison of RS and adaptive RS–NE. Left panel: Nonlinear residual
history at the second time step. Right panel: History of the step length λ when it attempts to do the
first nonlinear iteration. The mesh is 100 × 50, and the initial time step size is 4t = 0.01.
3
10
4
10
2
10
−4
10
Value
Residual
Downloaded 08/04/16 to 109.171.137.210. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
10
−1
−10
10
10
RS
−3
10
Gnorm
Lambda
RS−NE
−16
0
2
4
6
Nonlinear iteration
8
10
12
10
5
10
15
20
Line search
25
30
35
Fig. 9. Case-4: A comparison of RS and adaptive RS–NE. Left panel: Nonlinear residual
history at the second time step. Right panel: History of the step length λ when it attempts to do the
first nonlinear iteration. The mesh is 200 × 100, and the initial time step size is 4t = 0.01.
pressure approach always fails to work for two-phase flow problems at other time
steps. Hence, in all the following tests we will choose the second strategy for the
subspace correction.
We next investigate the performance of the adaptive RS–NE method with the
classical RS method at the second time step. In Figures 8 and 9, we show the nonlinear
residual history for Case-3 and Case-4, respectively. From the left panel of the figures,
one can see that the nonlinear system is extremely difficult to solve, and the classical
RS method is not convergent when it attempts to do the first nonlinear iteration. The
divergence of the nonlinear iteration for the RS method is caused by the failure of the
linear search. To figure out the reason for the failure, in the right panel of Figures
8 and 9, we plot the histories of the step length λ (denoted by the blue line) and
the norm of nonlinear residual functions (denoted by the red line) when it attempts
to do the first nonlinear iteration. In the process of the first nonlinear iteration, we
first find the search direction by solving inexactly the Jacobian system, then try to
compute the next approximate solution along this search direction by using a step
length λ. The purpose of step length is to determine how far we should go from the
current approximation and meanwhile make sure the reduction of nonlinear residual
function. However, as seen in the right panel of the figures, the value of the nonlinear
residual function stagnates around 10 without any improvement after dozens of line
searches, even for tiny values of step length with λ = 10−15 , while with the help of
nonlinear elimination, it is clear that RS–NE converges much easier and can reach
convergence with up to 12 iterations for these test cases. So we conclude that the
performance of the RS–NE method is better than that of the classical RS method.
We also remark that the classical RS method begins to fail to converge for any time
steps l > 2.
Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.
B614
HAIJIAN YANG, CHAO YANG, AND SHUYU SUN
6
6
10
10
4
10
2
Residual
Residual
Downloaded 08/04/16 to 109.171.137.210. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
4
10
10
−1
10
−3
10
0
2
10
−1
10
Mx=100
Mx=200
Mx=400
2
−3
10
4
6
8
Nonlinear iteration
10
12
14
0
Mx=100
Mx=200
Mx=400
2
4
6
8
10
12
14
16
Nonlinear iteration
Fig. 10. Case-4: A comparison of adaptive RS–NE with different mesh sizes. Left panel:
Nonlinear residual history at the first time step. Right panel: Nonlinear residual history at the
second time step.
Finally, we focus on Case-4 and study the convergence behavior when the mesh
size is gradually refined from 100 × 50 to 400 × 200. In the test, the initial time step
size is 4t = 0.02 for the case of a 100 × 50 mesh and then is reduced by half as the
mesh is refined accordingly. Figure 10 shows the nonlinear residual history at the
first and second time steps. It is observed from the figure that as the mesh is refined
the adaptive RS–NE method always converges and the number of nonlinear iterations
only increases very slowly.
4.3. Parallel performance study. Scalability is an important issue in parallel
computing, and the issue is more significant when solving large-scale problems with
many processors. Therefore, we numerically investigate the weak and strong scalabilities of the proposed algorithm for Case-4 in terms of the computing time and
iterations.
A weak scaling test is carried out to examine the performance of the solver when
the problem size is increased in proportion to the number of processor cores. In this
test, we are particularly interested in stopping the calculation at a target simulation
time. In the test, we start with two processor cores and a 128 × 64 mesh, and the
number of processor cores is increased as the mesh is refined accordingly. We show
in Table 7 the results obtained with the fully implicit method with adaptive time
stepping and the semi-implicit method. In the adaptive fully implicit method, the
initial time step size is 4t = 0.02 for the case of a 128 × 64 mesh and then is reduced by half as the mesh is refined accordingly. In the semi-implicit method, the
time step size is fixed to 4t = 10−3 for the case of a 128 × 64 mesh and then is
reduced by half as the mesh is refined accordingly. However, the IMPES method
is not convergent for the cases of a 512 × 256 mesh with 4t = 2.5 × 10−4 and a
1024 × 512 mesh with 4t = 6.25 × 10−5 . Hence, we choose smaller time step sizes for
these two cases. The simulation is terminated at 0.11 PVI. Table 7 gives a comparison between the computing time results of the fully implicit method with adaptive
time stepping and the semi-implicit method. For the semi-implicit method, the total
computing time can increase by more than one order of magnitude, as the mesh is
refined, which is due to the stability restriction on the time step size. For the fully
implicit method, the total numbers of time steps increase as more processor cores
are used. As a result, the total numbers of nonlinear and linear iterations, as well
as the total compute time, increase rapidly as the mesh is refined. It is clear that
the implicit approach still outperforms the semi-implicit approach, especially for finer
spatial resolutions.
Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.
B615
Table 7
Weak scaling results of the adaptive RS-NE method and the IMPES method. The “−−” means
that the computing time is more than 10 hours.
Number of processors
Mesh size
Initial time step size
Number of time steps (Adaptive)
Average time step size (years)
Average nonlinear iteration
Average linear iteration
Execution time (second)
Number of time steps (IMPES)
Time step size (years)
Average linear iteration
Execution time (second)
2
128 × 64
2 × 10−2
12
0.0918
12.0
2.2
15.7
103
10−3
14.0
20.3
8
256 × 128
10−2
19
0.0553
16.9
14.2
48.0
2 × 103
5 × 10−4
83.4
217.8
32
512 × 256
5 × 10−3
31
0.0331
21.5
22.8
165.2
8 × 103
1.25 × 10−4
192.2
3361.2
128
1024 × 512
2.5 × 10−3
51
0.0196
32.0
31.8
680.8
3.2 × 104
3.125 × 10−5
−−
−−
Table 8
Strong scalability with different number of processors Np for RS–NE.
Mesh
1024 × 512
Np
2
4
16
64
128
32
64
128
256
512
2048 × 1024
Nonlinear
4.4
4.4
4.4
4.4
4.4
6.8
6.8
6.8
6.8
6.8
Linear
4.5
30.5
42.6
46.0
57.8
65.0
65.4
81.2
91.2
111.8
Time
416.6
197.9
63.2
18.8
12.7
388.8
169.8
107.4
72.6
50.1
64
400
32
200
16
100
Speedup
Computing time
Downloaded 08/04/16 to 109.171.137.210. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
NONLINEAR ELIMINATION FOR TWO-PHASE FLOW
50
8
4
Ideal
Mx=1024
Ideal
Mx=2048
25
12.5
2
4
8
2
16
32
64
Processors
128
256
512
2
4
8
16
32
64
Processors
128
256
512
Fig. 11. Strong scalability results with different number of processors Np for RS–NE.
To further study the parallel scalability of the fully implicit method, we perform
the strong scaling test on two fixed meshes and different numbers of processor cores
in Table 8. The test is run with a fixed time step size 4t = 10−5 and the simulation
is terminated at the fifth time step. We can see from the table that the number of
Newton iterations does not change and the number of linear iterations increases mildly
with growth of the number of processor cores. In Figure 11, we report the computing
Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.
Downloaded 08/04/16 to 109.171.137.210. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
B616
HAIJIAN YANG, CHAO YANG, AND SHUYU SUN
time with respect to the number of processors. When the number of processors Np
increases from 2 to 512, the total computing time decreases at a reasonably good
rate, which indicates that the proposed method has a good speedup for this range of
numbers of processors.
5. Conclusions. In this paper, we have proposed an active-set reduced-space
method with nonlinear elimination for the fully implicit simulation of two-phase flows.
In the proposed algorithm, we present a variational inequality formulation of twophase flow problems to avoid nonphysical undershoot or overshoot of the saturation
fractions, and we employ an active-set reduced-space algorithm to solve the resultant
nonlinear complementarity system arising at each implicit time step. To improve the
robustness of the nonlinear solve, a field-split nonlinear elimination preconditioner is
proposed to remove the high nonlinearity that often triggers the failure of convergence.
Numerical results for several test cases have shown that the proposed algorithm is
robust and superior to the classical IMPES method. Although our discussion in this
paper is restricted to the two-dimensional two-phase flows, we believe that the general
methodology is applicable to the other nonlinear multiphase flow problems in two or
three dimensions.
Acknowledgments. The authors would like to thank the anonymous reviewers
for the valuable suggestions leading to the improvement of the paper.
REFERENCES
[1] S. Balay, S. Abhyankar, F. A. Adams, J. Brown, P. Brune, K. Buschelman, L. Dalcin, V.
Eijkhout, W. D. Gropp, D. Kaushik, M. G. Knepley, L. C. McInnes, K. Rupp, B. F.
Smith, S. Zampini, and H. Zhang, PETSc Users Manual, Argonne National Laboratory,
2015.
[2] S. J. Benson and T. S. Munson, Flexible complementarity solvers for large-scale applications,
Optim. Methods Softw., 21 (2006), pp. 155–168.
[3] M. Bergounioux, K. Ito, and K. Kunisch, Primal-dual strategy for constrained optimal
control problems, SIAM J. Control Optim., 37 (1999), pp. 1176–1194.
[4] L. Blank, H. Garcke, L. Sarbu, and, V. Styles, Primal-dual active set methods for AllenCahn variational inequalities with nonlocal constraints, Numer. Methods Partial Differential Equations, 29 (2013), pp. 999–1030.
[5] J. F. Blowey and C. M. Elliott, Curvature dependent phase boundary motion and parabolic
double obstacle problems, in Degenerate Diffusions (Minneapolis, MN, 1991), IMA Vol.
Appl. Math. 47, Springer, New York, 1993.
[6] P. Brune, M. Knepley, B. Smith, and X. Tu, Composing scalable nonlinear algebraic solvers,
SIAM Rev., 57 (2015), pp. 535–565.
[7] X.-C. Cai, W. D. Gropp, D. E. Keyes, R. G. Melvin, and D. P. Young, Parallel NewtonKrylov-Schwarz algorithms for the transonic full potential equation, SIAM J. Sci. Comput.,
19 (1998), pp. 246–265.
[8] X.-C. Cai and M. Sarkis, A restricted additive Schwarz preconditioner for general sparse
linear systems, SIAM J. Sci. Comput., 21 (1999), pp. 792–797.
[9] X.-C. Cai and D. E. Keyes, Nonlinearly preconditioned inexact Newton algorithms, SIAM J.
Sci. Comput., 24 (2002), pp. 183–200.
[10] X.-C. Cai and X. Li, Inexact Newton methods with restricted additive Schwarz based nonlinear
elimination for problems with high local nonlinearity, SIAM J. Sci. Comput., 33 (2011),
pp. 746–762.
[11] Z. Chen, G. Huan, and B. Li, An improved IMPES method for two-phase flow in porous
media, Transport Porous Media, 54 (2004), pp. 361–76.
[12] Z. Chen, G. Huan, and Y. Ma, Computational Methods for Multiphase Flows in Porous
Media, SIAM, Philadelphia, PA, 2006.
[13] K. H. Coats, IMPES stability: the CFL limit, presented at SPE Reservoir Simulation Symposium, Houston, TX, 2001.
[14] C. N. Dawson, H. Klie, M. F. Wheeler, and C. S. Woodward, A parallel, implicit, cellcentered method for two-phase flow with a Newton-Krylov solver, Comput. Geosci., 1
(1997), pp. 215–249.
Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.
Downloaded 08/04/16 to 109.171.137.210. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
NONLINEAR ELIMINATION FOR TWO-PHASE FLOW
B617
[15] J. E. Dennis and R. B. Schnabel, Numerical Methods for Unconstrained Optimization and
Nonlinear Equations, SIAM, Philadelphia, 1996.
[16] V. Dolean, M. J. Gander, W. Kheriji, F. Kwok, and R. Massin, Nonlinear Preconditioning:
How to Use a Nonlinear schwarz to Precondition Newton’s Method, preprint, 2015.
[17] S. C. Eisenstat and H. F. Walker, Globally convergent inexact Newton method, SIAM J.
Optim., 4 (1994), pp. 393–422.
[18] I. Ekeland and R Témam, Convex Analysis and Variational Problems, SIAM, Philadelphia,
1999.
[19] R. G. Fagin and C. H. Stewart, Jr., A new approach to the two-dimensional multiphase
reservoir simulator, SPE J., 6 (1966), pp. 175–182.
[20] R. Glowinski, Numerical Methods for Nonlinear Variational Problems, Springer-Verlag,
Berlin, 2008.
[21] P. T. Harker and J. S. Pang, Finite-dimensional variational inequality and nonlinear complementarity problems: A survey of theory, algorithms and applications, Math. Program.,
48 (1990), pp. 161–220.
[22] M. Hintermüller, K. Ito, and K. Kunisch, The primal-dual active set strategy as a semismooth Newton method, SIAM J. Optim., 13 (2003), pp. 865–888.
[23] M. Hintermüller, V. A. Kovtunenko, and K. Kunisch, The primal-dual active set method
for a crack problem with non-penetration, IMA J. Appl. Math., 69 (2004), pp. 1–26.
[24] H. Hoteit and A. Firoozabadi, Numerical modeling of two-phase flow in heterogeneous
permeable media with different capillarity pressures, Adv. Water Resources, 31 (2008),
pp. 56–73.
[25] F.-N. Hwang and X.-C. Cai, A parallel nonlinear additive Schwarz preconditioned inexact Newton algorithm for incompressible Navier-Stokes equations, J. Comput. Phys., 204
(2005), pp. 666–691.
[26] F.-N. Hwang, H.-L. Lin, and X.-C. Cai, Two-level nonlinear elimination based preconditioners for inexact Newton methods with application in shocked duct flow calculation, Electron.
Trans. Numer. Anal., 37 (2010), pp. 239–251.
[27] F.-N. Hwang, Y.-C. Su, and X.-C. Cai, A parallel adaptive nonlinear elimination preconditioned inexact Newton method for transonic full potential flow problems, Comput. &
Fluids, (2015), pp. 96–107.
[28] K. Ito and K. Kunisch, The primal-dual active set method for nonlinear optimal control
problems with bilateral constraints, SIAM J. Control Optim., 43 (2004), pp. 357–376.
[29] P. J. Lanzkron, D. J. Rose, and J. T. Wilkes, An analysis of approximate nonlinear elimination, SIAM, J. Sci. Comput., 17 (1996), pp. 538–559.
[30] A. Klawonn, M. Lanser, and O. Rheinbach, Nonlinear FETI-DP and BDDC Methods,
SIAM, J. Sci. Comput., 36 (2014), pp. A737–A765.
[31] D. A. Knoll and D. E. Keyes, Jacobian-free Newton-Krylov methods: a survey of approaches
and applications, J. Comput. Phys., 193 (2004), pp. 357–397.
[32] J. Kou and S. Sun, A new treatment of capillarity to improve the stability of IMPES two-phase
flow formulation, Comput. & Fluids, 39 (2010), pp. 1293–1931.
[33] J. Kou and S. Sun, Convergence of discontinuous Galerkin methods for incompressible twophase flow in heterogeneous media, SIAM J. Numer. Anal., 51 (2013), pp. 3280–3306.
[34] D. Krishnan, P. Lin, and A. M. Yip, A primal-dual active-set method for non-negativity
constrained total variation deblurring problems, IEEE Trans. Image Process., 16 (2007),
pp. 2766–2777.
[35] D. Krishnan, Q. Pham, and A. M. Yip, A primal-dual active-set algorithm for bilaterally
constrained total variation deblurring and piecewise constant Mumford-Shah segmentation
problems, Adv. Comput. Math., 31 (2009), pp. 237–266.
[36] F. Kwok and H. Tchelepi, Potential-based reduced newton algorithm for nonlinear multiphase
flow in porous media, J. Comput. Phys., 227 (2007), pp. 706–727.
[37] L. W. Lake, Enhanced Oil Recovery, Prentice-Hall, Englewood Cliffs, NJ, 1989.
[38] K. Lie, S. Krogstad, I. Ligaarden, J. Natvig, H. Nilsen, and B. Skaflestad, Open-source
MATLAB implementation of consistent discretisations on complex grids, Comput. Geosci.,
16 (2012), pp. 297–322.
[39] L. Liu and D. E. Keyes, Field-split preconditioned inexact Newton algorithms, SIAM J. Sci.
Comput., 37 (2015), pp. A1388–A1409.
[40] D. Loudyi, A 2D Finite Volume Model for Groundwater Flow Simulations: Integrating Nonorthogonal Grid Capability into MODFLOW, Ph.D. dissertation, Cardiff University, UK,
2005.
[41] J. E. P. Monteagudo and A. Firoozabadi, Control-volume method for numerical simulation
of two-phase immiscible flow in two- and three-dimensional discrete-fracture media, Water
Resources Res., 40 (2004), W07405.
Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.
Downloaded 08/04/16 to 109.171.137.210. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
B618
HAIJIAN YANG, CHAO YANG, AND SHUYU SUN
[42] J. E. P. Monteagudo and A. Firoozabadi, Comparison of fully implicit and IMPES formulations for simulation of water injection in fractured and unfractured media, Int. J. Numer.
Methods Engrg., 69 (2007), pp. 698–728.
[43] J. Moortgat, S. Sun, and A. Firoozabadi, Compositional modeling of three-phase flow
with gravity using higher-order finite element methods, Water Resources Res., 47 (2011),
pp. W05511.
[44] W. A. Mulder and B. Van Leer, Experiments with implicit upwind methods for the Euler
equations, J. Comput. Phys., 59 (1985), pp. 232–246.
[45] T. S. Munson, F. Facchinei, M. C. Ferris, A. Fischer, and C. Kanzow, The semi-smooth
algorithm for large scale complementarity problems, INFORMS J. Comput., 13 (2001),
pp. 294–311.
[46] M. Munteanu, L. F. Pavarino, and S. Scacchi, A scalable Newton-Krylov-Schwarz
method for the bidomain reaction-diffusion system, SIAM J. Sci. Comput., 31 (2009),
pp. 3861–3883.
[47] M. Osusky and D. W. Zingg, Parallel Newton-Krylov-Schur flow solver for the Navier-Stokes
equations, AIAA J., 51 (2013), pp. 2833–2851.
[48] J. Pebrel, C. Rey, and P. Gosselet, A nonlinear dual-domain decomposition method: Application to structural problems with damage, Internat. J. Multiscale Comput. Engrg., 6
(2008), pp. 251–262.
[49] J. F. Rodrigues, Obstacle Problems in Mathematical Physics, North-Holland, Amsterdam,
1987.
[50] Y. Saad, Iterative Methods for Sparse Linear Systems, 2nd ed., SIAM, Philadelphia, 2003.
[51] J. O. Skogestad, E. Keilegavlen, and J. M. Nordbotten, Domain decomposition strategies
for nonlinear flow problems in porous media, J. Comput. Phys., 234 (2013), pp. 439–451.
[52] B. Smith, P. Bjørstad, and W. Gropp, Domain Decomposition: Parallel Multilevel Methods
for Elliptic Partial Differential Equations, Cambridge University Press, Cambridge, UK,
1996.
[53] S. Sun, D. E. Keyes, and L. Liu, Fully implicit two-phase reservoir simulation with the additive Schwarz preconditioned inexact Newton method, presented at SPE Reservoir Characterization and Simulation Conference and Exhibition, Society of Petroleum Engineers,
2013.
[54] P. Sun, G. Xue, C. Wang, and J. Xu, Fast numerical simulation of two-phase transport
model in the cathode of a polymer electrolyte fuel cell, Commun. Comput. Phys., 6 (2009),
pp. 49–71.
[55] A. Toselli and O. Widlund, Domain Decomposition Methods-Algorithms and Theory,
Springer-Verlag, Berlin, 2005.
[56] P. Wilmott, J. Dewynne, and S. Howison, Option Pricing, Oxford, England, Oxford
Financial Press, Oxford, UK, 1993.
[57] Y. Wu and G. Qin, A generalized numerical approach for modeling multiphase flow and transport in fractured porous media, Commun. Comput. Phys., 6 (2009), pp. 85–108.
[58] C. Yang and X.-C. Cai, A scalable fully implicit compressible Euler solver for mesoscale
nonhydrostatic simulation of atmospheric flows, SIAM J. Sci. Comput., 35 (2014),
pp. S23–S47.
[59] L. C. Young and R. E. Stephenson, A generalized compositional approach for reservoir
simulation, SPE J., 23 (1983), pp. 727–742.
Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.