IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 16, NO. 9, SEPTEMBER 2007 2215 Geometric Direct Search Algorithms for Image Registration Seok Lee, Minseok Choi, Hyungmin Kim, and Frank Chongwoo Park Abstract—A widely used approach to image registration involves finding the general linear transformation that maximizes the mutual information between two images, with the transformation (3)] or volume-preserving being rigid-body [i.e., belonging to [i.e., belonging to (3)]. In this paper, we present coordinate-invariant, geometric versions of the Nelder–Mead optimization (3), (3), and their various subalgorithm on the groups groups, that are applicable to a wide class of image registration problems. Because the algorithms respect the geometric structure of the underlying groups, they are numerically more stable, and exhibit better convergence properties than existing local coordinate-based algorithms. Experimental results demonstrate the improved convergence properties of our geometric algorithms. Index Terms—Image registration, mutual information, Nelder–Mead, optimization, transformation group. I. INTRODUCTION O NE of the fundamental steps in image fusion is the alignment of two or more sets of volumetric image data, commonly referred to as the image registration problem. In one popular formulation of this problem, the 3-D linear transformation is sought that optimizes some fitting criterion between two given image volumes; quite often the transformation is assumed to be either of the rigid body type, i.e., belonging to the special , or of a more general volume preserving Euclidean group type, i.e., belonging to the special linear group . Two-dimensional versions of the above also arise when the data sets are restricted to be planar; in this case, the relevant transformation and . A popular choice of fitting critegroups are rion is the mutual information between the two image volumes [5], [19], [23]. In this paper, we shall not address more general formulations that seek nonlinear diffeomorphisms between two sets of volume data (e.g., [6], [16], and [22]), although such problems can, in some instances, be locally decomposed into a series of subproblems of the above type. Two features of the ensuing optimization make this problem challenging: 1) evaluating the mutual information objective function is computationally very expensive, and 2) the search , , space is nonlinear, in the sense that the groups and their various subgroups do not admit the structure of a vector space. Because of the expense in evaluating the objective Manuscript received December 15, 2006; revised April 10, 2007. This work was supported in part by the Center for Intelligent Robotics, Cybermed, Inc., IAMD-SNU, and in part by the BK21 Program in Mechanical Engineering at Seoul National University. The associate editor coordinating the review of this manuscript and approving it for publication was Dr. Til Aach. The authors are with Seoul National University, Seoul, Korea (e-mail: [email protected]; [email protected]; [email protected]; [email protected]) Digital Object Identifier 10.1109/TIP.2007.901809 function, classical descent methods that rely on finite difference approximations of the gradient are highly inefficient: at each iteration they would require several evaluations of the objective function. Methods based on gradient-free, direct search methods, such as the Nelder–Mead algorithm [18], are a practical alternative. While it is true that the convergence properties of these algorithms are difficult to characterize analytically except in the simplest cases [12], in practice, they have proven to be very effective, particularly when the objective function is reasonably well-behaved, in the sense of being sufficiently smooth, without a large number of local extrema, and an initial guess reasonably close to local optima can be provided [20]. In [20], a Nelder–Mead algorithm formulated in terms of local coordinates is presented for addressing the rigid-body version of the image registration problem. To find the rotation matrix component of the optimal rigid body transformation, the original vector space Nelder–Mead algorithm is formulated in terms of the chosen local coordinates (the Euler angles in this case) that parametrize the rotation group. An obvious drawback with this approach is that local coordinates contain singularities; whenever the iteration ventures near such a singularity, it becomes necessary to transfer to another set of local coordinates, thereby complicating the algorithm structure. Optimistically, one might regard this merely as a tolerable inconvenience. More critically, however, a local coordinate-based algorithm that fails to properly take into account the geometric structure of the search space—some regions may be highly distorted relative to other regions, for example, requiring smaller steps and more numerical iterations—will inevitably lead to very inconsistent, in some cases even catastrophic performance. Several papers, e.g., [7] and [9], show how the geometry of the underlying space needs to be taken into account in the general setting of optimization and numerical analysis on Riemannian manifolds and Lie groups. Similarly, in previous work by the authors [8], classical steepest descent and Newton methods are generalized in a local coordinate-invariant way to quadratic , and the advantages explicitly objective functions on demonstrated. This paper makes two contributions: 1) a geometric—in the sense of being invariant with respect to choice of local coordinates, and respecting the chosen metric structure of the underlying space—generalization of the Nelder–Mead algorithm to and (and by extension the transformation groups their subgroups), and 2) applications of these geometric algorithms to several classes of image registration problems. Our image registration applications treat in detail the various numerical issues encountered in a practical implementation, and we , that improve the suggest modifications, particularly for computational performance of the algorithm. 1057-7149/$25.00 © 2007 IEEE 2216 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 16, NO. 9, SEPTEMBER 2007 Experimental results indicate that, conservatively, performance improvements of around 15% in computational efficiency can be achieved relative to existing local coordinate-based methods. More significantly, our geometric algorithm is shown to be more numerically robust (with respect to, e.g., input image data and initial conditions) than existing methods; for several cases in which local coordinate-based methods failed to converge, our geometric algorithm was able to converge to a physically meaningful solution. This paper is organized as follows. After establishing some geometric preliminaries in Section II, in Section III, we present the geometric version of the Nelder–Mead algorithm on the Lie and . In Section IV, we detail the appligroups cation of this algorithm to the image registration problem. Section V concludes with a summary and suggestions for further extensions. II. GEOMETRIC PRELIMINARIES In this section, we discuss the geometric structure of the Lie and , paying particular attention to the groups metric space structure, and computational formulas for the disare well tance metric and mass centers. The results for known and have been reported in the literature, e.g., [10]; here, we offer only a brief summary of the main results. The results , which, to our knowledge, have not been reported in for the literature, are derived from first principles. An overview of spatial transformations in medical imaging is also given in [1]. A. Rotation and Rigid-Body Transformations To generalize the Nelder–Mead optimization algorithm to the group of proper rotations and the rigid-body transfor, appropriate notions of the “mean” of a set elemations ments in the group, and the “straight line” connecting two elements of the group, are first needed. We begin with precise formulations of these notions. Recall that the special orthogonal consists of the 3 3 real matrices that satisfy group and . The special Euclidean group consists of the 4 4 real matrices of the form with consists of the 4 4 real matrices of the form (3) . where One way in which a Lie group and its Lie algebra are related is by the (matrix) exponential and logarithm mappings. Given , where is a unit vector and , the is defined by the formula map (4) The formula for the logarithm is given by (5) where is any scalar satisfying ; restricting enforces a unique value for the logarithm (with the , two solutions—antipodal points on the exception that if sphere of radius in —exist). Every admits a . Although representation as the exponential of some we will not have use for them here, corresponding formulas for and are available in, e.g., [17]. With the above preliminaries, the natural distance metric on is constructed as follows (see [10] for details). Given two , the distance between these two rotarotations tions, denoted is evaluated according to the formula (6) where denotes the matrix Frobenius norm, i.e., . The above distance metric is natural in the sense of being invariant (up to scale factor) with respect to choice of for reference frames; that is, . any Given this frame-invariant notion of length on , it can be established that the shortest curve connecting two rotations is of the form (1) and , and the lower-left zero element where and have indicates the 3-D zero row vector. Both the structure of a differentiable manifold and an algebraic group (under matrix multiplication), and are examples of Lie groups. Each Lie group has associated with it a Lie algebra, typically consists of the denoted in small letters. The Lie algebra real 3 3 skew-symmetric matrices of the form (7) so that and , and . This curve and . corresponds to the minimal geodesic between the minimal geodesics are one-parameter subOn groups of the form , with and a scalar parameter. ; given of This is not quite the case for the form (8) (2) Given , we shall denote its skew-symmetric matrix representation by . The Lie algebra associated the minimal geodesic connecting is given by (at ) and (at ) (9) LEE et al.: GEOMETRIC DIRECT SEARCH ALGORITHMS FOR IMAGE REGISTRATION where , i.e., the rotation component is simply between and . Similarly, the minimal geodesic on the translation component is the straight line connecting and . The associated distance metric on is (10) It should be noted that this metric is only invariant with respect to choice of fixed (global) reference frame (i.e., left-invariant). ] Moreover, this metric [and for that matter any metric on depends on the choice of length scale chosen for physical space. We refer the reader to [10] for proofs, and a discussion of the physical implications, as well as heuristics for choosing length scales meaningful for the application at hand. In the event that one desires to weight the orientation and position equally in some sense, one possible heuristic is to choose the length scale such that the volume of the position and orientation workspaces with respect to are equal; in this regard, the volume of its natural volume form is evaluated to be . Based on the above, a precise notion of means on the rotation and Euclidean groups can now be established. First, given a set in , one way of defining the of rotations mean rotation is the element that minimizes the criterion 2217 B. Group of Volume-Preserving Transformations adThe group of volume-preserving transformations mits the structure of a matrix Lie group, consisting of the real that satisfy . The associ3 3 matrices consists of the real 3 3 matrices of trace ated Lie group zero. We first discuss a natural choice of local coordinates for , followed by its metric space structure. It is known that any element of the general linear group of nonsingular matrices can be decomposed via the Iwa, where , sawa decomposition [3] as is diagonal with positive entries, and is upper-triangular with unit diagonal entries. Restricting to have positive determito be an element of . The above nant in turn forces decomposition reflects the well-known physical property that a general linear transformation can be decomposed into a combination of rotation, scaling, and shearing. The scaling transfor, mation can be further decomposed as so that scaling can be physically decoupled into a scale factor and stretching component. , an Restricting the above Iwasawa decomposition to can be decomposed as , where element , is diagonal with unit determinant, and is upper-triangular with unit diagonal entries. For example, any volume-preserving transformation in two dimensions can be decomposed as (11) According to Moakher [15], the optimal that globally minimizes , denoted the extrinsic mean, is the orthogonal projeconto tion of (12) Using a result of [11], it can be established that as long as is nonsingular (which is almost always true except for pathological cases), the mean can be expressed as Three parameters corresponding to rotation ( ), stretching ( ), and shearness ( ) can be used to parametrize any element of . In three dimensions, any element can be , where can be paparametrized as rametrized via the usual exponential representation , where , and the stretching transformation and shearing transformation are, respectively, of the form (14) (13) where where and are obtained from the singular value decompo, and if , and othersition wise. Alternatively, one could replace the objective function of ; (11) by the intrinsic distance metric, i.e., in this case the minimizer is often referred to as the intrinsic, or geometric, mean. Because a general closed-form formula for is not available, and the associated the intrinsic mean on numerical procedure for the intrinsic mean is more involved, in our application we will make use of the extrinsic mean above. for a set The corresponding formula for the mean on is now straightforward: the of elements corresponding rotation is given by (13), while the is given by its arithmetic mean. The same translation , and remarks above also apply for the intrinsic mean on we, therefore, use the extrinsic mean instead. and are generated as (15) (16) Developing a metric space structure on is complicated by the fact that, unlike , simple analytic formulas for minimal geodesics between two arbitrary points are no longer is no available. The exponential map longer surjective, and is bijective only in a neighborhood of 2218 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 16, NO. 9, SEPTEMBER 2007 the identity. Moreover, in the appendix we show that, given and on , the minendpoints imal geodesic (with respect to the standard left-invariant metric) connecting and must satisfy (17) (18) . Finding the minimal geodesic thus where involves the solution of a two-point boundary value problem on and , for which multiple solutions typically exist. The geodesics on are also given by the same set of (18), and . but this time with are also geodesics It readily follows that geodesics on . The computational cost of determining the minimal on geodesic obviously needs to be considered in the overall performance analysis of any geometric direct search algorithm. On the other hand, it is known (e.g., [21]) that near the identity, the minimal geodesics are in fact given by left and right , , translations of the one-parameter subgroups . To take advantage of this result, we assume that image registration has already been performed over the rigid-body transformations using, e.g., our gradient-free optimization routine on , and that the image data have been appropriately transformed via this rigid-body transformation. To further refine this solution we now perform a second optimization over with or ; in this case, we can expect that the obtained solution will be reasonably close to the identity. is within the injectivity radius of (that As long as centered at over which is, the radius of the open ball in is surjective), and can be connected via a one-parameter subgroup. Since an analytic charis not available, acterizations of the injective radius on we resort to the following numerical experiment to get a sense of the size of the injectivity radius. We first parametrize an elby the three parameters described ement earlier, along a line from ( 1.5, 1.5, 1.5) to (1.5, 1.5, 1.5); along this line is then calculated. the error The direction of the line is now changed randomly 2000 times by sampling from a uniform distribution over the unit sphere. The error along each line is calculated in the same fashion; the mean errors over the 2000 randomly generated lines are shown in Fig. 1. Due to finite precision an exact match (i.e., zero error) is obtained only at the identity as expected, but the errors are ex. These results suggest that tremely small, on the order of that deviate by as much as ( 1.5, even for elements of Fig. 1. Numerical plot of injectivity radius of SL(2). 1.5, 1.5) from the origin in coordinates, the exponential and logarithm maps are nearly bijective. For completeness, we list the known analytic formulas for the (see, e.g., [2]): given of the form exponential on we have (19), shown at the bottom of the page. onto its Now we present the projection formula of . Because numerically computed values of subgroup invariably drift off to , a projection back is often required. The projecto the nearest element in tion problem can be formulated as the following minimization: , we seek the that minimizes given (20) The optimal solution admits the following simple characterization. admit the singular value deProposition 1: Let , where . The optimal that composition , with and as minimizes (20) is given by obtained above, and a real positive diagonal matrix of unit . determinant that minimizes Proof: See Appendix II. Using this proposition the projection is simplified to finding the unit determinant diagonal matrix with positive entries that . On , given , we minimizes , that minimizes seek the . From the necessary conditions for optimality must satisfy (21) if if if (19) LEE et al.: GEOMETRIC DIRECT SEARCH ALGORITHMS FOR IMAGE REGISTRATION On , given , for optimality, and , we seek the that minimizes . From the necessary conditions must satisfy (22) (23) Although not optimal in the above sense, one convenient to is as follows. Supposing means of projecting is decomposed as , the as before that projection can be obtained by normalizing the elements of to , the corresponding projection have unit determinant. For is (24) while in the case of 2219 denote a nondegenerate simplex consisting of seven Let on . Define the worst point elements to be the point in the simplex at which the objective function attains its largest value. Similarly, the best point is defined to be the point in the simplex at which the objective function attains its smallest value. Further define the four scalar parameters representing the amount of reflection, expansion, contraction, and shrinkage by , , , and , respectively; some widely , , , and . used values are , at each -th iteration one of four operaGiven a simplex tions (reflection, expansion, contraction, or shrinkage) is performed. When a reflection, expansion, or contraction is performed, a new vertex is obtained to replace the worst point of . When a shrink is performed, three new vertexes are obtained, which together with the current best point, constitute the updated simplex for the next iteration. The detailed algorithm is as follows. 1) (Ordering) Relabel the simplex vertexes in increasing order of the objective function value, i.e., such . that the projection is given by (25) In the case of where deviates strongly from 1, it can happen that the result is far from the optimal solution. Under the assumption that rigid registration has already been performed and the given is reasonably close to identity, it is . With this formula similar perfair to assume that formance is obtained, but with better computational efficiency than solving the optimality condition. We close this section with a formula for the sample mean . Given a set of elements , the on (extrinsic) mean that minimizes the criterion (26) coincides with (27) can, in turn, be obtained via the projection of onto , whose solution we indicate earlier. The result can be verified via straightforward calculation, whose details we omit. III. GEOMETRIC NELDER–MEAD ALGORITHM We now describe the Nelder–Mead direct search optimization , where is an algorithm for minimizing some function element of , , or any of their various subgroups. For a detailed description of the classical Nelder–Mead algorithm on vector spaces, see the original [18] and the more recent [12] and [24]. 2) (Reflection) Calculate the reflection point as follows: (28) (29) as where denotes the mean rotation of . If given by (13), and denotes the centroid of , replace by and go to Step 6. 3) (Expansion) If from , find the expansion point (30) (31) , replace by and go to Step 6. If by and go to Step 6. Otherwise, replace 4) (Contraction) If two following contractions. • (External Contraction) If the ouside contraction point , then perform one of the , evaluate according to (32) (33) , replace by If Otherwise, proceed to Step 5. and go to Step 6. • (Internal Contraction) If according to inside contraction point , evaluate the (34) (35) A. Nelder–Mead on Denote the rotation and translation components of by and , respectively. , replace by If Otherwise, proceed to Step 5. and go to Step 6. 2220 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 16, NO. 9, SEPTEMBER 2007 for respectively. 5) (Shrinkage) Calculate the shrinkage vertexes about according to (36) (37) for respectively. . Replace by 6) Go back to Step 1 until stopping criterion . Here left-invariant distance metric on . Replace by 6) Go back to Step 1 until stopping criterion . , for some preset To ensure that the one-parameter subgroups calculated above do not drift off , the projection formulas derived in the previous section can be applied. , for some preset and denotes the of (10). B. Nelder–Mead on We now describe the geometric Nelder–Mead algorithm on . The version is similar, the difference being that the simplex is now composed of four vertexes. As before, given , at each iteration one of four operations (rea th simplex flection, expansion, contraction, or shrinkage) is performed as follows. IV. APPLICATIONS TO IMAGE VOLUME REGISTRATION We now present applications of our geometric direct search algorithm to the mutual information based image registration problem, based on the formulations given in [14], [4], and [23]. We assume as given two image volume data sets and , which we regard as random variables with marginal , , and joint distribution . distributions The Shannon entropy for each distribution, which intuitively reflects the amount of uncertainty in the corresponding random variable, is defined by 1) (Ordering) Relabel the simplex vertexes in increasing order of the objective . function value, i.e., such that 2) (Reflection) Calculate the reflection point (43) (44) as follows: (45) (38) where denotes the mean of , replace 3) (Expansion) If from by . If and go to Step 6. , find the expansion point The mutual information between is defined by and , denoted , (46) (47) (39) , replace by and go to Step 6. If by and go to Step 6. Otherwise, replace 4) (Contraction) If two following contractions. • (External Contraction) If the ouside contraction point , then perform one of the , evaluate according to Given the image intensity values and of a pair of corre, sponding voxels in the two images, estimates for , and can be obtained via normalization of the joint and marginal histograms of the overlapping parts of both images; see [14] for details. that maximizes the We seek the optimal transformation ; the latter demutual information between image and notes the transformed version of image via (40) (48) , replace by If Otherwise, proceed to Step 5. and go to Step 6. • (Internal Contraction) If according to inside contraction point , evaluate the (41) , replace by If Otherwise, proceed to Step 5. and go to Step 6. 5) (Shrinkage) Calculate the shrinkage vertexes about according to (42) In existing approaches, the transformation is parametrized is maxvia some suitable local coordinates, and imized in a vector space setting; for example, in the case of , the Euler angles are a popular choice of local coordinates. In our experiments below, we compare the results of our geometric direct search algorithms with those obtained from local coordinate-based formulations of the Nelder–Mead algorithm. All the simulations and experiments described below are performed with real medical images. The optimizations are performed using Matlab, running on an IBM Pentium 800 MHz with 640 MByte of memory. We use a stopping criterion of . LEE et al.: GEOMETRIC DIRECT SEARCH ALGORITHMS FOR IMAGE REGISTRATION Fig. 2. Comparison of convergence speed between tion. SL(2) and < optimiza- 2221 Fig. 4. Comparison of convergence speed between SE (3) and < optimization. Fig. 4 shows the results of our volume registration ex. The original MRI volume of size periments on 128 128 70 is transformed with the following rigid transformation : (50) Fig. 3. Convergence behavior of larity. SL(2) and < optimization near a singu- We first present simulation results for optimization on . Fig. 4 shows the optimization result; the right upper image is a single slice of a size 201 201 brain CT, while the right bottom image is the deformed image using the following transformation : The graph on the left side shows the convergence behavior of each algorithm. We note that the convergence of the optimization on is faster than on ; the maximum is obtained case, compared to 256 iterations within 305 iterations in the case. in the Fig. 4 shows the results of our volume registration ex. The original MRI volume of size periments on 128 128 70 is deformed with the following transformation : (51) (49) The graph on the left side shows the convergence behavior for each algorithm; note that faster convergence of the algorithm on compared to that for . The maximum is obtained within 78 iterations in the case, compared to 70 iterations case. One can also observe more rapid and steady in the progress in the objective function maximization for the case. In Fig. 4, we use the same set of simulation conditions as before, except that we now intentionally apply a 2-D rotation singularity at zero radians, to compare the convergence propand optimization algorithms near a local erty of the coordinate singularity. In the optimization, zero and are identified as the same point, whereas this is not the case in the vector space optimization. As shown in Fig. 4, the perfordeteriorates significantly; the mance of the optimization in optimal value is obtained in 130 iterations, which is 86% slower . In certain singular cases, local than the optimization on coordinate-based algorithms can even fail to converge. We note that this singularity problem is inherent in every local coordinate-based optimization algorithm. The upper columns on the right show a 3-D view of the original and deformed volumes. The graph on the left compares the conand . Here, vergence behavior of the optimization on is faster we also find that the speed of convergence on than that of . More than 50 simulations with cases similar to the above were performed; these 50 trials do not include cases in which the solutions are in the neighborhood of a local coordinate singularity. Table I shows the average elapsed number of iterations and computation times for the respective algorithms. The optimizations took approximately 20 min in a Matlab environment, on while for selected C++ implementations of corresponding specific cases less than 2 min were required. Based on these results our expectation is that the actual optimization times for the geoor will typically be up to ten metric algorithms on times faster than listed in Table I, which were obtained without any consideration of attaining real-time performance. Based on our experimental comparison results, the geometric , , and are reoptimization algorithms on spectively 4.5%, 16.4% and 19.6% faster than the corresponding vector space optimization algorithms with respect to time, and 2222 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 16, NO. 9, SEPTEMBER 2007 TABLE I CONVERGENCE TIME FOR EACH OPTIMIZATION ALGORITHM Fig. 5. Comparison of convergence speed between tion. SL(3) and < optimiza- 8.6% 16.9%, and 14.1% more efficient in terms of the number of iteration. Fig. 6 shows some representative results of image registra. tion using our geometric Nelder–Mead algorithm on An MFC-based program for loading and manipulating DICOM format medical images was first implemented. Some speed up techniques in [4] were used and the cubic 6 6 interpolation method with continuous second derivatives [13] was used to calculate voxel values after transformation. The first and second columns are planar views of the reference and target volumes, respectively, while the third column displays the fusion results after registration. The fourth column shows the 3-D view of the reference volumes. V. CONCLUSION This paper has presented a geometric generalization of the Nelder–Mead direct search algorithm to the Lie groups and , together with applications to image volume registration. The algorithms are geometric in the sense of being local coordinate-invariant, and respecting the chosen metric structure of the underlying space. The advantages of the geometric approach are that local coordinate singularities do not require any special treatment, and also ensure uniform performance of the algorithm that otherwise might depend on local features of the coordinate representation (e.g., in the chosen coordinate representation, some regions of the search space may be highly distorted relative to other regions). Direct search algorithms are most effective for image registration applications in which the evaluations of the objective function are computationally expensive, and gradient information is unavailable. Simulation and experimental results evaluating the performance of our algorithms have been obtained using real CT and MRI image volumes. Our experimental results indicate that, conservatively, performance improvements of around 15% in computational efficiency can be achieved Fig. 6. Some 3-D volume registration results using SE (3) optimization. (a) Pre- to postoperation image registration result. (b) CT to MRI image of spine registration result. (c) CT to MRI image of spine registration result—saggital view. (d) MRI to PET registration result. relative to existing local coordinate-based methods. More significantly, our geometric algorithms do not exhibit the poor convergence behavior near singularities characteristic of local coordinate-based methods. The results on minimal geodesics and , and the projection formula on , on should also be of independent interest. The direct search algorithms described in this paper are in principle generalizable to other matrix groups, provided that precise notions of the sample mean and minimal geodesics can be characterized. Some natural extensions are to the , and the affine group . Efforts are also underway to use the algorithms in a more general scheme for finding optimal nonlinear diffeomorphisms, by an appropriate decomposition of the global problem into several local versions of the above. APPENDIX I MINIMAL GEODESICS ON In this appendix, we derive the variational equations for minwith respect to the standard left-inimal geodesics on variant metric. We formulate this problem as an optimal control problem, in which the objective is to minimize (52) over subject to (53) LEE et al.: GEOMETRIC DIRECT SEARCH ALGORITHMS FOR IMAGE REGISTRATION with is perturbed to and are given. Assume , where with . Then after some calculation, we have 2223 The original problem can, thus, be recast as a minimization of . From and , it can be seen that the optimal has the same orthogonal part as that of . (54) REFERENCES In other words, the associated is now perturbed to . The associated first variation becomes (55) Integrating the second term by parts leads to (56) Setting this equation equal to zero for all admissible end up with , we (57) The optimality conditions can, therefore, be expressed as (58) (59) , , and boundary conditions given. Note that the above derivation also applies to , so that the optimality conditions are identical, but and . Clearly, the geodesics with are also geodesics on . It also follows that on is constant in time; this implies that the differential equation for is Lipschitz, thereby ensuring existence of a solution. with , APPENDIX II PROOF OF PROJECTION FORMULA ON In this appendix, we provide a proof of Proposition 1. where Suppose the SVD of and with and . After some manipulation (60) , . The maximal value of , attained when , . Note that the maximal value depends on the . It follows that (60) is lower bounded by sign of the . if and Letting if , (60) becomes where is [1] R. Woods, “Spatial transformation models,” in Handbook of Medical Imaging: Processing and Analysis, I. Bankman, Ed. New York: Academic, 2000, p. 465. [2] D. S. Bernstein and W. So, “Some explicit formulas for the matrix exponential,” IEEE Trans. Autom. Control, vol. 38, no. 8, pp. 1228–1232, Aug. 1993. [3] M. Bridson and A. Haefliger, Metric Spaces of Non-Positive Curvature. Berlin, Germany: Springer-Verlag, 1999. [4] M. Capek, L. Mroz, and R. Wegenkittl, “Robust and fast medical registration of 3D multi-modality data sets,” in Proc. Int. Fed. Medical and Biological Engineering, 2001, vol. 1, pp. 515–518. [5] A. Collignon, F. Maes, D. Delaere, D. Vandermeulen, P. Suetens, and G. Marchal, “Automated multi-modality image registration based on information theory,” Inf. Process. Med. Imag., pp. 263–274, 1995. [6] M. De Craene, A. du Bois d’Aische, and B. Macq, “Multi-modal nonrigid registration using a stochastic gradient approximation,” in Proc. IEEE Int. Symp. Biomedical Imaging: Macro to Nano, 2004, vol. 2, pp. 1459–1462. [7] A. Edelman, T. A. Arias, and S. T. Smith, “The geometry of algorithms with orthogonality constraints,” SIAM J. Matrix Anal. Appl., vol. 20, no. 2, pp. 305–353, 1998. [8] S. Gwak, J. Kim, and F. C. Park, “Numerical optimization on the Euclidean group with applications to camera calibration,” IEEE Trans. Robot. Autom., vol. 19, no. 1, pp. 65–74, Jan. 2003. [9] A. Iserles, H. Z. Munthe-Kaas, S. P. Norsett, and A. Zanna, “Lie-group methods,” Acta Numer., vol. 9, pp. 215–365, 2000. [10] F. C. Park, “Distance metrics on the rigid-body motions with applications to mechanism design,” ASME J. Mech. Des., vol. 117, no. 1, pp. 48–54, 1995. [11] F. C. Park, J. Kim, and C. Kee, “Geometric descent algorithms for attitude determination using GPS,” AIAA J. Guid., Control, Dyn., vol. 23, no. 1, pp. 26–33, 2000. [12] J. C. Lagarias, J. A. Reeds, M. H. Wright, and P. E. Wright, “Convergence properties of the Nelder–Mead simplex algorithm in low dimensions,” SIAM J. Optim., vol. 9, no. 1, pp. 112–147, 1998. [13] T. M. Lehmann, “Survey: Interpolation methods in medical image processing,” IEEE Trans. Med. Imag., vol. 18, no. 11, pp. 1049–1075, Nov. 1999. [14] F. Maes, A. Collignon, D. Vandermeulen, G. Marchal, and P. Suetens, “Multimodality image registration by maximization of mutual information,” IEEE Trans. Med. Imag., vol. 16, no. 2, pp. 187–189, Feb. 1997. [15] M. Moakher, “Means and averaging in the group of rotations,” SIAM J. Matrix Anal. Appl., vol. 24, no. 1, pp. 1–16, 2002. [16] M. Moshfeghi, S. Ranganath, and K. Nawyn, “Three-dimensional elastic matching of volumes,” IEEE Trans. Image Process., vol. 3, no. 2, pp. 128–138, Mar. 1994. [17] R. M. Murray, Z. Li, and S. S. Sastry, A Mathematical Introduction to Robotic Manipulation. Boca Raton, FL: CRC, 1994. [18] J. A. Nelder and R. Mead, “A simplex method for function minimization,” Comput. J., vol. 7, pp. 308–313, 1965. [19] J. P. W. Pluim, J. B. A. Maintz, and M. A. Viergever, “Mutual-information-based registration of medical images: A survey,” IEEE Trans. Med. Imag., vol. 22, no. 8, pp. 986–1004, Aug. 2003. [20] R. Shekhar and V. Zagrodsky, “Mutual information-based rigid and nonrigid registration of ultrasound volumes,” IEEE Trans. Med. Imag., vol. 21, no. 1, pp. 9–22, Jan. 2002. [21] M. Spivak, A Comprehensive Introduction to Differential Geometry. Berkeley, CA: Publish or Perish, 1979. [22] E. Suarez, R. Cardenes, C. Alberola, C.-F. Westin, and J. Ruiz-Alzola, “A general approach to nonrigid registration: Decoupled optimization,” presented at the 23rd Annu. Inf. Conf. IEEE Engineering in Medicine and Biology Society, 2001. [23] P. A. Viola and W. M. Wells, III, “Alignment by maximization of mutual information,” Int. J. Comput. Vis., vol. 24, no. 2, pp. 137–154, 1997. [24] M. H. Wright, “Direct search methods: Once scorned, now respectable,” in Proc. Dundee Biennial Conf. Numerical Analysis, 1995, pp. 191–208. 2224 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 16, NO. 9, SEPTEMBER 2007 Seok Lee received the B.S., M.S., and Ph.D. degrees in mechanical and aerospace engineering from Seoul National University, Seoul, Korea, in 2000, 2002, and 2007, respectively. He currently serves as a R&D staff member in the Computing and Intelligence Laboratory, Samsung Advanced Institute of Technology. His research interests are in robotics, image processing, medical imaging, and related areas of computer vision. Minseok Choi received the B.S. degree in mechanical engineering and mathematics from Seoul National University, Seoul, Korea, in 2002, where he is currently pursuing the M.S. degree. From 2002 to 2005, he was a Software Engineer with Denali IT and Empas. His research interests include motion planning, computer vision, and related areas of applied mathematics. Hyungmin Kim received his engineering degree in mechanical engineering from Seoul National University, Seoul, Korea, in 1998, and the M.S. degree from the Robotics and Automation Laboratory, Seoul National University. He is currently pursuing the Ph.D. degree in biomedical engineering at the University of Bern, Bern, Switzerland. After his degree courses, he entered himself into the medical industry, developed medical imaging softwares and surgical navigation systems at Cybermed, Inc., for six years. He serves as a Research Associate at MEM Research Center, University of Bern. Frank Chongwoo Park received the B.S. degree in electrical engineering and computer science from the Massachusetts Institute of Technology, Cambridge, in 1985, and the Ph.D. degree in applied mathematics from Harvard University, Cambridge, in 1991. He is a Professor of mechanical and aerospace engineering AT Seoul National University, Seoul, Korea. He was an Assistant Professor of mechanical and aerospace engineering at the University of California, Irvine. His research interests are in robotics and computer vision, mathematical systems theory, and related areas of applied mathematics. Dr. Park currently serves as the 2007 IEEE Distinguished Lecturer for the Robotics and Automation Society, as a Senior Editor for the IEEE TRANSACTIONS ON ROBOTICS, and as an Area Editor for the forthcoming Springer-Verlag Handbook of Robotics.
© Copyright 2026 Paperzz