Image and Vision Computing 23 (2005) 1029–1040 www.elsevier.com/locate/imavis Dynamic B-snake model for complex objects segmentation Yue Wang, Eam Khwang Teoh* School of Electrical and Electronic Engineering, Nanyang Technological University, Block S2, Nanyang Avenue, Singapore 639798, Singapore Received 13 May 2004; received in revised form 30 May 2005; accepted 1 July 2005 Abstract A close-form B-Snake model using statistics information for 2D objects segmentation is presented in this paper. We called it Dynamic B-Snake Model (DBM). It is able to model the features of the object in training set and guide the B-Snake in the deforming procedure. Compared to other deformable models, the DBM maintains the smoothness of curve while still remain compact representation. Moreover, a method of Minimum Mean Square Error (MMSE) is developed to iteratively estimate the position of those control points in the B-Snake model. As it deforms the segments of the B-Spline at a time, instead of individual points, it is very robust against local minima. Furthermore, in order to use available statistical information about the desired object shape, the Principal Component Analysis (PCA) is applied to model the distribution of knot points of training samples. This allows the deformation of B-Snake to synthesize the shape similar to those in the training set. By applying the proposed B-Snake model to medical images, it is shown that our method is more robust and accurate in comparing with the traditional Snake and Active Shape Model(ASM). q 2005 Elsevier Ltd. All rights reserved. Keywords: Deformable model; Active contour; Snake; B-Spline; B-snake; Object segmentation; Principal component analysis 1. Introduction Object segmentation is a very important procedure in image analysis, computer vision, and medical imaging. Many medial image analysis applications, like the measurement of anatomical structures, require prior segmentation of the organ from the surrounding tissue. However, it is a difficult task due to the variations of objects and the diversities of image types. For example, one important task in medical imaging is the boundary extraction of the brain ventricle from magnetic resonance (MR) images. But the variations of shapes and sizes of the ventricles with the ages usually make the extraction procedure difficult. Our special interest is the human brain ventricle segmentation in MR images for further study. The Snake was originally developed by M. Kass [1]. It was curve defined within an image domain which can move under the influence of internal forces from the curve itself and external forces from the image data. * Corresponding author. Tel.: C65 67905393; fax: C65 67912687. E-mail addresses: [email protected] (Y. Wang), eekteoh@ntu. edu.sg (E.K. Teoh). 0262-8856/$ - see front matter q 2005 Elsevier Ltd. All rights reserved. doi:10.1016/j.imavis.2005.07.006 Once internal and external forces have been defined, the Snake can detect the desired object boundaries (or other object features) within an image. From the original philosophy of Snake, there are many techniques have been proposed as an alternative method for presenting a curve: Fourier descriptors [2], B-Splines [3,7,9,10,21,26], auto-regressive model [4], moments [5], HMM models [6] and wavelets [8], etc. Among these methods, B-Spline representation of the curve stands for an advantage as such a formulation of a deformable model allows for the local control and a compact representation. The flexibility of the curve increases as more control points are added while still remain less parameters compared to other model. Moreover, the internal force is not needed as the smoothness requirement has been implicitly built into the model. In this paper, we present a closed-form B-Snake model called Dynamic B-Snake Model (DBM) with a Minimum Mean Square Error (MMSE) approach [24]. It possesses three main novelties: (1) Internal forces are not required in DBM since the B-Snake representation maintains smoothness via hard constrains implicit in the representation while remains compact representation. 1030 Y. Wang, E.K. Teoh / Image and Vision Computing 23 (2005) 1029–1040 (2) A method of MMSE is developed to iteratively estimate the position of those control points in the closed B-Snake model. As MMSE deforms the segments of the B-Spline at a time, instead of individual points, it is very robust to against local minima. (3) In order to use available statistical information about the desired object shape, the Principal Component Analysis (PCA) has been applied to our B-Snake model. This approach allows the deformation of B-Snake to synthesize the shape similar to those in the training set. The structure of this paper is arranged as follows. In Section 2, a review of the existing B-Snake model is presented. Section 3 introduces the B-Snake Model with the algorithm for estimating the parameters of B-Snake. In Section 4, the statistic model is given to model the samples of the training set and guide the B-Snake deformation. The simulation results are shown in Section 5. This paper concludes in Section 6. 2. Related works A deformable B-Spline algorithm was presented in [9] for determining vessel boundaries and enhancing their centerline features. A bank of even and odd S-Gabor filter pairs of different orientations are convolved with vascular images. The resulting responses across filters of different orientations are combined to create an external energy field for Snake optimization. Vessels are represented by cubic B-Snake, and are optimized on filter outputs with dynamic programming. In this algorithm, the starting and the end points have to be fixed, and the dynamic programming is very slow for the deformation of B-Snake. Stammberger [19] proposed a B-Snake algorithm for the segmentation of the knee joint cartilage from MR images by using a multi-resolution approach. As the external forces are generated by image edges and the distance transformation of a standard model, the B-Snake may not deform to a desired object, because there is no statistical information has been included in this algorithm. Mário presented an approach to unsupervised contour representations and estimations by using B-Spline [20]. The problem is formulated in a statistical framework with the likelihood function being derived from a region-based image model. However, no any priori knowledge of the shape is used in this model. Wang [21–23] presented a B-Snake based lane model for lane detection. The external forces in this model are designed based on the perspective relationship of lane boundaries on the image plane. However, although the results are good in lane detection, the number of control points is fixed to three, this limits the capability to describe the complex shape. In their later paper [26], a B-Snake model with a strategy of adaptive control point insertion was proposed for segmenting the complex structures in medical images. However, in all of above methods, no any priori knowledge of the shape is used. For the case that the priori information is available, current researches make it possible to be implemented into the Snake model. Cootes presented a Point Distribution Model (PDM) [11] for building flexible shape models. The shape is represented by a set of landmark points. The shapes are aligned and the deviations from the mean are analyzed using principal component analysis. As an extension research work to the point distribution model, Baumberg proposed a cubic B-Spline model [10] for detecting and tracking the walking pedestrians. The control points of B-Spline are treated exactly the same way as the labeled points of point distribution model [11]. This method has been applied to a real-time processing system. The Active Shape Model (ASM) was introduced by Cootes et al. [12] to improve the PDM for object segmentation. The object’s contour is presented by a set of fix number of landmark points. PDM is constructed to modeling the landmark points’ distribution of these shapes. The local intensity structure of each landmark point is constructed as well. ASM segmentation method involves finding the new landmark points with the similar intensity value and the new shape of landmark points is generated that is similar to the shapes in the training set. ASM parameters are then updated by fitting the shape to the target points, a new contour is generated and new target points searched. This process is repeated until convergence. ASM has been used for several segmentation tasks in medical images [13–18]. The shortcomings of ASM are, firstly these landmark points have to be manually chosen for describing the whole shape. That means these landmark points are not only those key points of shape, but also other points between them. They may not possess the affine-invariant property, this will lead to the problem that the affine transformation may not be recovered properly. Secondly, as the method works by modeling how different landmark points tend to move together as the shape varies, if the labeling is incorrect, with a particular point placed at different sites on each training shape, the method will fail to capture shape variability. Thirdly, as the deformation for each model points on the contour is independently, the gross error may rise and the contour may be pulled away from the real boundary position. Motivated by the above problems, we here present a knowledge based B-Snake model, Dynamic B-Snake Model, for object segmentation. It is the extension of our previous research work [26]. This model is combined with the robustness of B-Spline and ASM. It consists with a reconstruction method of B-Spline curve and an affine invariant alignment strategy for PCA to synthesize the shape of B-Snake similar to those in the training set, and a MMSE to perform a fine define deformation for matching the object boundaries more precisely. The details are given in the followings. Y. Wang, E.K. Teoh / Image and Vision Computing 23 (2005) 1029–1040 1031 Where 3. On B-Snake model The traditional Snake is a point-based deformable model. The curve is represented by a set of discrete points. Its deformation is driven by the external and internal forces that are calculated from an energy function. From the original philosophy of Snake, an alteration is using a parametric B-Spline representation as the curve descriptor. Compared to traditional point-based Snake, the B-Snake greatly reduces the number of state variables required for a Snake. We have implemented it successfully in lane detection [21–23] and modeling complex object contour [26]. B-Snake can be either open or close with any order. In this paper, we are concerning to implement the close cubic B-Spline to B-Snake. 3.1. Close cubic B-Splines 2 MR ðSÞ Z ½ s3 s2 1 1 1 6K6 2 K2 6 6 6 1 1 6 6 2 K1 2 6 s 1 6 6 1 1 6K 6 2 0 2 6 6 6 1 2 1 4 6 3 6 3 1 67 7 7 7 07 7 7 7: 7 07 7 7 7 7 05 3.2. The B-Snake model The cubic B-Snake is defined as follows: X rðsÞ Z gi ðsÞ; where 0% s% 1: (2) i A close cubic B-Spline (Fig. 1) has control nC1 points {QiZ[xiyi]T, iZ{0,1,.,n}, and nC1 connected curve segments {gi(s)Z(xi(s),yi(s)), iZ1,2,.,nC1}. The connection points between curve segments are called the knot points Pi(iZ1,2,.,nC1), where the B-spline bases are tied together, PiZgi(0), (iZ1,2,.,nK1). Each curve segment is a linear combination of four cubic polynomials by the parameter s, where s is normalized between 0 and 1 (0%s%1). That is, For a cubic B-Snake, gi(s) is defined by Eq. (1). The control points are positioned so that the curve locates the desired object contour. The external energy term on r(s) is defined as E(r(s)). There are no internal forces since the B-Spline representation maintains smoothness via hard constraints implicit in the representation. Therefore, the total energy function of the B-Snake EB-snake can be defined by integrating E(r(s)) along the B-Snake. That is, ð1 EB-snake Z EðrðsÞÞ ds: 3 2 QðiK1ÞmodðnC1Þ 7 6 6 QimodðnC1Þ 7 7; i Z 1; 2; .; n C 1: 6 gi ðsÞ Z MR ðsÞ6 7 4 QðiC1ÞmodðnC1Þ 5 QðiC2ÞmodðnC1Þ (3) 0 (1) To identify the edge features in the image, the above expression must be minimized along the B-Spline curve. So that, at equilibrium (when image forces vanish), r(s) stabilizes close to a contour of object. Fig. 1. A closed cubic B-spline curve. 1032 Y. Wang, E.K. Teoh / Image and Vision Computing 23 (2005) 1029–1040 2 3.3. Minimum mean square error approach Gradient Vector Flow (GVF) [27] is selected here as the definition of external forces for B-Snake, since GVF has a larger capture range. It is computed as a diffusion of the gradient vector of a gray-level or binary edge map derived from the image. When the B-Snake is on the boundaries of object, its total external force Fext that is summation of the localized external force Fext(r(s)) should be zero. That is, ð1 Fext Z fext ðrðsÞÞ ds Z 0: (4) 2 s3i;1 6 3 6 si;2 Mi Z 6 6 $ 4 s3i;m s2i;1 si;1 s2i;2 $ si;2 $ s2i;m si;m 1 1 1 6K6 2 K2 6 36 1 6 1 1 6 76 2 K1 2 6 17 76 7 1 $ 56 6K1 0 6 2 2 1 6 6 6 1 2 1 4 6 3 6 3 1 67 7 7 7 07 7 7 7; 7 07 7 7 7 7 05 i Z 1; 2; .; nK2: 0 3 1 1 1 0 K K 6 6 2 27 7 6 7 36 7 1 6 1 1 7 60 K1 7 76 2 2 7 6 17 7; 76 7 6 7 1 1 $ 56 7 6 0 K2 0 2 7 7 6 1 6 7 7 6 40 1 2 1 5 6 3 6 2 On the other hand, if the total external force of the B-Snake is zero, then there will be no change for the position of B-Snake. We assume that at time t the localized external force fext(rt(s)) is connected with the position change of B-Snake rt(s)KrtK1(s) in the following way, fext ðr t ðsÞÞ Z gðr t ðsÞKr tK1 ðsÞÞ; 2 6 3 6 snK1;2 s2nK1;2 snK1;2 MnK1 Z 6 6 $ $ $ 4 s3nK1;m s2nK1;m snK1;m (5) where g is a step-size. The position change of the B-Snake can be mapped to the adjustment of the control points at Q time t. For example, DQ(t) is defined as the adjustment at time t, then we have QðtÞ Z QðtK1Þ C DQðtÞ: (6) 2 K1 T M M F (7) where g is the step size, and M is a matrix [26] calculated from Eq. (1): 2 M1 6 0 6 6 6 $ 6 6 6 0 6 M Z6 0 6 6 6 M0 6 n1 6 6 M0 4 n 0 MnC1 where 0 . . M2 $ 0 $ . $ . 0 MnK3 . . 0 0 . 0 0 . 0 0 . 0 0 s3nK1;1 s2nK1;1 snK1;1 s3nK1;m s2nK1;m snK1;m 2 2 s3n;1 s2n;1 sn;1 6 3 6 sn;2 s2n;2 sn;2 Mn Z 6 6 $ $ $ 4 s3n;m s2n;m sn;m 3 7 7 7 7 7 7 0 7 7 ; MnK2 7 7 7 MnK1 7 7 7 Mn 7 5 0 $ MnC1 60 6 36 1 6 6 76 0 6 17 76 7 $ 56 60 6 1 6 6 6 40 2 (8) 2 1 36 1 66 76 60 17 76 6 7 $ 56 60 6 1 4 0 6 3 6 snK1;2 s2nK1;2 snK1;2 0 MnK1 Z6 6 $ $ $ 4 The localized external force fext(rt(s)) can be sampled along the B-Snake. In this way, we can solve the Eq. (5) digitally. Here, the Minimum Mean Square Error solution for the digital version of the Eq. (5) is given as a matrix form [24], DQðtÞ Z gK1 ½ M T s3nK1;1 s2nK1;1 snK1;1 2 s3n;1 s2n;1 sn;1 6 3 6 sn;2 s2n;2 sn;2 Mn0 Z 6 6 $ $ $ 4 s3n;m s2n;m sn;m 0 K 0 1 2 0 K 0 1 6K 2 6 36 1 6 6 1 76 2 6 17 76 7 $ 56 6 1 6 2 1 6 6 6 1 4 6 1 6 1 2 1 6 3 0 0 07 7 7 0 0 07 7 7; 7 0 0 07 7 5 0 0 0 3 1 2 7 7 7 7 K1 7 7 7 7; 7 0 7 7 7 7 2 7 5 3 3 1 0 07 6 7 7 7 0 0 07 7 7 7; 7 0 0 07 7 7 7 7 0 0 05 Y. Wang, E.K. Teoh / Image and Vision Computing 23 (2005) 1029–1040 1033 Fig. 2. B-Snake in human brain ventricle contour extraction. 2 2 s3nC1;1 s2nC1;1 snC1;1 6 3 6 snC1;2 s2nC1;2 snC1;2 MnC1 Z 6 6 $ $ $ 4 s3nC1;m s2nC1;m snC1;m 60 6 36 1 6 6 76 0 7 1 76 6 6 $7 56 0 6 1 6 6 6 40 2 2 s3nC1;1 s2nC1;1 snC1;1 6 3 6 snC1;2 s2nC1;2 snC1;2 0 MnC1 Z6 6 $ $ $ 4 s3nC1;m s2nC1;m snC1;m 3 1 0 0 K 7 67 7 1 7 7 0 0 2 7 7 7; 17 0 0 K 7 27 7 7 1 7 5 0 0 6 1 1 6 2 K2 6 36 1 6 1 6 76K1 2 6 17 76 7 $ 56 6 0 1 6 2 1 6 6 6 2 1 4 3 6 3 1 07 6 7 7 7 0 07 7 7 7: 7 0 07 7 7 7 7 0 05 knowledge-based strategy for B-Snake deformation using B-Spline re-construction and statistical information of a training set. In order to use statistical information to guide the B-Snake deformation, the correspondence problem between two shapes should be solved. First we have to fix the number of control points of B-Snake, and then find the corresponding knot point between the training sets. Every B-Spline shape in the training set has to be aligned before extracting the statistical geometric information by using PCA. PCA will then guide the B-Snake deformation in the new image with a similar shape in the training set. 4.1. Re-construction of B-Spline curve Re-construction of the B-Spline with a fix number of control point is based on re-assigning the knot points for each segment with equal affine-arclength [29], since the equal arclength is not preserved under affine transformation, but affine-arclength will do. Affine-arclength is defined as: and si,m is the parameter s to form the sampling point m in ith segment, FZ[.fext(rt(si,m)).]. In real practice, B-Snake is initialized by four control points shown in Fig. 2(a). Associated with an adaptive control point insertion strategy presented in [26], the final result of B-Snake in human brain ventricle contour extraction is shown in Fig. 2(b), the number of control points is increased to 17. tl ðsÞ Z 4. B-Snake model using statistical geometric information Here we chose m as the number for the new knot points of re-constructed B-Spline curve, the length between two conscious knots is equal affine arc-length. Suppose the new knots are PZ(P0,P1,P2,.,PmK1) the corresponding control points are defined as: PCA is a classical statistical method [28]. This linear transform has been widely used in data analysis and processing. We have implemented PCA to B-Snake as an effect approach to the data. In this section, we suggest a Ðs ðx_l y€ l Kx€ l y_l Þ1=3 0 P Ðs i (9) ðx_i y€ i Kx€ i y_i Þ1=3 0 Q Z AK1 P (10) 1034 Y. Wang, E.K. Teoh / Image and Vision Computing 23 (2005) 1029–1040 Fig. 3. A B-Snake with 40 control points. where 2 1 66 6 6 6 60 6 6 6 6$ 6 6 AZ6 $ 6 6 6 6 60 6 6 6 6 40 2 3 1 6 0 1 6 $ 2 3 $ 1 6 $ $ $ . . . . 0 . $ $ $ $ $ 0 1 6 2 3 1 6 . 0 1 6 2 3 3 07 7 7 7 07 7 7 7 $7 7 7 $7 7 7 7 7 07 7 7 7 17 5 6 affine transformation Aalign between each two alignment could be estimated. Please see the triangles’ area in Fig. 4 as a demo example on the attribute vector. Some results of shape alignment based on this method are shown in Fig. 5. 4.3. Modeling the shape distribution from the training set (11) Fig. 3 shows an example of 40 control points of B-Snake, it is re-constructed from Fig. 2(b) which has 17 control points. 4.2. The shape alignment strategy To determine the correspondence between two B-Snakes, a shape algorithm [25] which is using an affineinvariant feature has been implemented in B-Snake model. In this algorithm, for every knot point Pi obtained in last Section, an attribute vector Fi which is calculated by the areas formed by adjacent knot points has been assigned to it. As the attribute vector holds parameters that describe the shape characteristics of the adjacent knot points around each knot point, the far adjacent knot points represents more global properties of the contour of knot points. Therefore, the attribute vector corresponding to Pi can be expected to be different from attribute vectors of other points. As the area is related affine invariant, the attribute vectors are affine-invariant as well. Hence, shape alignment could be achieved by an error minimization process and the assumed An effective approach is to apply PCA to the training data. The data form a cloud of points in the n-d space. PCA computes the main axes of this cloud, allowing one to approximate any of the original points using a model with fewer than n-d parameters. Given a set of the training data of aligned knot points of B-Snake, {Pi}, the approach is as follows: (1) Compute the mean of data, P T and the covariance of the data, C. (2) Compute the eigenvectors, fi and corresponding eigenvalues li of C, and sort fi by the size of their corresponding eigenvalues. Fig. 4. Illustration of the concept of the ‘attribute vector’ on the ith knot point. The area of a triangle P[iKvs]P[i]P[iCvs] is used as the vsth element of the ith attribute vector Fi. Here, 1%vs%n/2. Y. Wang, E.K. Teoh / Image and Vision Computing 23 (2005) 1029–1040 1035 Fig. 5. Some aligned results of B-Snake model. (3) Let F contains the m eigenvectors corresponding to largest eigenvalues,0 then use below formulas for fitting the knot point vector P of the model to the knot point vector P of the aligned B-Snake shape: P 0 Z P T C FH (12) where H is a m dimensional vector given by H Z FT ðPKP T Þ (13) The vector H defines a set of parameters of a B-Snake model. By varying the elements of H, the shape of knot points, 0 P , can be varied as well by using Eq. (12). The number of eigenvectors to retain, m, can be chosen so that the model represents some proportion (98%) of the total variance of the data, or so that the residual terms can be considered noise.0 0 Once obtaining the best knot point vector P , we transform P back to update the positions of knot points in the current B-Snake by using the affine-transformation matrix Aalign. 4.4. The complete DBM algorithm The complete algorithm using both affine invariant alignment and statistical information is as follows: ; iZ 0; 1; .; Ng from the (1) Get a reference model fPmodel i training set. (2) Initialize the B-Snake as fPmodel ; iZ 0; 1; .; Ng. i (3) Deform the B-Snake by using MMSE to minimize external force (Section 3.3). If the iteration number exceeds a predefined number or external force of B-Snake is below a define value, go to step 6. (4) Align the current B-Snake configuration with the standard model contour by using the affine-transformation matrix Aalign calculated from the B-Snake to the model [25]. Then, stack the aligned B-Snake as a knot point vector align T P Z ½xalign ; yalign ; .; xalign N ; yN 0 0 Fig. 6. Deformation of DBM in ventricle segmentation. 1036 Y. Wang, E.K. Teoh / Image and Vision Computing 23 (2005) 1029–1040 Fig. 7. Some contour extraction results of different MR brain images using DBM. (a)–(l): the green contour denotes the initial shape of B-Snake while the bright one denotes the final result. (5) Map the knot point vector into the new vector P into the new vector P 0 using the statistical model, P 0 Z P T C FH, which is described in Section 4.3. Then, transform P 0 back into the original coordinate space of the B-Snake via the inverse matrix of Aalign and update the B-Snake. Go to step 3. (6) Stop. 5. Experimental results and discussion The algorithm presented above has been implemented in medical images for object segmentation. Two experiment results are presented. First, the performance of DBM for ventricle segmentation in MR images are presented. Second, the qualitative and quantitative Y. Wang, E.K. Teoh / Image and Vision Computing 23 (2005) 1029–1040 1037 Fig. 8. A failed case of DBM. comparisons of DBM with the traditional Snake and ASM are presented as well. 5.1. Experiment results on object segmentation We are using 105 shapes of ventricle contour with 40 control points to form the training sets in this application. One of the shapes in the training set was chosen as the initialization of B-Snake. Fig. 6(a) shows an initialization of B-Snake, the initial shape is taken from the training set. The deformation of DBM is showed in Fig. 6(b)–(e), there are a total of 17 iterations that have been involved. The final result superimposed on the original image is shown in Fig. 6(f). In this experiment, the B-Snake was firstly attracted by the external force which is generated by the significant edges of object, and then the deformed shape of B-Snake was mapped to the PCA to get the most close shape in the training set. The final shape of B-Snake fitting to the ventricle boundaries is quite close. Fig. 7(a)–(l) show more contour extraction results for different MR brain images using DBM. The green contours are the initial shapes of B-Snake in each image, while the white contours denote the final results after B-Snake deformation. Please notice that the deformation from the initialization to the final result is large. Since the distribution of knot points of the shapes included in the training set has been modeled in our model, therefore, the proposed method intends to maintain the geometric shape of the B-Snake model during the shape deformation procedure. More importantly, our model doesn’t deform Snake points individually, but it deforms segments of the B-Snake at a time. It is shown that this procedure helps B-Snake to be very robust to local minima. Further more, as the external force field is generated by the edge image, the MMSE makes sure the B-Snake curve matches the object boundaries more accurately. However, as the presented B-Snake is attracted strongly by the local edges, therefore, we can’t guarantee that all global minima are found, and this will lead to some failed cases. Fig. 8 gives one example of failure of our algorithm after convergence to a local minima. We can observe that, on the bottom left of B-Snake, the B-Snake was deformed to the wrong edge. This is because the proposed B-Snake model does not use the additional information to form the capability to distinguish whether it is the desired object’s edge or not. To overcome this problem, more available information about the studied object should be used, such as intensity and texture. 5.2. Experiments on comparing DBM with the traditional snake and ASM To evaluate the robustness of DBM, a comparative study of DBM with the traditional Snake and ASM have been carried out. The results obtained are presented in this Section. Fig. 9 qualitatively compares the performance of our DBM, the traditional Snake and ASM in detecting ventricles from the MR brain images. Initialization for three different MR images are shown in Fig. 9(a). The three models are initialized with the same location and contour. Fig. 9(b) shows the final results of traditional Snake, we can observe that the traditional Snake lacks the capability to deform largely to overcome the local erroneous edges. In addition, it did not preserve any anatomical homology during the deformation. ASM is much better than the traditional Snake in preserving the model shape in the deformation procedure. This is as shown in Fig. 9(c). However, we observed that some parts of the model curve are either trapped in local minimum or not matching the desired object boundaries precisely. This is because ASM is not using the object edges information as an assistant to help it approach the object boundaries accurately. Hence, its final results in Fig. 9(c) are 1038 Y. Wang, E.K. Teoh / Image and Vision Computing 23 (2005) 1029–1040 Fig. 9. Qualitative Comparison of DBM with the traditional Snake and ASM. (a1)–(a3) indicate the initialized contours; (b1)–(b3) indicate the results of traditional Snake; (c1)–(c3) indicate the results of ASM; (d1)–(d3) indicate the results of DBM. still unsatisfactory. On the contrary, DBM achieved good results as shown in Fig. 9(d). We can observe that, in addition to preserve the model shape, DBM can match the ventricles’ boundaries more precisely. More importantly, compared to the traditional point based Snake and ASM, DBM deforms the segments of the B-Snake at a time rather than the individual points. This procedure is shown to help the B-Snake to be very robust to overcome the local minima. Table 1 gives the quantitative comparison of DBM with the traditional Snake and ASM using the results presented in Fig. 9. The average and maximum distance of the final contour marked by experts is shown Table 2, using units of Y. Wang, E.K. Teoh / Image and Vision Computing 23 (2005) 1029–1040 1039 Table 1 Quantitative comparison in term of DBM with the traditional Snake and ASM based on the results presented in Fig. 9 Traditional Snake ASM DBM Results for MR images shown in Fig. 9 (b1,c1,d1) Results for MR images shown in Fig. 9 (b2,c2,d2) Results for MR images shown in Fig. 9 (b3,c3,d3) Avg. dist. Max dist Avg. dist. Max dist Avg. dist. Max dist 8.2 4.5 2.7 23.5 10.4 6.8 6.2 3.1 2.4 20.1 8.9 6.5 5.1 3.7 2.2 19.3 16.7 5.3 Table 2 Quantitative comparison of computation time taken by DBM with the traditional Snake and ASM based on the results presented in Fig. 9 Traditional Snake ASM DBM Results for MR images shown in Fig. 9 (b1,c1,d1) (ms) Results for MR images shown in Fig. 9 (b2,c2,d2) (ms) Results for MR images shown in Fig. 9 (b3,c3,d3) (ms) 220 430 560 250 410 490 210 470 520 pixel. DBM has the least average distances at 2.2–2.7 pixels and least maximal distance at 5.3–6.8 pixels for all three examples. In view of above comparison results, DBM demonstrates its superior robustness to the rest two deformable models against of local minima. Table 2 shows the comparison of processing time of DBM with the traditional Snake and ASM based on the results presented in Fig. 9. The traditional Snake is the fastest because it uses the edge image for deformation only. DBM is slightly slower than ASM. This is because more time is required in process of MMSE. This, in term, will guarantee a more precise matching of the object’s edge by DBM. 6. Conclusion In this paper, a Dynamic B-Snake Model (DBM) is presented for complex shape segmentation. The proposed algorithm is applied for the segmentation of 2D ventricle in the MR brain images. In this model, a statistical framework is embedded in order to use the prior knowledge of the studied object. This is done by combining the PCA for modeling the shape distribution in the training samples while taking into account the case of affine transformation. Constrained by the prior geometric knowledge that reflects the shape, DBM can keep its geometric shape during the procedure of segmentation. This helps to avoid the local minima. The final positions of the control points are refined by MMSE method during the B-Snake deformation procedure. It provides fine deformation for allowing the B-Snake to accurately adapt to the desired object boundaries in the image. The experimental results show that the proposed DBM has a robust capability to describe complex object contours and can achieve more accurate object segmentation when compared to the traditional Snake and ASM. Several extensions of this approach are possible. In this paper, the main focus is on the construction of 2-D deformable model for object segmentation. However, from a 2-D image, it is not possible to fully recover the 3-D shape due to projection. Although our model can be employed to segment 3-D objects by slice-to-slice segmentations, the segmentation results will be definitely worse than those by the actual 3-D B-Surface model. Using B-Surface to construct and describe the object surface remains a challenge. Two problems need to be solved. First, how to define the network of B-Surface patches that are continuously connected at their boundaries without any overlap or leakage to form a closed object surface is a difficult task. The technique of multiple control point may be used here to form a particular shape of patch or using different orders of Bsurface patched to form one surface. A triangle patch definition has been proposed in [30]. Second, the MMSE method in this thesis is for 2-D only, but it is possible to be extended to 3-D B-Surface model for estimating the 3-D object. Here, the principle for deforming the B-Surface is the same as the B-snake model, but more calculation is expected for B-Surface deformation. References [1] M. Kass, A. Witkin, D. Terzopoulos, Snakes: active contour models, Int. J. Comput. Vis. 1 (4) (1987) 321–331. [2] E. Person, S. Fu, Shape Discrimination Using Fourier Descriptors, IEEE Trans. Syst. Man Cybern. 7 (1977) 170–179. [3] Y. Guand, T. Tjahjadi, Coarse-to-fine planar object identification using invariant curve features and B-spline modeling, Pattern Recognit. 33 (2000) 1411–1422. [4] M.J. Paulik, M. Das, K. Loh, Nonstationary autoregressive modeling of object contours, IEEE Trans. Signal Process 40 (1992) 660–675. [5] J. Flusser, Pattern recognition by a affine moment invariant, Pattern Recognit. 26 (1993) 167–174. [6] A.L.N. Fred, J.S. Marques, P.M. Jorge, Hidden Markov models vs syntactic modeling in object recognition, in: Proceedings of International on Image Processing (ICIP’97), 1997, pp. 893– 896. 1040 Y. Wang, E.K. Teoh / Image and Vision Computing 23 (2005) 1029–1040 [7] Z. Huang, F.S. Cohen, Affine-invariant B-spline moments for curve matching, IEEE Trans. Pattern Anal. Machine Intell. 5 (1996) 10. [8] Q.M. Tueng, W.W. Boles, Wavelet-based affine invariant representation: a tool for recognizing planar objects in 3D space, IEEE Trans. PAMI 19 (1997) 846–857. [9] A.K. Klein, F. Lee, A.A. Amini, Quantitative coronary angiography with deformable spline models, IEEE Trans. Med. Imaging 16 (5) (1997). [10] A.M. Baumberg, D.C. Hogg, Learning flexible models from image sequences, Eur. Conf. Comput. Vis.’94 1 (1994) 299–308. [11] T.F. Cootes, C.J. Taylor, D.H. Cooper, J. Graham. Training models of shape from sets of examples, in: Proceedings of British Machine Vision Conference,1992, pp. 9–18. [12] T.F. Cootes, C.J. Tayor, D.H. Cooper, J. Graham, Active shape models - their training and application, Comput. Vis. Image Understanding 61 (1) (1995) 38–59. [13] T.F. Cootes, A. Hill, C.J. Taylor, J. Haslam, The use of active shape models for locating structures in medical images, Image Vis. Comput. 12 (6) (1994) 355–366. [14] P.P. Smyth, C.J. Taylor, J.E. Adams, Automatic measurement of vertebral shape using active shape models, in: Proceedings of British Machine Vision Conference, 1996, pp. 705–714. [15] S. Solloway, C.J. Taylor, C.E. Hutchinson, J.C. Waterton, The use of active shape models for making thickness measurements from MR images, in: Proceedings of 4th European Conference on Computer Vision, 1996, pp. 400–412. [16] G. Behiels, D. Vandermeulen, F. Maes, P. Suetens, P. Dewaele, Active Shape Model-Based Segmentation of Digital X-Ray Images Lecture Notes in Computer Science, Springer-Verlag, Germany, 1999. MICCAI ’99, pp. 128–137. [17] F. Vogelsang, M. Kohnen, J. Mahlke, F. Weiler, M.W. Kilbinger, B. Wein, R.W. Günther, Model based analysis of chest radiographs, Proc. SPIE 3979 (2000) 1040–1052. [18] G. Hamarneh, T. Gustavsson, Combining Snakes and Active Shape Models for Segmenting the Human Left Ventricle in Echocardiographic Images Computers in Cardiology, vol. 27, IEEE, Piscataway, NJ, 2000. pp. 115–118. [19] T. Stammberger, S. Rudert, M. Michaelis, M. Reiser, H. Englmeier, Segmentation of MR Images with B-Spline Snakes: A Multi-resolution Approach Using the Distance Transformation for Model Forces CEUR Workshop Proceedings, vol. 12 1998. [20] A.K. Jain, Unsupervised contour representation and estimation using B-splines and a minimum description length criterion, IEEE Trans. Image Process 9 (6) (2000) 1075–1087. [21] Y. Wang, E.K. Teoh, D. Shen, Lane Detection Using B-Snake, in: IEEE International Conference on Information, Intelligence and Systems (ICIIS’99), Washington, DC, November 1–3, 1999. [22] Y. Wang, D. Shen, E.K. Teoh, A novel lane model for lane boundary detection, in IAPR workshop on machine vision applications, 1998, pp. 27–30. [23] Y. Wang, D. Shen, E.K. Teoh, Lane detection and tracking using B-snake, Image Vis. Comput. 22 (4) (2004) 269–280. [24] Y. Wang, 2D object segmentation and matching using B-spline model, PhD thesis submitted to Nanyang Technological University, Singapore, May 2004. [25] H.S. Horace, D. Shen, An affine-invariant active contour model (AISnake) for model-based segmentation, Image and Vis. Comput. 16 (2) (1998) 135–146. [26] Y. Wang, E.K. Teoh, D. Shen, Structure-adaptive B-snake for segmenting complex objects, in: International Conference on Image Processing (ICIP 2001), Thessaloniki, Greece, Oct 7–10, 2001, pp. 769–772. [27] C. Xu, J.L. Prince, Snakes, shapes, and gradient vector flow, IEEE Transactions on Image Processing, March 1998, pp. 359–369. [28] Dunteman, H. George, Principal Components Analysis, Sage Publications, Newbury Park, 1989. [29] S. Buchin, Affine Differential Geometry, Science Press, Beijing, China, 1983. [30] M. Eck, H. Hoppe, Automatic Reconstruction of B-Spline Surfaces of Arbitrary Topological Type, Technical Report MSR-TR-96-01, Microsoft Corporation, January 1996.
© Copyright 2026 Paperzz