首页 Gaussian Mixture based image segmentation algorithm

Gaussian Mixture based image segmentation algorithm

Gaussian Mixture based image segmentation algorithm Pergamon Pattern Recognition, Vol. 31, No. 3, pp. 315 325, 1998 © 1997 Pattern Recognition Society. Published by Elsevier Science Ltd Printed in Great Britain. All rights reserved 0031-3203/98517.00 + .00 PII: S0031-3203 (97) 00045-9 A GAUSSIAN-MIXT...

Pergamon Pattern Recognition, Vol. 31, No. 3, pp. 315 325, 1998 © 1997 Pattern Recognition Society. Published by Elsevier Science Ltd Printed in Great Britain. All rights reserved 0031-3203/98517.00 + .00 PII: S0031-3203 (97) 00045-9 A GAUSSIAN-MIXTURE-BASED IMAGE SEGMENTATION ALGORITHM LALIT GUPTA and THOTSAPON SORTRAKUL Department of Electrical Engineering, Southern Illinois University, Carbondale, IL 62901, U.S.A. (Received 12 October 1995; accepted 24 May 1996) Abstract--This paper focuses on the formulation, development, and evaluation of an autonomous segmen- tation algorithm which can segment targets in a wide class of highly degraded images. A segmentation algorithm based on a Gaussian-mixture model of a two-class image is selected because it has the potential for effective segmentation provided that the histogram of the image approximates a Gaussian mixture and the parameters of the model can be estimated accurately. A selective sampling approach based on the Laplacian of the image is developed to transform the histogram of any image into an approximation of a Gaussian mixture and a new estimation method which uses information derived from the tails of the mixture density is formulated to estimate the model parameters. The resulting selective-sampling-Gaus- sian-mixture parameter-estimation segmentation algorithm is tested and evaluated on a set of real degraded target images and the results show that the algorithm is able to accurately segment diverse images. © 1997 Pattern Recognition Society. Published by Elsevier Science Ltd. Segmentation Histogram Thresholding Gaussian mixture 1. INTRODUCTION This paper focuses on developing an autonomous segmentation algorithm which can segment targets in a wide class of degraded images. The autonomous segmentation of targets is one of the most complex problems in automatic target recognition (ATR) as the environment of the target is not controllable. Segmentation in the context of ATR problems is the process of separating a target from the background as accurately as possible. Because the subsequent stages of the ATR system depend upon the information provided by the segmentation stage, reducing segmen- tation error clearly reduces the probability of target misclassification. A multitude of image segmentation algorithms have been developed based on grey level thresholding, 11-13~ edge detection, ~11-16) region grow- ing, m'12'tT-19) region splitting and merging,/11,12,2°-22) re- laxation,~11,12,23 25~ and fuzzy set theory313'26-28) It is well known in image segmentation research that there is no single algorithm that can effectively segment all types of images. Additionally, the application of differ- ent algorithms to the same image generally produce different segmentation results. The selection of an appropriate segmentation algorithm depends largely on the type of images and the application areas. One approach is to select/design an effective algorithm for a class of images generated by a particular sensor. This approach is especially suitable for segmenting images generated in problems such as industrial in- spection where the environment is controllable. The drawback to this approach when applied to segment- ing images in problems such as ATR is that there is no control of the environment. The images, therefore, have varying characteristics and the results of seg- menting some images may be seriously incorrect. Since the performance of the segmentation stage has a major influence on the performance of the ATR system, the goal is to focus on the formulation of an autonomous segmentation algorithm capable of accu- rately segmenting a wide class of highly degraded images. The segmentation algorithm developed in this pa- per is based on a parametric model in which the probability density function of the gray levels in the image is a mixture of two Gaussian density functions. This model has received considerable attention in the development of segmentation algorithms and it has been noted that the performance is influenced by the shape of the image histogram and the accuracy of the estimates of the model parameters. The model-based segmentation algorithms perform poorly if the histo- gram of an image is a poor approximation of a mix- ture of two Gaussian density functions. It is especially poor when the two modes of the mixture density are not well-separated. The application of this model in image segmentation is, therefore, highly limited as the histograms of the images encountered in practical problems are seldom approximations of Gaussian mixtures with well-defined modes. Additionally, no method has been proposed to broaden the application of Gaussian-mixture-based segmentation to images that have histograms which are poor approxima- tions of Gaussian mixtures. In order to broaden the application, a selective sampling method is pro- posed to transform the histogram of any image into 315 316 L. GUPTA and T. SORTRAKUL a histogram which approximates a Gaussian mixture. A new method is also proposed to accurately estimate the model parameters by first estimating the two Gaussian components of the mixture density and then estimating the parameters of the model from the two components. 2. THE GAUSSIAN MIXTURE MODEL FOR IMAGE SEGMENTATION Gaussian-mixture-based segmentation is based on histogram thresholding which is one of the most fre- quently used methods for segmenting images. In his- togram thresholding, it is assumed that an image consists of two regions or classes: target and back- ground, with each region having a uni-modal gray-level distribution. The segmentation problem, therefore, involves determining a suitable threshold to partition the image into the target and background regions. Gaussian-mixture segmentation algorithms assume a parametric model in which the probability density function (PDF) of the gray levels in the image are a mixture of two Gaussian density functions with given means, standard deviations, and proportions. That is, it is assumed that an image with the target and background intensities is combined with Gaus- sian noise. The mixture probability density function of such an image can be written as ~t21 p(z) = (P1)p(z/C O + (Pz)p(z/C2), where p(z/Ci) = (2~0./2) - 1/2exp [ - (z - pi)2/(20.2)], mixture component i, i = 1, 2 pi = mean of intensity level i, i = 1, 2 0.i = standard deviation of intensity level i, i = 1, 2 Pi = prior probability of intensity level i, i = 1, 2; (P1 + P2 = 1). For a given threshold T, the overall probability of error is E(T) = PxEI(T) + PIE2(T), where and Ea(T)=f~: ,p (z /Cz )dz E2(T) = ]~jr p(z/C1)dz. The threshold value for which E(T) is minimum is given by solving AT 2 + BT + C=O, (1) where A = o , 2 - 0 .~, B = 2(pla 2 - p2a~), 20.2 ,20.2 C = t~,2 - - IA1 2 q- 2 r r2a 2 ln(0.2P1/alP2). If the five model parameters P1 (or P2),/~1,1~2, 0.1, and 0"2 are known, the optimum threshold can be deter- mined arlalytically from equation (1), however, if the parameters are unknown, they have to be estimated from the gray-level histogram of the image. Several image segmentation algorithms based on estimating the parameters of the Gaussian mix- ture using curve-fitting techniques, (1'3) truncated histograms,16, 71 and the E-M algorithm I1°1 have been proposed. The algorithms based on curve-fitting techniques use a goodness-of-fit criterion through a conjugate gradient hill-climbing procedure to fit histograms with bimodal Gaussian density curves. The drawback with the curve-fitting methods is that they are computationally involved. The drawback with the iterative methods which estimate the model parameters from truncated histograms is that the esti- mation of parameters is biased even if the optimal threshold is selected. An improvement in estimating the variances when a threshold in an iteration is away from the overlapping regions of the mixture compo- nents is proposed in reference (7), however, the para- meters are otherwise estimated as described in refer- ence (6). The drawback of the E-M-based algorithm is that it tends to give poor estimates for small sample sizes and the time for convergence can be extremely high. Additionally, the E M algorithm fails to con- verge if one or both variances approach zero as in the cases of targets and backgrounds with uniform intensities. The performance of Gaussian-mixture-based seg- mentation algorithms clearly depend on how well the histogram of an image approximates a Gaussian mix- ture and the accuracy of the estimates of the model parameters. The histograms of real images seldom approximate a Gaussian mixture and none of the algorithms described propose methods to broaden the application of Gaussian-mixture-based segmentation to images with histograms that are a poor approxima- tion of a Gaussian mixture. The focus of the segmen- tation algorithm developed in this paper, therefore, is to broaden the application of Gaussian mixture segmentation algorithms through a histogram transformation approach and to accurately estimate the mixture density parameters through a method specifically designed to estimate the mixture density parameters from the transformed histogram. The formulations of the selective sampling algorithm for histogram transformation, the Gaussian-mixture parameter-estimation algorithm, and the selective- sampling-Gaussian-mixture parameter-estimation image segmentation algorithm are described in the following sections. Gaussian-mixture-based image 317 3. THE SELECTIVE SAMPLING (SS) ALGORITHM Several histogram modification methods for image segmentation have been proposed to facilitate the detection of the bottom of the valley between two modes by producing a transformed gray-level histo- gram which is more bimodal, or has a deeper valley between the modes, or converting the valley between the modes into a peak. (5' 12) The approach developed in this paper focuses on selecting a set of pixels which are typical of the target and the background. Only the selected pixels contribute to the transformed histo- gram. The pixels selected are the set of pixels which most likely fall on the two sides of the boundary of a target. The Laplacian L[f(x, y)] of the imagef(x, y) is used to identify and select such pixels. 12 The La- placian is a second-derivative operator which gives high values to pixels on both sides of the edge (modelled as a step or a ramp function) and low- values to all other pixels. Pixels on the bright side of an edge have a negative Laplacian whereas pixels on the dark side of an edge have a positive Laplacian. The histogram of the selected non-zero Laplacian pixels will, therefore, be bi-modal with the 2 modes representing typical target and background gray- levels and the valley between the two peaks would also be well-defined. Additionally, the two peaks will be approximately equal in size because the number of pixels on both sides of the boundary would be ap- proximately equal even if the target and the back- ground have unequal areas. The Laplacian of a continuous function g(x, y) is defined as L[g(x, y)] = [(02g)/Ox 2) q-(a2g)/c~y2)]. For a discrete imagef(x, y), the Laplacian at (x, y) can be approximated by L[f(x, y)] =f(x - 1,y) +f(x + 1, y) +f(x, y - I) +f(x, y + 1) - 4[f(x, y)]. The Laplacian appears to be ideal for selecting pixels to generate a well-defined bi-modal histogram as it produces values with differing signs at each edge, thus, facilitating the identification of target and back- ground pixels. Rules based on the sign and magnitude of the Laplacian can be easily developed to identify the target and background pixels whether an edge is modeled as a step or a ramp. The problem with the Laplacian is that as a sec- ond-order derivative, it is highly sensitive to noise and produces high values not only at the boundary of the target but also in any region where sharp gray-level transitions occur. Therefore, the applicability of this method to highly degraded images is questionable. To overcome this problem, the pixel selection rule de- veloped is based on using the Laplacian to accurately locate edges in noise and using the connectivity of the pixels to exclude pixels that may be noise pixels. Without a loss of generality, it will be assumed that an image is composed of a bright target on a dark back- ground. In the first step, the sign transition in the Laplacian is used to determine which type of edge is encountered when the image rows and columns are scanned. A transition from plus to minus indicates a step edge transition from a background pixel to a target pixel whereas a transition from minus to plus indicates a step edge transition from a target pixel to a background pixel. Ramp edge transitions have sim- ilar sign transitions in the Laplacian along with a zero (or zeros) between the sign changes. Figure l(a) illus- trates the Laplacian computed for step and ramp edge models. The gray level is denoted by f(x), the La- placian by L(x), and the location of the pixels selected are underlined. Note that the average value of the Laplacian is zero at an edge transition. As it is impos- sible to characterize all possible distortions which may occur in edges, only edges which are neither steps nor ramps but are monotonically increasing or de- creasing are considered as distorted edges. A set of rules are developed to compute a more accurate La- placian should such distortions occur in the edges. By first detecting an edge and then combining adjacent Laplacians, the average value L(x) of the Laplacian at an edge transition can be made close to zero as shown by the underlined values in Fig. l(b). The next step involves excluding pixels with low Laplacian magni- tudes by comparing them with an empirically deter- mined threshold. Such pixels typically result from noise and are not edge pixels. In the final step, the connectivity of the remaining pixels is determined by examining the eight-neighborhood of each pixel in order to exclude isolated pixels as the goal is to retain only boundary edge pixels which are generally con- nected to each other. The detailed steps of the selec- tive sampling algorithm are described below: Step 1: Computation of the Laplacian of the image. Compute the Laplacian L[f(x,y)] of the image f(x, y). Let the symbols 0, +, and - represent the zero, positive and negative Laplacian values of L[f(x, y)] and let nO, n +, and n - represent a string of n, (n > 2) continuous zeros, positive, and negative Laplacians. Step 2: Detection of all edges in the image. Scan the rows and columns of L If(x, y)] and use the following rules to detect edges: (a) Noise-free step edge detection A noise-free step edge is characterized by the se- quence: [. . . 0,0,( +, - ), 0,0 ... ] or [ . . .0 ,0 , ( - , +) ,0 ,0 ...3. (b) Noise-free ramp edge detection A noise-free ramp edge is characterized by the sequence: [. . . 0, 0,( +, n0, - ) ,0 ,0 ... ] or [ . . . 0 ,0 , ( - ,n0 , +) ,0 ,0 ...3 318 L. GUPTA and T. SORTRAKUL f(x) L(x) 06 6 -6 _( 0036 6600 030-3 0 -660 (a) Noise-flee step and ramp edges. 6 6 3 0 0 -503 f(x) L(x) Ll(X) 016 1 4-5 50-5 J -L_ 6 0 0 1 6 6 6 1 0 6 6 1 0 0 014 -5 0 -541 0 -5 41 0 0 5 0 -5 0--5 0 _5 0 -5 0 5 f(x) L(x) Ll(x) 0056 0056 6500 665 00 05-4 -1 05-41 -1 -4 50 0 -1 -450 050-5 050-5 -5050 0 -505__0 (b) Distorted edges. Fig. 1. The Laplacian computed for noise-free and distorted edges. (c) Detection of monotonically increasing/decreasing edges A monotonically increasing edge is characterized by the sequence: [... O,O,(n +,kO, m - ),0,0 .... ] and a monotonically decreasing edge is character- ized by [ .... 0,0,(m - ,k0, n +), 0,0 . . . . ]. If a monotonically increasing edge is detected, the Laplacian is updated by summing the n positive and m negative Laplacians and storing them at the first occurrence and last occurrence of positive and nega- tive values, respectively. All other Laplacians are set to zero. Similarly, for the monotonically decreasing edge, the Laplacian is updated by summing the m negative and n positive Laplacians and storing them at the first occurrence and last occurrence of negative and positive values, respectively. It is also possible to have sequences of zeros between a string of positive or negative Laplacians. For this case also, the strings of positive and negative Laplacians are sum- med and stored at the extreme locations. Step 3: Exclusion of small edges. Generate an image .fl (x, y) according to f l (x ,y )= ~1 if ,L[f(x,y)], >Lo, otherwise. Lo is an empirically determined threshold. Step 4: Removal of isolated disconnected pixels. Gen- erate the transformed image f2 (x, y) containing only the selected pixels according to 0 if [ f l (x, y) = 0] or [f l (x, y) = 1 fz(x,y) = and ~f l (x,y) < 2], 8N. f(x, y) otherwise. 8N denotes the set of 8-connected pixels of the pixel at (x, y). Gaussian-mixture-based image 319 Step 5: Histogram off(x, y). Compute the histogram hE (z) off2 (x, y). It is important to note that f2(x, y) is an image which contains the gray levels of the pixels selected from the original imagef(x, y). Given that, typically, only pixels on both sides of an edge transition are selected, the number of pixels used in computing the transformed histogram will be relatively small. The means of the target and background pixels can be expected to be well separated and the proportion of target and background pixels will be approximately equal irrespective of the target and background sizes. t p(z)LITj ~ , ~ p(z)RITj ~.~ P2p(z/C2) PIp(z/CI) hITj @v i Tj ~zlTj ~21Tj z Fig. 2. Illustration of the mixture density, components, and tails. 4. THE GAUSSIAN-MIXTURE PARAMETER ESTIMATION (GMPE) ALGORITHM The parameter-estimation algorithm focuses on the tails of the mixture density to first estimate the two Gaussian components of the mixture density and then estimate the parameters of the model from the two components. For a given threshold T j, the notations used in the formulation are as follows: p(z) = mixture density p(z/Ci) = mixture component i, i = 1, 2 Pip(z/Ci) = weighted mixture component i, i = 1, 2 p(z)L[Tj = left truncated component p(z)R[Tj = right truncated component Li] Tj = left tail of weighted component Pip(z/Ci), i, i = 1, 2 R~[Tj = right tail of weighted component Pip(z/Ci), i, i = 1, 2. The mixture density, the components, and the tails are illustrated in Fig. 2. The parameter estimation is done iteratively, and each iteration consists of three estimation steps. For a given threshold Tj, the para- meters ]21 and ]22 are estimated initially using the truncated components p(Z)L[ Tj and p(Z)R[ T~. The next step involves estimating the weighted compo- nents P1 P(z/C1) and P2 P (z/C2) and the proportions P~ and P2. Finally, the threshold is estimated from the parameters of the estimated mixture components p(z/C1)l Tj and p(z/C2)[Tj. In the formulation of the parameter-estimation algorithm, it is assumed that the contribution of tail L21Tj is negligible in tail L1 [ Tj and that tail R1 [ Tj is negligible in tail R2I Tj. Then, using the symmetrical property of the Gaussian function, the tails R~[Tj and L2[ Tj can be estimated from the tails of the mixture p(z). The detailed steps of the estimation algorithm are described next. Step 0: Selection of initial threshold To. Let the initial estimate of the optimum threshold To be ]20, where ]20 is the mean of p(z). Step 1: Estimation of

                    本文档为【Gaussian Mixture based image segmentation algorithm】，请使用软件OFFICE或WPS软件打开。作品中的文字与图均可以修改和编辑，
                    图片更改请在作品中右键图片并更换，文字修改请直接点击文字进行修改，也可以新增和删除文档中的内容。 
 该文档来自用户分享，如有侵权行为请发邮件ishare@vip.sina.com联系网站客服，我们会及时删除。

                    [版权声明] 本站所有资料为用户分享产生，若发现您的权利被侵害，请联系客服邮件isharekefu@iask.cn，我们尽快处理。

                    本作品所展示的图片、画像、字体、音乐的版权可能需版权方额外授权，请谨慎使用。

                    网站提供的党政主题相关内容(国旗、国徽、党徽..)目的在于配合国家政策宣传，仅限个人学习分享使用，禁止用于任何广告和商用目的。
                

下载需要：免费已有0 人下载

立即下载

Gaussian Mixture based image segmentation algorithm

你可能还喜欢