The image segmentation problem is concerned with partitioning an image into multiple regions according to some homogeneity criterion. This article is primarily concerned with graph theoretic approaches to image segmentation.

File:Aenep11.png
Figure 1: Input image.
File:Aenep14.png
Figure 4: Third partition.
File:Aenep16.png
Figure 6: Fifth partition.
File:Aenep17.png
Figure 7: Sixth partition.
File:Aenep23.png
Figure 10: Segmentation results.

Applications of Image Segmentation

edit
  • Image Compression
    • Segment the image into homogeneous components, and use the most suitable compression algorithm for each component to improve compression.
  • Medical Diagnosis
    • Automatic segmentation of MRI images for identification of cancerous regions.
  • Mapping and Measurement
    • Automatic analysis of remote sensing data from satellites to identify and measure regions of interest.

Segmentation using Normalized Cuts

edit

Graph theoretic formulation

edit

The set of points in an arbitrary feature space can be represented as a weighted undirected complete graph G = (V, E), where the nodes of the graph are the points in the feature space. The weight   of an edge   is a function of the similarity between the nodes   and  . In this context, we can formulate the image segmentation problem as a graph partitioning problem that asks for a partition   of the vertex set  , where, according to some measure, the vertices in any set   have high similarity, and the vertices in two different sets   have low similarity.

Normalized Cuts

edit

Let G = (V, E) be a weighted graph. Let   and   be two subsets of vertices.

Let:

  
  
  

In the normalized cuts approach[1], for any cut   in  ,   measures the similarity between different parts, and   measures the total similarity of vertices in the same part.


Since  , a cut   that minimizes   also maximizes  .

Computing a cut   that minimizes   is an NP-hard problem. However, we can find in polynomial time a cut   of small normalized weight   using spectral techniques.

The Ncut Algorithm

edit

Let D be an   diagonal matrix with   on the diagonal, and let   be an   symmetrical matrix with  .

After some algebraic manipulations, we get:

  

subject to the constraints:

  •  , for some constant  
  •  

Minimizing   subject to the constraints above is NP-hard. To make the problem tractable, we relax the constraints on  , and allow it to take real values. The relaxed problem can be solved by solving the generalized eigenvalue problem  for the second smallest generalized eigenvector.

The partitioning algoritm:

  1. Given a set of features, set up a weighted graph  , compute the weight of each edge, and summarize the information in   and  .
  2. Solve   for eigenvectors with the smallest eigenvalues.
  3. Use the eigenvector with the smallest eigenvalue to bipartition the graph.
  4. Decide if the current partition should be subdivided.
  5. Recursively partition the segmented parts, if necessary.

Example

edit

Figures 1-7 exemplify the Ncut algorithm.

Limitations

edit

Solving a standard eigenvalue problem for all eigenvectors (using the QR algorithm, for instance) takes   time. This is impractical for image segmentation applications where   is the number of pixels in the image.


OBJ CUT

edit
 This section is under major construction.


OBJ CUT[2] is an efficient method that automatically segments an object. The OBJ CUT method is a generic method, and therefore it is applicable to any object category model. Given an image D containing an instance of a known object category, e.g. cows, the OBJ CUT algorithm computes a segmentation of the object, that is, it infers a set of labels m.

Let m be a set of binary labels, and let   be a shape parameter(  is a shape prior on the labels from a Layered Pictorial Structure (LPS) model). We define an energy function   as follows.

    (1)


The term   is called an unary term, and the term   is called a pairwise term. An unary term consists of the likelihood   based on color, and the unary potential   based on the distance from  . A pairwise term consists of a prior   and a contrast term  .

The best labeling   minimizes  , where   is the weight of the parameter  .

    (2)

The OBJ CUT algorithm

edit
  1. Given an image D, an object category is chosen, e.g. cows or horses.
  2. The corresponding LPS model is matched to D to obtain the samples  
  3. The objective function given by equation (2) is determined by computing   and using  
  4. The objective function is minimized using a single MINCUT operation to obtain the segmentation m.


Example

edit

Figures 8-11 exemplify the OBJ CUT algorithm.

Other approaches

edit
  • Jigsaw approach[3]
  • Image parsing [4]
  • Interleaved segmentation [5]
  • LOCUS [6]
  • LayoutCRF [7]

References

edit
  1. ^ Jianbo Shi and Jitendra Malik (1997): "Normalized Cuts and Image Segmentation", IEEE Conference on Computer Vision and Pattern Recognition, pp 731-737
  2. ^ M. P. Kumar, P. H. S. Torr, and A. Zisserman. Obj cut. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, San Diego, pages 18-25, 2005.
  3. ^ E. Borenstein, S. Ullman: Class-specic, top-down segmentation. In Proceedings of the 7th European Conference on Computer Vision, Copenhagen, Denmark, pages 109-124, 2002.
  4. ^ Z. Tu, X. Chen, A. L. Yuille, S. C. Zhu: Image Parsing: Unifying Segmentation, Detection, and Recognition. Toward Category-Level Object Recognition 2006: 545-576
  5. ^ B. Leibe, A. Leonardis, B. Schiele: An Implicit Shape Model for Combined Object Categorization and Segmentation. Toward Category-Level Object Recognition 2006: 508-524
  6. ^ J. Winn, N. Joijic. Locus: Learning object classes with unsupervised segmentation. In Proceedings of the IEEE International Conference on Computer Vision, Beijing, 2005.
  7. ^ J. M. Winn, J. Shotton: The Layout Consistent Random Field for Recognizing and Segmenting Partially Occluded Objects. CVPR (1) 2006: 37-44