Back to the Gradient Vector Flow Page

List of Citations from Science Citation Index for

T. F. Cootes, C. J. Taylor, D. H. Cooper, et al., " Active Shape Models - Their Training and Application," Computer Vision and Image Understanding, 61(1): 38-59, January 1995.

1995: 1  1996: 10  1997: 32  1998: 19  1999: 23  2000: 34  2001: 21  2002: 1  

  Total citations: 141

As of 28 Jan 2002

By Year - By Citations - By Year with Abstract

 
1995

  1. SOZOU, PD, COOTES, TF, TAYLOR, CJ, and DIMAURO, EC, "NONLINEAR GENERALIZATION OF POINT DISTRIBUTION MODELS USING POLYNOMIAL REGRESSION," IMAGE AND VISION COMPUTING, vol. 13, pp. 451-457, 1995.

    Abstract:   We have previously described how to model shape variability by means of point distribution models (PDM) in which there is a linear relationship between a set of shape parameters and the positions of points on the shape. This linear formulation can fail for shapes which articulate or bend. We show examples of such failure for both real and synthetic classes of shape. A new, more general formulation for PDMs, based on polynomial regression, is presented. The resulting polynomial regression PDMs (PRPDM) perform well on the data for which the linear method failed.

 
1996

  1. Denzler, J, and Niemann, H, "3D data driven prediction for active contour models based on geometric bounding volumes," PATTERN RECOGNITION LETTERS, vol. 17, pp. 1171-1178, 1996.

    Abstract:   Active contour models have proven to be a promising approach for data driven object tracking without knowledge about the problem domain and the object. Problems arise in case of fast moving objects and in natural scenes with heterogeneous background. In these cases, a prediction step is an essential part of the tracking mechanism. In this paper we describe a new approach for modelling the contour of moving objects in the 3D world. The key point is the description of moving objects by a simplified geometric model, the sc-called bounding volume. The contour of moving objects is predicted by estimating the movement and the shape of the bounding volume in the 3D world and by projecting its contour to the image plane. Stochastic optimization algorithms are used to estimate shape parameters of the bounding volume. The 2D contour of the bounding volume is used to initialize the active contour, which then extracts the contour of the moving object. Thus, an update of the motion and model parameters of the bounding volume is possible. No task specific knowledge and no a priori knowledge about the object is necessary. We will show that in the case of convex polyhedral bounding volumes, this method can be applied to real-time closed-loop object tracking on standard Unix workstations. Furthermore, we present experiments which prove that the robustness for tracking moving objects in front of a heterogeneous background can be improved.

  2. Girard, S, Dinten, JM, and Chalmond, B, "Building and training radiographic models for flexible object identification from incomplete data," IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, vol. 143, pp. 257-264, 1996.

    Abstract:   The authors address the problem of identifying the projection of an object from incomplete data extracted from its radiographic image. They assume that the unknown object is a particular sample of a flexible object. Their approach consists first in designing a deformation model able to represent and to simulate a great variety of samples of the flexible object radiographic projection. This modellisation is achieved using a training set of complete data. Then, given the incomplete data, the identification task consists in estimating the observed object using the deformation model. The proposed modelling extracts from the training set, not only the deformation modes, but also other relevant information (such as probability distributions on the deformations, relations between deformations) to use it to regularise the identification step.

  3. Abrantes, AJ, and Marques, JS, "Class of constrained clustering algorithms for object boundary extraction," IEEE TRANSACTIONS ON IMAGE PROCESSING, vol. 5, pp. 1507-1521, 1996.

    Abstract:   Boundary extraction is a key task in many image analysis operations. This paper describes a class of constrained clustering algorithms for object boundary extraction that includes several well-known algorithms proposed in different fields (deformable models, constrained clustering, data ordering, and traveling salesman problems), The algorithms belonging to this class are obtained by the minimization of a cost function with two terms: a quadratic regularization term and an image-dependent term defined by a set of weighting functions, The minimization of the cost function is achieved by lowpass filtering the previous model shape and by attracting the model units toward the centroids of their attraction regions, To define a new algorithm belonging to this class, the user has to specify a regularization matrix and a set of weighting functions that control the attraction of the model units toward the data, The usefulness of this approach is twofold: It provides a unified framework for many existing algorithms in pattern recognition and deformable models, and allows the design of new recursive schemes.

  4. Nastar, C, and Ayache, N, "Frequency-based nonrigid motion analysis: Application to four dimensional medical images," IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, vol. 18, pp. 1067-1079, 1996.

    Abstract:   We present a method for nonrigid motion analysis in time sequences of volume images (4D data). In this method, nonrigid motion of the deforming object contour is dynamically approximated by a physically-based deformable surface. In order to reduce the number of parameters describing the deformation, we make use of a modal analysis which provides a spatial smoothing of the surface. The deformation spectrum, which outlines the main excited modes, can be efficiently used for deformation comparison. Fourier analysis on time signals of the main deformation spectrum components provides a temporal smoothing of the data. Thus a complex nonrigid deformation is described by only a few parameters: the main excited modes and the main Fourier harmonics. Therefore, 4D data can be analyzed in a very concise manner. The power and robustness of the approach is illustrated by various results on medical data. We believe that our method has important applications in automatic diagnosis of heart diseases and in motion compression.

  5. Chakraborty, A, Staib, LH, and Duncan, JS, "Deformable boundary finding in medical images by integrating gradient and region information," IEEE TRANSACTIONS ON MEDICAL IMAGING, vol. 15, pp. 859-870, 1996.

    Abstract:   Accurately segmenting and quantifying structures is a key issue in biomedical image analysis. The two conventional methods of image segmentation, region-based segmentation, and boundary finding, often suffer from a variety of limitations. Here we propose a method which endeavors to integrate the two approaches in an effort to form a unified approach that is robust to noise and poor initialization. Our approach uses Green's theorem to derive the boundary of a homogeneous region- classified area in the image and integrates this with a gray level gradient-based boundary finder. This combines the perceptual notions of edge/shape information with gray level homogeneity. A number of experiments were performed both on synthetic and real medical images of the brain and heart to evaluate the new approach, and it is shown that the integrated method typically performs better when compared to conventional gradient-based deformable boundary finding. Further, this method yields these improvements with little increase in computational overhead, an advantage derived from the application of the Green's theorem.

  6. Mitiche, A, and Bouthemy, P, "Computation and analysis of image motion: A synopsis of current problems and methods," INTERNATIONAL JOURNAL OF COMPUTER VISION, vol. 19, pp. 29-55, 1996.

    Abstract:   The goal of this paper is to offer a structured synopsis of the problems in image motion computation and analysis, and of the methods proposed, exposing the underlying models and supporting assumptions. A sufficient number of pointers to the literature will be given, concentrating mostly on recent contributions. Emphasis will be on the detection, measurement and segmentation of image motion. Tracking, and deformable motion isssues will be also addressed. Finally, a number of related questions which could require more investigations will be presented.

  7. Subsol, G, Thirion, JP, and Ayache, N, "Application of an automatically built 3D morphometric brain atlas: Study of cerebral ventricle shape," VISUALIZATION IN BIOMEDICAL COMPUTING, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1131, pp. 373-382, 1996.

    Abstract:   In this paper we present new results on the automatic building of a 3D morphometric brain atlas from volumetric MRI images and its application to the study of the shape of cerebral structures. In particular, we show how it is possible to define ''abnormal'' deformations of the cerebral ventricles with a small set of parameters.

  8. Hill, A, Cootes, TF, and Taylor, CJ, "Active shape models and the shape approximation problem," IMAGE AND VISION COMPUTING, vol. 14, pp. 601-607, 1996.

    Abstract:   Active Shape Models (ASM) use an iterative algorithm to match statistically defined models of known but variable objects to instances in images. Each iteration of ASM search involves two steps: image data interrogation and shape approximation. Here we consider the shape approximation step in detail. We present a new method of shape approximation which uses directional constraints. We show how the error term for the shape approximation problem can be extended to cope with directional constraints, and present iterative solutions to the 2D and 3D problems. We also present an efficient algorithm for the 2D problem in which a modification of the error term permits a closed-form approximate solution which can be used to produce starting estimates for the iterative solution.

  9. Cootes, TF, DiMauro, EC, Taylor, CJ, and Lanitis, A, "Flexible 3D models from uncalibrated cameras," IMAGE AND VISION COMPUTING, vol. 14, pp. 581-587, 1996.

    Abstract:   We describe how to build statistically-based flexible models of the 3D structure of variable objects, given a training set of uncalibrated images. We assume that for each example object there are two labelled images taken from different viewpoints. From each image pair a 3D structure can be reconstructed, up to either an affine or projective transformation, depending on which camera model is used. The reconstructions are aligned by choosing the transformations which minimise the distances between matched points across the training set. A statistical analysis results in an estimate of the mean structure of the training examples and a compact parameterised model of the variability in shape across the training set. Experiments have been performed using pinhole and affine camera models. Results are presented for both synthetic data and real images.

  10. Christensen, GE, Rabbitt, RD, and Miller, MI, "Deformable templates using large deformation kinematics," IEEE TRANSACTIONS ON IMAGE PROCESSING, vol. 5, pp. 1435-1447, 1996.

    Abstract:   A general automatic approach is presented for accommodating local shape variation when mapping a two-dimensional (2-D) or three-dimensional (3-D) template image into alignment with a topologically similar target image, Local shape variability is accommodated by applying a vector-field transformation to the underlying material coordinate system of the template while constraining the transformation to be smooth (globally positive definite Jacobian), Smoothness is guaranteed without specifically penalizing large-magnitude deformations of small subvolumes by constraining the transformation on the basis of a Stokesian limit of the fluid-dynamical Navier-Stokes equations, This differs fundamentally from quadratic penalty methods, such as those based on linearized elasticity or thin-plate splines, in that stress restraining the motion relaxes over time allowing large-magnitude deformations, Kinematic nonlinearities are inherently necessary to maintain continuity of structures during large-magnitude deformations, and are included in all results, After initial global registration, final mappings are obtained by numerically solving a set of nonlinear partial differential equations associated with the constrained optimization problem, Automatic regridding is performed by propagating templates as the nonlinear transformations evaluated on a finite lattice become singular, Application of the method to intersubject registration of neuroanatomical structures illustrates the ability to account for local anatomical variability.

 
1997

  1. Luettin, J, and Thacker, NA, "Speechreading using probabilistic models," COMPUTER VISION AND IMAGE UNDERSTANDING, vol. 65, pp. 163-178, 1997.

    Abstract:   We describe a robust method for locating and tracking lips in gray-level image sequences. Our approach learns patterns of shape variability from a training set which constrains the model during image search to only deform in ways similar to the training examples, Image search is guided by a learned gray- level model which is used to describe the large appearance variability of lips, Such variability might be due to different individuals, illumination, mouth opening, specularity, or visibility of teeth and tongue, Visual speech features are recovered from the tracking results and represent both shape and intensity information, We describe a speechreading (lip- reading) system, where the extracted features are modeled by Gaussian distributions and their temporal dependencies by hidden Markov models. Experimental results are presented for locating lips, tracking lips, and speechreading. The database used consists of a broad variety of speakers and was recorded in a natural environment with no special lighting or lip markers used, For a speaker independent digit recognition task using visual information only, the system achieved an accuracy about equivalent to that of untrained humans. (C) 1997 Academic Press.

  2. Solloway, S, Hutchinson, CE, Waterton, JC, and Taylor, CJ, "The use of active shape models for making thickness measurements of articular cartilage from MR images," MAGNETIC RESONANCE IN MEDICINE, vol. 37, pp. 943-952, 1997.

    Abstract:   Previously reported studies to quantify articular cartilage have used labor-intensive manual or semi-automatic data-driven techniques, demonstrating high accuracy and precision. However, none has been able to automate the segmentation process. This paper describes a fast, automatic, model-based approach to segmentation and thickness measurement of the femoral cartilage in 3D T-1-weighted images using active shape models (ASMs). Systematic experiments were performed to assess the accuracy and precision of the technique with in vivo images of both normal and abnormal knees. Segmentation accuracy was determined by comparing the results of the segmentation with the boundaries delineated by a radiologist, The mean error in locating the boundary was 0.57 pixels. To assess the precision of the measurement technique, the mean thickness of the femoral cartilage was calculated for repeated scans of five healthy volunteers. A mean coefficient of variation (CV) of 2.8% was obtained for the thickness measurements.

  3. Redhead, AL, Kotcheff, ACW, Taylor, CJ, Porter, ML, and Hukins, DWL, "An automated method for assessing routine radiographs of patients with total hip replacements," PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART H- JOURNAL OF ENGINEERING IN MEDICINE, vol. 211, pp. 145-154, 1997.

    Abstract:   This paper describes a new, fully automated method of locating objects on radiographs of patients with total joint replacements (TJRs). A statistical computer model, known as an active shape model, was trained to identify the position of the femur, pelvis, stem and cup marker wire on radiographs of patients with Charnley total hip prostheses. Once trained, the model was able to locate these objects through a process of automatic image searching, despite their appearance depending on the orientation and anatomy of the patient. Experiments were carried out to test the accuracy with which the model was able to fit to previously unseen data and with which reference points could be calculated from the model points. The model was able to locate the femur and stem with a mean error of approximately 0.8 mm and a 95 per cent confidence limit of 1.7 mn. Once the model had successfully located these objects, the midpoint of the stem head could be calculated with a mean error of approximately 0.2 mm. Although the model has been trained on Charnley total hip replacements, the method is generic and so can be applied to radiographs of patients with any TJR. This paper shows that computer models can form the basis of a quick, automatic method of taking measurements from standard clinical radiographs.

  4. Delibasis, K, Undrill, PE, and Cameron, GG, "Designing Fourier descriptor-based geometric models for object interpretation in medical images using genetic algorithms," COMPUTER VISION AND IMAGE UNDERSTANDING, vol. 66, pp. 286-300, 1997.

    Abstract:   In previous work we have modeled simple 3D anatomical objects using deformed superquadrics and established their optimal position with the aid of genetic algorithms (GAs). Here we extend the complexity of the search object using 3D Fourier descriptor (FD) representations and allow GAs once again to optimize the object's shape and position. Using magnetic resonance image data, we perform an approximate segmentation on one lateral ventricle in the brain and use the FDs from this as seeding values for the GAs to search for the left and right lateral ventricles in seven 3D data sets. We show that the method is capable of coping with normal biological variation. Finally, we compare FD/GA-guided segmentation with a manually guided interactive region growing method and find an agreement of 78 +/- 10% in voxel classification with a corresponding average edge placement error of 2.2 +/- 0.4 mm. (C) 1997 Academic Press.

  5. Smyth, PP, Taylor, CJ, and Adams, JE, "Automatic measurement of vertebral shape using Active Shape Models," INFORMATION PROCESSING IN MEDICAL IMAGING, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1230, pp. 441-446, 1997.

    Abstract:   In this paper, we describe how Active Shape Models (ASMs) have been used to accurately and robustly locate vertebrae in noisy lateral Dual Energy X-ray Absorptiometry (DXA) images of the spine. Vertebrae were located using either separate models for each vertebra, or a combined model of the whole spine. The combined model was found to be more robust. We show that ASMs allow normal vertebrae to be located as accurately as by human operators. We measure the performance of ASMs in locating fractured vertebrae of osteoporotic patients. We also describe how model parameters may be used to estimate how accurately a vertebra had been located, in order to detect vertebrae for which search had failed.

  6. Joshi, SC, Banerjee, A, Christensen, GE, Csernansky, JG, Haller, JW, Miller, MI, and Wang, L, "Gaussian random fields on sub-manifolds for characterizing brain surfaces," INFORMATION PROCESSING IN MEDICAL IMAGING, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1230, pp. 381-386, 1997.

    Abstract:   This paper provides analytical methods for characterizing the variation of the shape of neuroanatomically significant substructures of the brain in an ensemble of brain images. The focus of this paper is on the neuro-anatomical variation of the "shape" of 2-dimensional surfaces in the brain. Brain surfaces are studied by building templates that are smooth sub-manifolds of the underlying coordinate system of the brain. Variation of the shape in populations is quantified via defining Gaussian random vector fields on these sub-manifolds Methods for the empirical construction of Gaussian random vector fields for representing the variations of the substructures are presented. As an example, using these methods we characterize the shape of the hippocampus in a population of normal controls and schizophrenic brains. Results from a recently completed study comparing shapes of the hippocampus in a group of matched schizophrenic and normal control subjects are presented. Bayesian hypothesis test is formulated to cluster the normal and schizophrenic hippocampi in the population of 20 individuals.

  7. Duta, N, and Sonka, M, "Segmentation and interpretation of MR brain images using an improved knowledge-based active shape model," INFORMATION PROCESSING IN MEDICAL IMAGING, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1230, pp. 375-380, 1997.

    Abstract:   An improvement of the Active Shape procedure introduced by Cootes and Taylor is presented. The new automated brain segmentation and interpretation approach incorporates a priori knowledge about neuroanatomic structures and their specific structural relationships to provide robust segmentation and labeling. The method was trained in 8 MR brain images and tested in 19 brain images by comparison to observer-defined independent standards. Neuroanatomic structures in all images from the test set were successfully identified. The presented method is applicable to virtually any task involving deformable shape analysis.

  8. Ahmad, T, Taylor, CJ, Lanitis, A, and Cootes, TF, "Tracking and recognising hand gestures, using statistical shape models," IMAGE AND VISION COMPUTING, vol. 15, pp. 345-352, 1997.

    Abstract:   Hand gesture recognition from video images is of considerable interest as a means of providing simple and intuitive man- machine interfaces. Possible applications range from replacing the mouse as a pointing device to virtual reality and communication with the deaf. We describe an approach to tracking a hand in an image sequence and recognising, in each video frame. which of five gestures it has adopted. A statistically based Point Distribution Model (PDM) is used to provide a compact parametrised description of the shape of the hand for any of the gestures or the transitions between them. The values of the resulting shape parameters are used in a statistical classifier to identify gestures. The model can be used as a deformable template to track a hand through a video sequence but this proves unreliable. We describe how a set of models, one for each of the five gestures, can be used for tracking with the appropriate model selected automatically. We show that this results in reliable tracking and gesture recognition for two 'unseen' video sequences in which all the gestures are used.

  9. Sozou, PD, Cootes, TF, Taylor, CJ, DiMauro, EC, and Lanitis, A, "Non-linear point distribution modelling using a multi-layer perceptron," IMAGE AND VISION COMPUTING, vol. 15, pp. 457-463, 1997.

    Abstract:   Objects of the same class sometimes exhibit variation in shape. This shape variation has previously been modelled by means of point distribution models (PDMs) in which there is a linear relationship between a set of shape parameters and the positions of points on the shape. A polynomial regression generalization of PDMs, which succeeds in capturing certain forms of non-linear shape variability, has also been described. Here we present a new form of PDM, which uses a multi-layer perceptron to carry out non-linear principal component analysis. We compare the performance of the new model with that of the existing models on two classes of variable shape: one exhibits bending, and the other exhibits complete rotation. The linear PDM fails on both classes of shape; the polynomial regression model succeeds for the first class of shapes but fails for the second; the new multi-layer perceptron model performs well for both classes of shape. The new model is the most general formulation for PDMs which has been proposed to date. (C) 1997 Elsevier Science B.V.

  10. Smyth, PP, Taylor, CJ, and Adams, JE, "Automatic measurement of vertebral shape using active shape models," IMAGE AND VISION COMPUTING, vol. 15, pp. 575-581, 1997.

    Abstract:   In this paper, we describe how Active Shape Models (ASMs) have been used to accurately and robustly locate vertebrae in lateral Dual Energy X-ray Absorptiometry (DXA) images of the spine. DXA images are of low spatial resolution, and contain significant random and structural noise, providing a difficult challenge for object location methods. All vertebrae in the image were searched for simultaneously, improving robustness in location of individual vertebrae by making use of constraints on shape provided by the position of other vertebrae. We show that the use of ASMs with minimal user interaction allows accuracy to be obtained which is as good as that achievable by human operators, as well as high precision. Having located each vertebra, it is desirable to evaluate whether it has been located sufficiently accurately for shape measurements to be useful. We determined this on the basis of grey-level model fit, which was shown to usefully detect poorly located vertebrae, which should enable accuracy to be improved by rejecting proposed search solutions whose grey-level fit was poorer than a threshold. (C) 1997 Elsevier Science B.V.

  11. Kotcheff, ACW, and Taylor, CJ, "Automatic construction of eigenshape models by genetic algorithm," INFORMATION PROCESSING IN MEDICAL IMAGING, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1230, pp. 1-14, 1997.

    Abstract:   A new approach to the problem of automatic construction of eigenshape models is presented. Eigenshape models have proved to be successful in a variety of medical image analysis problems. However, automatic model construction is a difficult problem, and in many applications the models are built by hand - a painstaking process. Previous attempts to produce models automatically have been applicable only in specific cases or under certain assumptions. We show that the problem can be understood very simply in terms of shape symmetries. The pose and parameterisation of each shape must be chosen so as to produce a model that is compact and specific. Mie define an objective function that measures these properties. The problem of automatic model construction is thus reduced to an optimisation problem. We show that the objective function we define can be optimised by a Genetic Algorithm, and produces models that are better than hand built ones.

  12. Rueckert, D, and Burger, P, "Geometrically deformable templates for shape-based segmentation and tracking in cardiac MR images," ENERGY MINIMIZATION METHODS IN COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1223, pp. 83-98, 1997.

    Abstract:   We present a new approach to shape-based segmentation and tracking of multiple, deformable anatomical structures in cardiac MR images. We propose to use an energy-minimizing geometrically deformable template (GDT) which can deform into similar shapes under the influence of image forces. The degree of deformation of the template from its equilibrium shape is measured by a penalty function associated with mapping between the two shapes. In 2D, this term corresponds to the bending energy of an idealized thin-plate of metal. By minimizing this term along with the image energy terms of the classic deformable model, the deformable template is attracted towards objects in the image whose shape is similar to its equilibrium shape. This framework allows for the simultaneous segmentation of multiple deformable objects using intra- as well as inter- shape information. The energy minimization problem of the deformable template is formulated in a Bayesian framework and solved using relaxation techniques: Simulated Annealing (SA), a stochastic relaxation technique is used for segmentation while Iterated Conditional Modes (ICM), a deterministic relaxation technique is used for tracking. We present results of the algorithm applied to the reconstruction of the left and right ventricle of the human heart in 4D MR images.

  13. Liu, TL, and Geiger, D, "Visual deconstruction: Recognizing articulated objects," ENERGY MINIMIZATION METHODS IN COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1223, pp. 295-309, 1997.

    Abstract:   We propose a deconstruction framework to recognize and find articulated objects. In particular we axe interested in human arm and leg articulations. The deconstruction view of recognition naturally decomposes the problem of finding an object in an image, into the one of (i) extracting key features in an image, (ii) detecting key points in the models, (iii) segmenting an image, and (iv) comparing shapes. All of these subproblems can not be resolved independently. Together, they reconstruct the object in the image. We briefly address (i) and (ii) to focus on solving together shape similarity and segmentation, combining top-down & bottom-up algorithms. We show that the visual deconstruction approach is derived as an optimization for a Bayesian-Information theory, and that the whole process is naturally generated by the guaranteed Dijkstra optimization algorithm.

  14. Lanitis, A, Taylor, CJ, and Cootes, TF, "Automatic interpretation and coding of face images using flexible models," IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, vol. 19, pp. 743-756, 1997.

    Abstract:   Face images are difficult to interpret because they are highly variable. Sources of variability include individual appearance, 3D pose, facial expression, and lighting. We describe a compact parametrized model of facial appearance which takes into account all these sources of variability. The model represents both shape and gray-level appearance, and is created by performing a statistical analysis over a training set of face images. A robust multiresolution search algorithm is used to fit the model to faces in new images. This allows the main facial features to be located, and a set of shape, and gray- level appearance parameters to be recovered. A good approximation to a given face can be reconstructed using less than 100 of these parameters. This representation can be used for tasks such as image coding, person identification, 3D pose recovery, gender recognition, and expression recognition. Experimental results are presented for a database of 690 face images obtained under widely varying conditions of 3D pose, lighting, and facial expression. The system performs well on ail the tasks listed above.

  15. Vetter, T, and Poggio, T, "Linear object classes and image synthesis from a single example image," IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, vol. 19, pp. 733-742, 1997.

    Abstract:   The need to generate new views of a 3D object from a single real image arises in several fields, including graphics and object recognition. While the traditional approach relies on the use of 3D models, we have recently introduced [1], [2], [3] simpler techniques that are applicable under restricted conditions. The approach exploits image transformations that are specific to the relevant object class, and learnable from example Views of other ''prototypical'' objects df the same class. In this paper, we introduce such a technique by extending the notion of linear class proposed by Poggio and Vetter. For linear object classes, it is shown that linear transformations can be learned exactly from a basis set of 2D prototypical views. We demonstrate the approach on artificial objects and then show preliminary evidence that the technique can effectively ''rotate'' high-resolution face images from a single 2D view.

  16. Pavlovic, VI, Sharma, R, and Huang, TS, "Visual interpretation of hand gestures for human-computer interaction: A review," IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, vol. 19, pp. 677-695, 1997.

    Abstract:   The use of hand gestures provides an attractive alternative to cumbersome interface devices for human-computer interaction (HCl). In particular, Visual interpretation of hand gestures can help in achieving the ease and naturalness desired for HCl. This has motivated a very active research area concerned with computer vision-based analysis and interpretation of hand gestures. We survey the literature on visual interpretation of hand gestures in the context of its role in HCl. This discussion is organized on the basis of the method used for modeling, analyzing, and recognizing gestures. Important differences in the gesture interpretation approaches arise depending on whether a 3D model of the human hand or an image appearance model of the human hand is used. 3D hand models offer a way of more elaborate modeling of hand gestures but lead to computational hurdles that have not been overcome given the real-time requirements of HCl. Appearance-based models lead to computationally efficient ''purposive'' approaches that work well under constrained situations but seem to lack the generality desirable for HCl. We also discuss implemented gestural systems as well as other potential applications of vision-based gesture recognition. Although the current progress is encouraging, further theoretical as well as computational advances are needed before gestures can be widely used for HCl. We discuss directions of future research in gesture recognition, including its integration with other natural modes of human-computer interaction.

  17. Yow, KC, and Cipolla, R, "Feature-based human face detection," IMAGE AND VISION COMPUTING, vol. 15, pp. 713-735, 1997.

    Abstract:   Human face detection has always been an important problem for face, expression and gesture recognition. Though numerous attempts have been made to detect and localize faces, these approaches have made assumptions that restrict their extension to more general cases. We identify that the key factor in a generic and robust system is that of using a large amount of image evidence, related and reinforced by model knowledge through a probabilistic framework. In this paper, we propose a feature-based algorithm for detecting faces that is sufficiently generic and is also easily extensible to cope with more demanding variations of the imaging conditions. The algorithm detects feature points from the image using spatial filters and groups them into face candidates using geometric and gray level constraints. A probabilistic framework is then used to reinforce probabilities and to evaluate the likelihood of the candidate as a face. We provide results to support the validity of the approach and demonstrate its capability to detect faces under different scale, orientation and viewpoint. (C) 1997 Elsevier Science B.V.

  18. Vetter, T, and Troje, NF, "Separation of texture and shape in images of faces for image coding and synthesis," JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, vol. 14, pp. 2152-2161, 1997.

    Abstract:   Human faces differ in shape and texture. Image representations based on this separation of shape and texture information have been reported by several authors [for a review, see Science 272, 1905 (1996)]. We investigate such a representation of human faces based on a separation of texture and two- dimensional shape information. Texture and shape were separated by use of pixel-by-pixel correspondence among the various images, which was established through algorithms known from optical flow computation. We demonstrate the improvement of the proposed representation over well-established pixel-based techniques in terms of coding efficiency and in terms of the ability to generalize to new images of faces. The evaluation is performed by calculating different distance measures between the original image and its reconstruction and by measuring the time that human subjects need to discriminate them. (C) 1997 Optical Society of America.

  19. Taylor, CJ, Cootes, TF, Lanitis, A, Edwards, G, Smyth, P, and Kotcheff, ACW, "Model-based interpretation of complex and variable images," PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY OF LONDON SERIES B-BIOLOGICAL SCIENCES, vol. 352, pp. 1267-1274, 1997.

    Abstract:   The ultimate goal of machine vision is image understanding-the ability not only to recover image structure but also to know what it represents. By definition, this involves the use of models which describe and label the expected structure of the world. Over the past decade, model-based vision has been applied successfully to images of man-made objects. It has proved much more difficult to develop model-based approaches to the interpretation of images of complex and variable structures such as faces or the internal organs of the human body (as visualized in medical images). In such cases it has been problematic even to recover image structure reliably without a model to organize the often noisy and incomplete image evidence. The key problem is that of variability. To be useful, a model needs to be specific-that is, to be capable of representing only 'legal' examples of the modelled object(s). It has proved difficult to achieve this whilst allowing for natural variability. Recent developments have overcome this problem; it has been shown that specific patterns of variability in shape and grey-level appearance can be captured by statistical models that can be used directly in image interpretation. The details of the approach are outlined and practical examples from medical image interpretation and face recognition are used to illustrate how previously intractable problems can now be tackled successfully. It is also interesting to ask whether these results provide any possible insights into natural vision; for example, we show that the apparent changes in shape which result from viewing three- dimensional objects from different viewpoints can be modelled quite well in two dimensions; this may lend some support to the 'characteristic views' model of natural vision.

  20. Subsol, G, Roberts, N, Doran, M, Thirion, JP, and Whitehouse, GH, "Automatic analysis of cerebral atrophy," MAGNETIC RESONANCE IMAGING, vol. 15, pp. 917-927, 1997.

    Abstract:   3D MR data obtained for 10 healthy control subjects have been used to build a brain atlas, The atlas is built in four stages, First, a set of features that are unambiguously definable and anatomically relevant need to be computed for each item in the database, The chosen features are crest lines along which the maximal principal curvature of the surface of the brain is maximal in its associated principal direction, Second, a nonrigid registration algorithm is used to determine the common crest lines among the subjects in the database, These crest lines form the structure of the atlas. Third, a set of crest lines is taken as a reference set and a modal analysis is performed to determine the fundamental deformations that are necessary to bring the individual data in line with the reference set, The deformations are averaged and the set of mean crest lines becomes the atlas, Finally, the standard deviation of the deformations between the atlas and the items in the database defines the normal variation in the relative positions of the crest lines in a healthy population, In a fully automatic procedure, the crest lines on the surface of the brain adjacent to the cerebral ventricles in a patient with primary progressive aphasia were compared to the atlas; confirmation that the brain of this patient demonstrates atrophy was provided by stereological analysis that showed that the volume of the left cerebral hemisphere is 48.8 ml (CE 2.8%) less than the volume of the right cerebral hemisphere in the region of the temporal and frontal lobes, When the amplitude of the deformations necessary to register the crest lines obtained for the patient with the atlas were greater than three standard deviations beyond the variability inherent in the atlas, the deformation was considered significant, Four of the five main deformation modes of the longest crest line of the surface of the brain adjacent to the cerebral ventricles were significantly different in the patient with primary progressive aphasia compared to the atlas, The ventricles are preferentially enlarged in the left cerebral hemisphere, Furthermore, they are closer together posteriorly and further apart anteriorly than in the atlas, These observations may be indicative of the atrophy of the temporal and frontal lobes of the left cerebral hemisphere noted in the patient, Ultimately, the approach may provide a useful screening technique for identifying brain diseases involving cerebral atrophy, Serial studies of individual patients may provide insights into the processes controlling or affected by particular diseases. (C) 1997 Elsevier Science Inc.

  21. Rueckert, D, and Burger, P, "Shape-based segmentation and tracking in 4D cardiac MR images," CVRMED-MRCAS'97, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1205, pp. 43-52, 1997.

    Abstract:   We present a new approach to shape-based segmentation and tracking of multiple, deformable anatomical structures in cardiac MR images. We propose to use an energy-minimizing geometrically deformable template (GDT) which can deform into similar shapes under the influence of image forces. The degree of deformation of the template from its equilibrium Shape is measured by a penalty function associated with mapping between the two shapes. In 2D, this term corresponds to the bending energy of an idealized thin-plate of metal. By minimizing this term along with the image energy terms of the classic deformable model, the deformable template is attracted towards objects in the image whose shape is similar to its equilibrium shape. This framework allows the simultaneous segmentation of multiple deformable objects using intra-as well as inter-shape information. The energy minimization problem of the deformable template is formulated in a Bayesian framework and solved using relaxation techniques: Simulated Annealing (SA), a stochastic relaxation technique is used for segmentation while Iterated Conditional Modes (ICM), a deterministic relaxation technique is used for tracking. We present results of the algorithm applied to the reconstruction of the left and right ventricle of the human heart in 4D MR images.

  22. Montagnat, J, and Delingette, H, "Volumetric medical images segmentation using shape constrained deformable models," CVRMED-MRCAS'97, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1205, pp. 13-22, 1997.

    Abstract:   In this paper we address the problem of extracting geometric models from lour contrast volumetric images, given a template or reference shape of that model. We proceed by deforming a reference model in a volumetric image. This reference deformable model is represented as a simplex mesh submitted to regularizing shape constraint. Furthermore, we introduce an original approach that combines the deformable model framework with the elastic registration (based on iterative closest point algorithm) method. This new method increases the robustness of segmentation while allowing very complex deformation, of the original template. Examples of segmentation of the liver and brain ventricles are provided.

  23. Grzeszczuk, RP, and Levin, DN, "''Brownian strings'': Segmenting images with stochastically deformable contours," IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, vol. 19, pp. 1100-1114, 1997.

    Abstract:   This paper describes an image segmentation technique in which an arbitrarily shaped contour was deformed stochastically until it fitted around an object of interest. The evolution of the contour was controlled by a simulated annealing process which caused the contour to settle into the global minimum of an image-derived ''energy'' function. The nonparametric energy function was derived from the statistical properties of previously segmented images, thereby incorporating prior experience. Since the method was based on a state space search for the contour with the best global properties, it was stable in the presence of image errors which confound segmentation techniques based on local criteria, such as connectivity. Unlike ''snakes'' and other active contour approaches, the new method could handle arbitrarily irregular contours in which each interpixel crack represented an independent degree of freedom. Furthermore, since the contour evolved toward the global minimum of the energy, the method was more suitable for fully automatic applications than the snake algorithm, which frequently has to be reinitialized when the contour becomes trapped in local energy minima. High computational complexity was avoided by efficiently introducing a random local perturbation in a time independent of contour length, providing control over the size of the perturbation, and assuring that resulting shape changes were unbiased. The method was illustrated by using it to find the brain surface in magnetic resonance head images and to track blood vessels in angiograms.

  24. OToole, AJ, Vetter, T, Volz, H, and Salter, EM, "Three-dimensional caricatures of human heads: distinctiveness and the perception of facial age," PERCEPTION, vol. 26, pp. 719-732, 1997.

    Abstract:   A standard facial-caricaturing algorithm was applied to a three-dimensional representation of human heads. This algorithm sometimes produced heads that appeared 'caricatured'. More commonly, however, exaggerating the distinctive three- dimensional information in a face seemed to produce an increase in the apparent age of the face-both at a local level, by exaggerating small facial creases into wrinkles, and at a more global level via changes that seemed to make the underlying structure of the skull more evident. Concomitantly, de-emphasis of the distinctive three-dimensional information in a face made it appear relatively younger than the veridical and caricatured faces. More formally, face-age judgments made by human observers were ordered according to the level of caricature, with anticaricatures judged younger than veridical faces, and veridical faces judged younger than caricatured faces. These results are discussed in terms of the importance of the nature of the features made more distinct by a caricaturing algorithm and the nature of human representation(s) of faces.

  25. Hozumi, T, Yoshida, K, Yoshioka, H, Yagi, T, Akasaka, T, Takagi, T, Nishiura, M, Watanabe, M, and Yoshikawa, J, "Echocardiographic estimation of left ventricular cavity area with a newly developed automated contour tracking method," JOURNAL OF THE AMERICAN SOCIETY OF ECHOCARDIOGRAPHY, vol. 10, pp. 822-829, 1997.

    Abstract:   Development of an automated contour tracking method provides detection and tracking of the endocardial boundary using the energy minimization method without tracing a region of interest. The purpose of this study was to compare the automated contour tracking method and manually drawn methods for the measurement of left ventricular cavity areas and fractional area change. Apical four-chamber view was visualized and recorded for off-line analysis in 11 patients by means of two-dimensional echocardiography. The automated contour tracking method automatically traces the endocardial border from the recorded images and calculates left ventricular cavity areas (end-diastole and end-systole) and fractional area change. In the same images selected as end-diastole and end- systole in the automated contour tracking method, left ventricular endocardial border was manually traced to calculate left ventricular cavity areas and fractional area change. Both methods were compared by Linear regression analysis for the measurement of cavity areas and fractional area change. Left ventricular areas measured by the automated contour tracking method showed an excellent correlation with those by the manual method (end-diastole: r = 0.99, y = 0.83x + 2.6, standard error of the estimate = 1.5 cm(2); end-systole: r = 0.99, y = 0.96x - 0.8, standard error of the estimate = 1.2 cm(2)). The mean differences between the automated contour tracking and manual methods were -3.1 +/- 5.1 cm(2) and -1.6 +/- 2.4 cm(2) at end- diastole and end-systole, respectively. Fractional area change determined by the automated contour tracking method correlated well with that by the manual method (r = 0.95, y = 1.17x - 6.5, standard error of the estimate = 3.4%). The mean difference between the automated contour tracking and manual methods was - 0.8% +/- 7.1%. In conclusion, a newly developed automated contour tracking method correlates highly with the manual method for the estimation of left ventricular cavity areas and fractional area change in high-quality images. This suggests that this new technique may be useful in the automated quantitation of left ventricular function in patients with high-quality images with no dropout and no intercavity artifact or structure.

  26. Huang, CL, Chang, WT, Wu, LC, and Wang, JK, "Three-dimensional PET emission scan registration and transmission scan synthesis," IEEE TRANSACTIONS ON MEDICAL IMAGING, vol. 16, pp. 542-561, 1997.

    Abstract:   The duration of a positron emission tomography (PET) imaging scan can be reduced if the transmission scan of one patient which is used for emission correction can be synthesized by using the reference transmission scan of another patient, In this paper, we propose a new intersubjects PET emission scan registration method and PET transmission synthesis method by using the boundary information of the body or brain scan of the PET emission scans. The PET emission scans have poor image quality and different intensity statistics so that we preprocess the emission scans to have similar histogram and then apply the point distribution model (PDM) [15] to extract the contours of the emission scan, The extracted boundary contour of every slice is used to reconstruct the three- dimensional (3-D) surface of the reference set and the target set, Our registration is 3-D surface-based which uses the normal flow method [17] to find the correspondence vector field between two 3-D reconstructed surfaces, Since it is difficult to analyze internal organ using the PET emission scan imaging without correction, we assume that the deformation of internal organ is homogeneous, With the corresponding vector field between the two emission scans and the transmission scan of the reference set, we can synthesize the transmission scan of the target set.

  27. Huang, CL, and Huang, YM, "Facial expression recognition using model-based feature extraction and action parameters classification," JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, vol. 8, pp. 278-290, 1997.

    Abstract:   This paper introduces an automatic facial expression recognition system which consists of two parts: facial feature extraction and facial expression recognition. The system applies the point distribution model and the gray-level model to find the facial features. Then the position variations of certain designated points on the facial feature are described by 10 action parameters (APs). There are two phases in the recognition process: the training phase and the recognition phase. In the training phase, given 90 different expressions, the system classifies the principal components of the APs of all training expressions into six different clusters. In the recognition phase, given a facial image sequence, it identifies the facial expressions by extracting the 10 APs, analyzes the principal components, and finally calculates the AP profile correlation for a higher recognition rate. In the experiments, our system has demonstrated that it can recognize the facial expression effectively. (C) 1997 Academic Press.

  28. Choi, KN, Cross, ADJ, and Hancock, ER, "Localising facial features with matched filters," AUDIO- AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1206, pp. 11-20, 1997.

    Abstract:   This paper describes a study of facial feature recognition using matched filter techniques. The basic aim is to develop a set of filters that can be used to characterise each of eight different facial features. These are left and right eyes, left and right-eyebrows, hairline, nose, mouth and chin. The matched filters are extracted from training images using inverse Fourier analysis. We provide an experimental evaluation of the method on the University of Berne face data-base.. Here we explore the most effective choice of training data so that the filters can be effectively applied when the facial pose varies. We also evaluate the effectiveness of the method when facial occlusion due to spectacles is present.

  29. Bowden, R, Mitchell, TA, and Sarhadi, M, "Cluster based nonlinear principle component analysis," ELECTRONICS LETTERS, vol. 33, pp. 1858-1859, 1997.

    Abstract:   In the field of computer vision, principle component analysis (PCA) is often used to provide statistical models of shape, deformation or appearance. This simple statistical model provides a constrained. compact approach to model based vision. However, as larger problems are considered. high dimensionality and nonlinearity make linear PCA an unsuitable and unreliable approach. A nonlinear PCA (NLPCA) technique is proposed which uses cluster analysis and dimensional reduction to provide a fast. robust solution. Simulation results on both 2D contour models and greyscale images are presented.

  30. Joshi, SC, Miller, MI, and Grenander, U, "On the geometry and shape of brain sub-manifolds," INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, vol. 11, pp. 1317-1343, 1997.

    Abstract:   This paper develops mathematical representations for neuro- anatomically significant substructures of the brain and their variability in a population. The focus of the paper is an the neuro-anatomical variation of the geometry and the "shape" of two-dimensional surfaces in the brain. As examples, we focus on the cortical and hippocampal surfaces in an ensemble of Macaque monkeys and human MRI brains. The "shapes" of the substructures are quantified via the construction of templates; the variations are represented by defining probabilistic deformations of the template. Methods for empirically estimating probability measures on these deformations are developed by representing the deformations as Gaussian random vector fields on the embedded sub-manifolds. The Gaussian random vector fields are constructed as quadratic mean limits using complete orthonormal bases on the sub-manifolds. The complete orthonormal bases are generated using modes of vibrations of the geometries of the brain sub-manifolds. The covariances are empirically estimated from an ensemble of brain data. Principal component analysis is presented for characterizing the "eigen-shape" of the hippocampus in an ensemble of MRI-MPRAGE whole brain images. Clustering based on eigen-shape is presented for two sub-populations of normal and schizophrenic.

  31. Christensen, GE, Joshi, SC, and Miller, MI, "Volumetric transformation of brain anatomy," IEEE TRANSACTIONS ON MEDICAL IMAGING, vol. 16, pp. 864-877, 1997.

    Abstract:   This paper presents diffeomorphic transformations of three- dimensional (3-D) anatomical image data of the macaque occipital lobe and whole brain cryosection imagery and of deep brain structures in human brains as imaged via magnetic resonance imagery, These transformations are generated in a hierarchical manner, accommodating both global and local anatomical detail, The initial low-dimensional registration is accomplished by constraining the transformation to be in a low- dimensional basis, The basis is defined by the Green's function of the elasticity operator placed at predefined locations in the anatomy and the eigenfunctions of the elasticity operator, The high-dimensional large deformations are vector fields generated via the mismatch between the template and target- image volumes constrained to be the solution of a Navier-Stokes fluid model. As part of this procedure, the Jacobian of the transformation is tracked, insuring the generation of diffeomorphisms. It is shown that transformations constrained by quadratic regularization methods such as the Laplacian, biharmonic, and linear elasticity models, do not ensure that the transformation maintains topology and, therefore, must only be used for coarse global registration.

  32. Hill, A, Brett, AD, and Taylor, CJ, "Automatic landmark identification using a new method of non- rigid correspondence," INFORMATION PROCESSING IN MEDICAL IMAGING, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1230, pp. 483-488, 1997.

    Abstract:   A method for corresponding the boundaries of two shapes is presented. The algorithm locates a matching pair of sparse polygonal approximations, one for each of a pair of boundaries, by minimising a cost function using a greedy algorithm. The cost function expresses the dissimilarity in both the shape and representation error (with respect to the defining boundary) of the sparse polygons. Results are presented for three classes of shape which exhibit various types of non-rigid deformation. The algorithm is also applied to an automatic landmark identification task for the construction of statistical shape models.

 
1998

  1. Schuhmann, D, Seemann, M, Schoepf, UJ, Haubner, M, Krapichler, C, Gebicke, K, Reiser, M, and Englmeier, KH, "Computerized diagnostic data analysis and 3-D visualization," RADIOLOGE, vol. 38, pp. 799-809, 1998.

    Abstract:   Purpose:To survey methods for 3D data visualization and image analysis which can be used for computer based diagnostics. Material and methods: The methods available are explained in short terms and links to the literature are presented. Methods which allow basic manipulation of 3D data are windowing, rotation and clipping. More complex methods for visualization of 3D data are multiplanar reformation, volume projections (MIP,semi-transparent projections) and surface projections. Methods for image analysis comprise local data transformation (e.g. filtering) and definition and application of complex models (e.g. deformable models). Results: Volume projections produce an impression of the 3D data set without reducing the data amount. This supports the interpretation of the 3D data set and saves time in comparison to any investigation which requires examination of all slice images. More advanced techniques for visualization, e.g. surface projections and hybrid rendering visualize anatomical information to a very detailed extent, but both techniques require the segmentation of the structures of interest. Image analysis methods can be used to extract these structures(e.g. an organ)from the image data. Discussion:At the present time volume projections are robust and fast enough to be used routinely. Surface projections can be used to visualize complex and presegmented anatomical features.

  2. Hagenlocker, M, and Fujimura, K, "CFFD: a tool for designing flexible shapes," VISUAL COMPUTER, vol. 14, pp. 271-287, 1998.

    Abstract:   This paper describes a solid deformation method, composed free- form deformation (CFFD), which applies a sequence of uniform periodic B-spline FFDs over 3D Euclidean space. The construction of the individual FFDs, which are defined by unbounded control lattices, is described, concentrating on a method by which feature point trajectories are employed to control lattice point displacements. Problems due to mutual influence of feature point trajectories are discussed, and frozen points, which inhibit deformation in their proximity, are introduced. Also, methods for constructing the FFD lattices are proposed which control mutual influence of feature point trajectories. The paper addresses computational issues of constructing and applying CFFDs, and discusses the application of CFFD to 3D design and animation.

  3. Montagnat, J, and Delingette, H, "Globally constrained deformable models for 3D object reconstruction," SIGNAL PROCESSING, vol. 71, pp. 173-186, 1998.

    Abstract:   To achieve geometric reconstruction from 3D datasets two complementary approaches have been widely used. On one hand, the deformable model framework locally applies forces to fit the data. On the other hand, the non-rigid registration framework computes a global transformation minimizing the distance between a template and the data. We first show that applying a global transformation on a surface template, is equivalent to applying certain global forces on a deformable model. Second, we propose a scheme which combines the registration and free-form deformation. This globally constrained deformation scheme allows us to control the amount of deformation from the reference shape with a single parameter. Finally, we propose a general algorithm for performing model-based reconstruction in a robust and accurate manner. Examples on both range data and medical images are used to illustrate and validate the globally constrained deformation framework. (C) 1998 Elsevier Science B.V. All rights reserved.

  4. Jain, AK, Zhong, Y, and Dubuisson-Jolly, MP, "Deformable template models: A review," SIGNAL PROCESSING, vol. 71, pp. 109-129, 1998.

    Abstract:   In this paper, we review the recently published work on deformable models. We have chosen to concentrate on 2D deformable models and relate the energy minimization approaches to the Bayesian formulations. We categorize the various active contour systems according to the definition of the deformable model. We also present in detail one particular formulation for deformable templates which combines edge, texture, color and region information for the external energy and model deformations using wavelets, splines or Fourier descriptors. We explain how these models can be used for segmentation, image retrieval in a large database and object tracking in a video sequence. (C) 1998 Elsevier Science B.V. All rights reserved.

  5. Fujimura, K, and Makarov, M, "Foldover-free image warping," GRAPHICAL MODELS AND IMAGE PROCESSING, vol. 60, pp. 100-111, 1998.

    Abstract:   An image warping method is presented that deforms an image continuously without foldover, while observing a given set of trajectories of feature elements. Any intermediate image during the morph is homeomorphic to the initial image and the morphing process is a homotopy. The method permits points, line- segments, and polygons to be included as features in the image. Our method is based on time-varying triangulation, that is, triangulation changes as features move. Accordingly, the deformation mapping is updated locally for the part for which the triangulation changes. Experimental results are included to demonstrate the feasibility of our approach and the complexity of the algorithm is analyzed. (C) 1998 Academic Press.

  6. Ip, HHS, and Shen, DG, "An affine-invariant active contour model (AI-snake) for model- based segmentation," IMAGE AND VISION COMPUTING, vol. 16, pp. 135-146, 1998.

    Abstract:   In this paper, we show that existing shaped-based active contour models are not affine-invariant and we addressed the problem by presenting an affine-invariant snake model (AI- snake) such that its energy function are defined in terms local and global affine-invariant features. The main characteristic of the AI-snake is that, during the process of object extraction, the pose of the model contour is dynamically adjusted such that it is in alignment with the current snake contour by solving the snake-prototype correspondence problem and determining the required affine transformation. In addition, we formulate the correspondence matching between the snake and the object prototype as an error minimization process between two feature vectors which capture both local and global deformation information. We show that the technique is robust against object deformations and complex scenes. (C) 1998 Elsevier Science B.V.

  7. Edwards, GJ, Lanitis, A, Taylor, CJ, and Cootes, TF, "Statistical models of face images - Improving specificity," IMAGE AND VISION COMPUTING, vol. 16, pp. 203-211, 1998.

    Abstract:   Model-based approaches to the interpretation of face images have proved very successful. We have previously described statistically based models of face shape and grey-level appearance and shown bow they can be used to perform various coding and interpretation tasks. In the paper we describe improved methods of modelling which couple shape and grey-level information more directly than our existing methods, isolate the changes in appearance due to different sources of variability (person, expression, pose, lighting) and deal with non-linear shape variation. We show that the new methods are better suited to interpretation and tracking tasks. (C) 1998 Elsevier Science B.V.

  8. Glasbey, CA, and Mardia, KV, "A review of image-warping methods," JOURNAL OF APPLIED STATISTICS, vol. 25, pp. 155-171, 1998.

    Abstract:   Image warping is a transformation which maps all positions in one image plane to positions in a second plane. It arises in many image analysis problems, whether in order to remove optical distortions introduced by a camera or a particular viewing perspective, to register art image with a map or template, or to align two or more images. The choice of warp is a compromise between a smooth distortion and one which achieves a good match. Smoothness can be ensured by assuming a parametric form for the warp or by constraining it using differential equations. Matching can be specified by points to be brought into alignment, by local measures of correlation between images, or by the coincidence of edges. Parametric and non-parametric approaches to warping, and matching criteria, are reviewed.

  9. Garrido, A, and De la Blanca, NP, "Physically-based active shape models: Initialization and optimization," PATTERN RECOGNITION, vol. 31, pp. 1003-1017, 1998.

    Abstract:   In this paper we describe a new approach for 2-D object segmentation using an automatic method applied on images with problems as partial information, overlapping objects, many objects in a single scene, severe noise conditions and locating objects with a very high degree of deformation. We use a physically-based shape model to obtain a deformable template, which is defined on a canonical orthogonal coordinate system. The proposed methodology works starting from the output of an edge detector, which is processed to automatically obtain an approximation of the shape. The final estimation of the shapes is obtained fitting a deformable template model, which is defined on a learned surface of deformation. Results from biological images are presented. (C) 1998 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved.

  10. Ivins, JP, and Porrill, J, "A deformable model of the human iris for measuring small three- dimensional eye movements," MACHINE VISION AND APPLICATIONS, vol. 11, pp. 42-51, 1998.

    Abstract:   This paper describes a deformable model of the human iris which forms part of a system for accurate offline measurement of binocular three-dimensional eye movements, particularly cyclotorsion (torsion), from video image sequences. At least two existing systems measure torsion from infrared video images by pupil tracking followed by cross correlation using arcs of bandpass-filtered iris texture. Unfortunately, pupil expansion and contraction reduces the accuracy of this method unless drugs are used to constrict the pupil, which causes temporary blurred vision. A five-parameter deformable model of the iris is therefore developed for analysing images obtained without the use of drugs. This model can translate (horizontal and vertical eye motion), rotate (torsion) and scale both uniformly and radially (pupil changes). Torsion measurements obtained with the model are repeatable and accurate to within 0.1 degrees; this performance in illustrated by analysing binocular torsion during fixation on a stationary target.

  11. Vetter, T, "Synthesis of novel views from a single face image," INTERNATIONAL JOURNAL OF COMPUTER VISION, vol. 28, pp. 103-116, 1998.

    Abstract:   Images formed by a human face change with viewpoint. A new technique is described for synthesizing images of faces from new viewpoints, when only a single 2D image is available. A novel 2D image of a face can be computed without explicitly computing the 3D structure of the head. The technique draws on a single generic 3D model of a human head and on prior knowledge of faces based on example images of other faces seen in different poses. The example images are used to "learn" a pose-invariant shape and texture description of a new face. The 3D model is used to solve the correspondence problem between images showing faces in different poses. The proposed method is interesting for view independent face recognition tasks as well as for image synthesis problems in areas like teleconferencing and virtualized reality.

  12. Gu, C, and Lee, MC, "Semiautomatic segmentation and tracking of semantic video objects," IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, vol. 8, pp. 572-584, 1998.

    Abstract:   This paper introduces a novel semantic video object extraction system using mathematical morphology and a perspective motion model. Inspired by the results from the study of the human visual system, we intend to solve the semantic video object extraction problem in two separate steps: supervised I-frame segmentation, and unsupervised P-frame tracking. First, the precise semantic video object boundary can be found using a combination of human assistance and a morphological segmentation tool. Second, the semantic video objects in the remaining frames are obtained using global perspective motion estimation and compensation of the previous semantic video object plus boundary refinement as used for I frames.

  13. Lepsoy, S, and Curinga, S, "Conversion of articulatory parameters into active shape model coefficients for lip motion representation and synthesis," SIGNAL PROCESSING-IMAGE COMMUNICATION, vol. 13, pp. 209-225, 1998.

    Abstract:   Speech-driven facial animation combines techniques from different disciplines such as image analysis, computer graphics, and speech analysis. Active shape models (ASM) used in image analysis are excellent tools for characterizing lip contour shapes and approximating their motion in image sequences. By controlling the coefficients for an ASM, such a model can also be used for animation. We design a mapping of the articulatory parameters used in phonetics into ASM coefficients that control nonrigid lip motion. The mapping is designed to minimize the approximation error when articulatory parameters measured on training lip contours are taken as input to synthesize the training lip movements. Since articulatory parameters can also be estimated from speech, the proposed technique can form an important component of a speech-driven facial animation system. (C) 1998 Elsevier Science B.V. All rights reserved.

  14. Ivins, J, and Porrill, J, "Constrained active region models for fast tracking in color image sequences," COMPUTER VISION AND IMAGE UNDERSTANDING, vol. 72, pp. 54-71, 1998.

    Abstract:   Image segmentation is a fundamental problem in computer vision, for which deformable models offer a partial solution. Most deformable models work by performing some kind of edge detection; complementary region growing methods have not often been used. As a result, deformable models that track regions rather than edges have yet to be developed to a great extent. Active region models are a relatively new type of deformable model driven by a region energy that is a function of the statistical characteristics of an image. This paper describes the use of constrained active region models for frame-rate tracking in color video images on widely available computer hardware. Two of the many color representations now in use are reviewed for this purpose: the intensity-based RGB space and the more intuitive HSV space. Normalized RGB, which is essentially a measure of hue and saturation, emerges as the preferred representation because it is invariant to illumination changes and can be obtained from many frame- grabbers via a simple fast software transformation. Three types of motion are examined for constraining deformable models: rigid models can only translate and rotate to fit image features; conformal models can also change size; affine models exhibit two kinds of shearing in addition to the other components. Two methods are described for producing affine motion, given the desired unconstrained motion calculated by searching for local energy minima lying perpendicular to the model boundary. An existing method, based on iterative gradient descent, computes translating, rotating, scaling, and shearing forces which can be combined to produce affine and other types of motion. A faster, more accurate method uses least-squares minimization to approximate the desired motion; with this method it is also possible to derive specific equations for rigid and conformal motion and to correct for the aperture problem associated with the perpendicular search method. The advantages of the new least-squares method are illustrated by using it to drive an active region model via an affine transformation which tracks the movements of a robot arm at frame rate in color video images, (C) 1998 Academic Press.

  15. Cross, ADJ, and Hancock, ER, "Graph matching with a dual-step EM algorithm," IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, vol. 20, pp. 1236-1253, 1998.

    Abstract:   This paper describes a new approach to matching geometric structure in 2D point-sets. The novel feature is to unify the tasks of estimating transformation geometry and identifying point-correspondence matches. Unification is realized by constructing a mixture model over the bipartite graph representing the correspondence match and by affecting optimization using the EM algorithm. According to our EM framework, the probabilities of structural correspondence gate contributions to the expected likelihood function used to estimate maximum likelihood transformation parameters. These gating probabilities measure the consistency of the matched neighborhoods in the graphs. The recovery of transformational geometry and hard correspondence matches are interleaved and are realized by applying coupled update operations to the expected log-likelihood function. In this way, the two processes bootstrap one another. This provides a means of rejecting structural outliers. We evaluate the technique on two real-world problems. The first involves the matching of different perspective views of 3.5-inch floppy discs. The second example is furnished by the matching of a digital map against aerial images that are subject to severe barrel distortion due to a line-scan sampling process. We complement these experiments with a sensitivity study based on synthetic data.

  16. Wang, YM, and Staib, LH, "Elastic model based non-rigid registration incorporating statistical shape information," MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION - MICCAI'98, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1496, pp. 1162-1173, 1998.

    Abstract:   This paper describes a new method of non-rigid registration using the combined power of elastic and statistical shape models. The transformations are constrained to be consistent with a physical model of elasticity to maintain smoothness and continuity. A Bayesian formulation, based on this model, on an intensity similarity measure, and on statistical shape information embedded in corresponding boundary points, is employed to find a more accurate and robust nan-rigid registration. A dense set of forces arises from the intensity similarity measure to accommodate complex anatomical details. A sparse set of forces constrains consistency with statistical shape models derived from a training set. A number of experiments were performed on both synthetic and real medical images of the brain and heart to evaluate the approach. It is shown that statistical boundary shape information significantly augments and improves elastic model based non-rigid registration.

  17. Fleute, M, and Lavallee, S, "Building a complete surface model from sparse data using statistical shape models: Application to computer assisted knee surgery," MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION - MICCAI'98, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1496, pp. 879-887, 1998.

    Abstract:   This paper addresses the problem of extrapolating very few range data to obtain a complete surface representation of an antomical structure. A new method that uses statistical shape models is proposed and its application to modeling a few points manually digitized on the femoral surface is detailed, in order to improve visualization of a system developped by TIMC laboratory for computer assisted anterior cruciate ligament (ACL) reconstruction. The model is built from a population of 11 femur specimen digitized manually. Data sets are registered together using an elastic registration method of Szeliski and Lavallee based on octree-splines. Principal Components Analysis (PCA) is performed on a field of surface deformation vectors. Fitting this statistical model to a few points is performed by non-linear optimisation. Results are presented for both simulated and real data. The method is very flexible and can be applied to any structures for which the shape is stable.

  18. Duta, N, and Sonka, M, "Segmentation and interpretation of MR brain images: An improved active shape model," IEEE TRANSACTIONS ON MEDICAL IMAGING, vol. 17, pp. 1049-1062, 1998.

    Abstract:   This paper reports a novel method for fully automated segmentation that is based on description of shape and its variation using point distribution models (PDM's). An improvement of the active shape procedure introduced by Cootes and Taylor to find new examples of previously learned shapes using PDM's is presented, The new method for segmentation and interpretation of deep neuroanatomic structures such as thalamus, putamen, ventricular system, etc. incorporates a priori knowledge about shapes of the neuroanatomic structures to provide their robust segmentation and labeling in magnetic resonance (MR) brain images. The method was trained in eight MR brain images and tested in 19 brain images by comparison to observer-defined independent standards. Neuroanatomic structures in all testing images were successfully identified. Computer-identified and observer-defined neuroanatomic structures agreed well, The average labeling error was 7% +/- 3%, Border positioning errors were quite small, with the average border positioning error of 0.8 +/- 0.1 pixels in 256 x 256 MR images, The presented method was specifically developed for segmentation of neuroanatomic structures in MR brain images. However, it is generally applicable to virtually any task involving deformable shape analysis.

  19. Grenander, U, and Miller, MI, "Computational anatomy: An emerging discipline," QUARTERLY OF APPLIED MATHEMATICS, vol. 56, pp. 617-694, 1998.

    Abstract:   This paper studies mathematical methods in the emerging new discipline of Computational Anatomy. Herein we formalize the Brown/Washington University model of anatomy following the global pattern theory introduced in [1, 2], in which anatomies are represented as deformable templates, collections of 0, 1, 2, 3-dimensional manifolds. Typical structure is carried by the template with the variabilities accommodated via the application of random transformations to the background manifolds. The anatomical model is a quadruple (Omega, H, I, P), the background space Omega = boolean ORalpha M-alpha of 0, 1, 2, 3-dimensional manifolds, the set of diffeomorphic transformations on the background space H : Omega <-> Omega, the space of idealized medical imagery I, and P the family of probability measures on H. The group of diffeomorphic transformations H is chosen to be rich enough so that a large family of shapes may be generated with the topologies of the template maintained. For normal anatomy one deformable template is studied, with (Omega, H, I) corresponding to a homogeneous space [3], in that it can be completely generated from one of its elements, I = HItemp,I-temp is an element of I. For disease, a family of templates boolean ORalphaItempalpha are introduced of perhaps varying dimensional transformation classes. The complete anatomy is a collection of homogeneous spaces I-total = boolean ORalpha(I-alpha,H-alpha). There are three principal components to computational anatomy studied herein. (1) Computation of large deformation maps: Given any two elements I, I' is an element of I in the same homogeneous anatomy (Omega, H, I), compute diffeomorphisms h from one anatomy to the other I (h-1)reversible arrow(h) I'. This is the principal method by which anatomical structures are understood, transferring the emphasis from the images I is an element of I to the structural transformations h is an element of H that generate them. (2) Computation of empirical probability laws: Given populations of anatomical imagery and diffeomorphisms between them I h(n-1)reversible arrow(hn) I-n, n = 1, . . . , N, generate probability laws P is an element of P on H that represent the anatomical variation reflected by the observed population of diffeomorphisms h(n), n = 1,..., N. (3) Inference and disease testing: Within the anatomy (Omega, H, I, P), perform Bayesian classification and testing for disease and anomaly.

 
1999

  1. Luo, B, and Hancock, ER, "Procrustes alignment with the EM algorithm," COMPUTER ANALYSIS OF IMAGES AND PATTERNS, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1689, pp. 623-631, 1999.

    Abstract:   This paper casts the problem of point-set alignment via Procrustes analysis into a maximum likelihood framework using the EM algorithm. The aim is to improve the robustness of the Procrustes alignment to noise and clutter. By constructing a Gaussian mixture model over the missing correspondences between individual points, we show how alignment can be realised by applying singular value decomposition to a weighted point correlation matrix. Moreover, by gauging the relational consistency of the assigned correspondence matches, we can edit the point sets to remove clutter. We illustrate the effectiveness of the method matching stereogram. We also provide a sensitivity analysis to demonstrate the operational advantages of the method.

  2. Chui, H, Rambo, J, Duncan, J, Schultz, R, and Rangarajan, A, "Registration of cortical anatomical structures via robust 3D point matching," INFORMATION PROCESSING IN MEDICAL IMAGING, PROCEEDINGS, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1613, pp. 168-181, 1999.

    Abstract:   Inter-subject non-rigid registration of cortical anatomical structures as seen in MR is a challenging problem. The variability of the sulcal and gyral patterns across patients makes the task of registration especially difficult regardless of whether voxel- or feature-based techniques are used. In this paper, we present an approach to matching sulcal point features interactively extracted by neuroanatomical experts. The robust point matching (RPM) algorithm is used to find the optimal affine transformations for matching sulcal points. A 3D linearly interpolated non-rigid warping is then generated for the original image volume. We present quantitative and visual comparisons between Talairach, mutual information-based volumetric matching and RPM on five subjects' MR images.

  3. Cootes, TF, Beeston, C, Edwards, GJ, and Taylor, CJ, "A unified framework for atlas matching using Active Appearance Models," INFORMATION PROCESSING IN MEDICAL IMAGING, PROCEEDINGS, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1613, pp. 322-333, 1999.

    Abstract:   We propose to use statistical models of shape and texture as deformable anatomical atlases. By training on sets of labelled examples these can represent both the mean structure and appearance of anatomy in medical images, and the allowable modes of deformation. Given enough training examples such a model should be able synthesise any image of normal anatomy. By finding the parameters which minimise the difference between the synthesised model image and the target image we can locate all the modelled structure. This potentially time consuming step can be solved rapidly using the Active Appearance Model (AAM). In this paper we describe the models and the AAM algorithm and demonstrate the approach on structures in MR brain cross-sections.

  4. Brett, AD, and Taylor, CJ, "A framework for automated landmark generation for automated 3D statistical model construction," INFORMATION PROCESSING IN MEDICAL IMAGING, PROCEEDINGS, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1613, pp. 376-381, 1999.

    Abstract:   We describe a method of pairwise 3D surface correspondence for the automated generation of landmarks on a set of examples from a class of shape. We show how the pairwise corresponder can be used in an extension of an existing framework for establishing dense correspondences between a set of training examples to build a 3D statistical model. The framework relies upon additional algorithms for the production of surface paths between vertices on a polyhedral mesh, and these are described. An example statistical model is shown for the left lateral ventricle of the brain.

  5. Velasco, HMG, Aligue, FJL, Orellana, CJG, Macias, MM, and Sotoca, MIA, "Application of ANN techniques to automated identification of bovine livestock," ENGINEERING APPLICATIONS OF BIO-INSPIRED ARTIFICIAL NEURAL NETWORKS, VOL II, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1607, pp. 422-431, 1999.

    Abstract:   In this work a classification system is presented that, taking lateral images of cattle as inputs, is able to identify the animals and classify them by breed into previously learnt classes. The system consists of two fundamental parts. In the first one, a deformable-model-based preprocessing of the image is made, in which the contour of the animal in the photograph is sought, extracted, and normalized. Next, a neural classifier is presented that, supplemented with a decision-maker at its output, makes the distribution into classes. In the last part, the results obtained in a real application of this methodology are presented.

  6. Germond, L, Dojat, M, Taylor, C, and Garbay, C, "A multi-agent system for MRI brain segmentation," ARTIFICIAL INTELLIGENCE IN MEDICINE, LECTURE NOTES IN ARTIFICIAL INTELLIGENCE, vol. 1620, pp. 423-432, 1999.

    Abstract:   In this paper we present an original approach for the segmentation of MRI brain images which is based on a cooperation between low-level and high-level approches. MRI brain images are very difficult to segment mainly due to the presence of inhomogeneities within tissues and also due to the high anatomical variability of the brain topology between individuals. In order to tackle these difficulties, we have developped a method whose characteristics are : (i) the use of a priori knowledge essentially anatomical and model-based; (ii) a multi-agent system (MAS) for low-level region segmentation; (iii) a cooperation between a priori knowledge and low-level segmentation to guide and constrain the segmentation processes. These characteristics allow to produce an automatic detection of the main tissues of the brain. The method is validated with phantoms and real images through comparisons with another widely used approach (SPM).

  7. Hutton, TJ, Hammond, P, and Davenport, JC, "Active shape models for customised prosthesis design," ARTIFICIAL INTELLIGENCE IN MEDICINE, LECTURE NOTES IN ARTIFICIAL INTELLIGENCE, vol. 1620, pp. 448-452, 1999.

    Abstract:   Images and computer graphics play an increasingly important role in the design and manufacture of medical prostheses and implants. Images provide guidance on optimal design in terms of location, preparation and the overall shape and configuration of subcomponents. Direct manipulation of a graphical representation provides a natural design environment. RaPiD is a CAD-like knowledge-based assistant for designing a dental prosthesis known as a removable partial denture (RPD). The expertise embedded in RaPiD encourages optimal subcomponent configuration, but currently supports only minor customisation. This paper describes how oral images and Active Shape Models (ASMs) are being used to address this limitation.

  8. Kelemen, A, Szekely, G, and Gerig, G, "Elastic model-based segmentation of 3-D neuroradiological data sets," IEEE TRANSACTIONS ON MEDICAL IMAGING, vol. 18, pp. 828-839, 1999.

    Abstract:   This paper presents a new technique for the automatic model- based segmentation of three-dimensional (3-D) objects from volumetric image data. The development closely follows the seminal work of Taylor and Cootes on active shape models, but is based on a hierarchical parametric object description rather than a point distribution model, The segmentation system includes both the building of statistical models and the automatic segmentation of new image data sets via a restricted elastic deformation of shape models, Geometric models are derived from a sample set of image data which have been segmented by experts, The surfaces of these binary objects are converted into parametric surface representations, which are normalized to get an invariant object-centered coordinate system, Surface representations are expanded into series of spherical harmonics which provide parametric descriptions of object shapes. It is shown that invariant object surface parametrization provides a good approximation to automatically determine object homology in terms of sets of corresponding sets of surface points. Gray-level information near object boundaries is represented by 1-D intensity profiles normal to the surface. Considering automatic segmentation of brain structures as our driving application, our choice of coordinates for object alignment was the well-accepted stereotactic coordinate system. Major variation of object shapes around the mean shape, also referred to as shape eigenmodes, are calculated in shape parameter space rather than the feature space of point coordinates, Segmentation makes use of the object shape statistics by restricting possible elastic deformations into the range of the training shapes, The mean shapes are initialized in a new data set by specifying the landmarks of the stereotactic coordinate system, The model elastically deforms, driven by the displacement forces across the object's surface, which are generated by matching local intensity profiles. Elastical deformations are limited by setting bounds for the maximum variations in eigenmode space. The technique has been applied to automatically segment left and right hippocampus, thalamus, putamen, and globus pallidus from volumetric magnetic resonance scans taken from schizophrenia studies. The results have been validated by comparison of automatic segmentation with the results obtained by interactive expert segmentation.

  9. Lotjonen, J, Magnin, IE, Nenonen, J, and Katila, T, "Reconstruction of 3-D geometry using 2-D profiles and a geometric prior model," IEEE TRANSACTIONS ON MEDICAL IMAGING, vol. 18, pp. 992-1002, 1999.

    Abstract:   A method has been developed to reconstruct three-dimensional (3-D) surfaces from two-dimensional (2-D) projection data. It is used to produce individualized boundary element models, consisting of thorax and lung surfaces, for electro- and magnetocardiographic inverse problems. Two orthogonal projections are utilized, A geometrical prior model, built: using segmented magnetic resonance images, is deformed according to profiles segmented from projection images. In our method, virtual X-ray images of the prior model are first constructed by simulating real X-ray imaging, The 2-D profiles of the model are segmented from the projections and elastically matched with the profiles segmented from patient data. The displacement vectors produced by the elastic 2-D matching are back projected onto the 3-D surface of the prior model. Finally, the model is deformed, using the back-projected vectors. Two different deformation methods are proposed, The accuracy of the method is validated by a simulation, The average reconstruction error of a thorax and lungs was 1.22 voxels, corresponding to about 5 mm.

  10. Chesnaud, C, Refregier, P, and Boulet, V, "Statistical region snake-based segmentation adapted to different physical noise models," IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, vol. 21, pp. 1145-1157, 1999.

    Abstract:   Algorithms for object segmentation are crucial in many image processing applications. During past years, active contour models (snakes) have been widely used for finding the contours of objects. This segmentation strategy is classically edge- based in the sense that the snake is driven to fit the maximum of an edge map of the scene. In this paper, we propose a region snake approach and we determine fast algorithms for the segmentation of an object in an image. The algorithms developed in a Maximum Likelihood approach are based on the calculation of the statistics of the inner and the outer regions (defined by the snake). It has thus been possible to develop optimal algorithms adapted to the random fields which describe the gray levels in the input image if we assume that their probability density function family are known. We demonstrate that this approach is still efficient when no boundary's edge exists in the image. We also show that one can obtain fast algorithms by transforming the summations over a region, for the calculation of the statistics, into summations along the boundary of the region. Finally, we will provide numerical simulation results for different physical situations in order to illustrate the efficiency of this approach.

  11. Stammberger, T, Eckstein, F, Michaelis, M, Englmeier, KH, and Reiser, M, "Interobserver reproducibility of quantitative cartilage measurements: Comparison of B-spline snakes and manual segmentation," MAGNETIC RESONANCE IMAGING, vol. 17, pp. 1033-1042, 1999.

    Abstract:   The objective of this work was to develop a segmentation technique for thickness measurements of the articular cartilage in MR images and to assess the interobserver reproducibility of the method in comparison with manual segmentation. The algorithm is based on a B-spline snakes approach and is able to delineate the cartilage boundaries in real time and with minimal user interaction. The interobserver reproducibility of the method, ranging from 3.3 to 13.6% for various section orientations and joint surfaces, proved to be significantly superior to manual segmentation. (C) 1999 Elsevier Science Inc.

  12. Craw, I, Costen, N, Kato, T, and Akamatsu, S, "How should we represent faces for automatic recognition?," IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, vol. 21, pp. 725-736, 1999.

    Abstract:   We describe results obtained from a testbed used to investigate different codings for automatic face recognition. An eigenface coding of shape-free faces using manually located landmarks was more effective than the corresponding coding of correctly shaped faces. Configuration also proved an effective method of recognition, with rankings given to incorrect matches relatively uncorrelated with those from shape-free faces. Both sets of information combine to improve significantly the performance of either system. The addition of a system, which directly correlated the intensity values of shape-free images, also significantly increased recognition, suggesting extra information was still available. The recognition advantage for shape-free faces reflected and depended upon high-quality representation of the natural facial variation via a disjoint ensemble of shape-free faces; if the ensemble was comprised of nonfaces, a shape-free disadvantage was induced. Manipulation within the shape-free coding to emphasize distinctive features of the faces, by caricaturing, allowed further increases in performance; this effect was only noticeable when the independent shape-free and configuration coding was used. Taken together, these results strongly support the suggestion that faces should be considered as lying in a high-dimensional manifold, which is locally linearly approximated by these shapes and textures, possibly with a separate system for local features. Principal Components Analysis is then seen as a convenient tool in this local approximation.

  13. Egmont-Petersen, M, and Arts, T, "Recognition of radiopaque markers in X-ray images using a neural network as nonlinear filter," PATTERN RECOGNITION LETTERS, vol. 20, pp. 521-533, 1999.

    Abstract:   Neural networks are developed to recognise radiopaque markers in biplane cineangiographic video-images with a background composed of different objects. Our connectionist approach is compared theoretically as well as experimentally with linear template matching. Theoretically, neural networks are likely to give better recognition results as they can implement nonlinear discriminants. Experiments confirm that the networks result in better marker recognitions than template matching. (C) 1999 Elsevier Science B.V. All rights reserved.

  14. Cootes, TF, and Taylor, CJ, "A mixture model for representing shape variation," IMAGE AND VISION COMPUTING, vol. 17, pp. 567-573, 1999.

    Abstract:   The shape variation displayed by a class of objects can be represented as probability density function, allowing us to determine plausible and implausible examples of the class. Given a training set of example shapes we can align them into a common co-ordinate frame and use kernel-based density estimation techniques to represent this distribution. Such an estimate is complex and expensive, so we generate a simpler approximation using a mixture of gaussians. We show how to calculate the distribution, and how it can be used in image search to locate examples of the modelled object in new images. (C) 1999 Elsevier Science B.V. All rights reserved.

  15. Brett, AD, Hill, A, and Taylor, CJ, "A method of 3D surface correspondence and interpolation for merging shape examples," IMAGE AND VISION COMPUTING, vol. 17, pp. 635-642, 1999.

    Abstract:   A method for corresponding the triangulated mesh surface representations of two shapes is presented. It comprises a method of polyhedral mesh decimation and a symmetric version of the iterative Closest Point (ICP) algorithm. The method produces a matching pair of sparse polyhedral approximations, one for each shape surface, using a global Euclidean measure of similarity. A method of surface patch parameterisation is presented which uses minimal paths constructed across the surface of a polyhedron. We describe the use of this patch parameterisation in the interpolation of surfaces for the construction of a merged mean shape with a densely triangulated surface. Results are presented for the production of a binary tree of merged biological shapes which may be used as a basis for the automated landmarking of a set of examples. (C) 1999 Elsevier Science B.V. All rights reserved.

  16. Lelieveldt, BPF, van der Geest, RJ, Rezaee, MR, Bosch, JG, and Reiber, JHC, "Anatomical model matching with fuzzy implicit surfaces for segmentation of thoracic volume scans," IEEE TRANSACTIONS ON MEDICAL IMAGING, vol. 18, pp. 218-230, 1999.

    Abstract:   Many segmentation methods for thoracic volume data require manual input in the form of a seed point, initial contour, volume of interest etc. The aim of the work presented here is to further automate this segmentation initialization step. In this paper an anatomical modeling and matching method is proposed to coarsely segment thoracic volume data into anatomically labeled regions. An anatomical model of the thorax is constructed in two steps: 1) individual organs are modeled with blended fuzzy implicit surfaces and 2) the single organ models are grouped into a tree structure with a solid modeling technique named constructive solid geometry (CSG), The combination of CSG with fuzzy implicit surfaces allows a hierarchical scene description by means of a boundary model, which characterizes the scene volume as a boundary potential function. From this boundary potential, an energy function is defined which is minimal when the model is registered to the tissue-air transitions in thoracic magnetic resonance imaging (MRI) data. This allows automatic registration in three steps: feature detection, initial positioning and energy minimization, The model matching has been validated in phantom simulations and on 15 clinical thoracic volume scans from different subjects. In 13 of these sets the matching method accurately partitioned the image volumes into a set of volumes of interest for the heart, lungs, cardiac ventricles, and thorax outlines. The method is applicable to segmentation of various types of thoracic MR-images, provided that a large part of the thorax is contained in the image volume.

  17. Chalmond, B, and Girard, SC, "Nonlinear modeling of scattered multivariate data and its application to shape change," IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, vol. 21, pp. 422-432, 1999.

    Abstract:   We are given a set of points in a space of high dimension. For instance, this set may represent many visual appearances of an object, a face, or a hand. We address the problem of approximating this set by a manifold in order to have a compact representation of the object appearance. When the scattering of this set is approximately an ellipsoid, then the problem has a well-known solution given by Principal Components Analysis (PCA). However, in some situations like object displacement learning or face learning, this linear technique may be ill- adapted and nonlinear approximation has to be introduced. The method we propose can be seen as a Non Linear PCA (NLPCA), the main difficulty being that the data are not ordered. We propose an index which favors the choice of axes preserving the closest point neighborhoods. These axes determine an order for visiting all the points when smoothing. Finally, a new criterion, called "generalization error," is introduced to determine the smoothing rate, that is, the knot number for the spline fitting. Experimental results conclude this paper: The method is tested on artificial data and on two data bases used in visual learning.

  18. Cosio, FA, and Davies, BL, "Automated prostate recognition: a key process for clinically effective robotic prostatectomy," MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, vol. 37, pp. 236-243, 1999.

    Abstract:   Clinical trials of PROBOT, a robotic system for prostate surgery, have shown that robotic surgery of soft tissue can be successful. Monitoring of the progress of the resection has shown to be a necessary feature of an effective robotic system for prostate surgery. It should provide the surgeon with a reliable method of assessing the cavity during resection. An automatic system for intraoperative monitoring of the progress of the resection during robotic prostatectomy consists of two subsystems: real-time intraoperative imaging of the prostate and automatic identification of the contour of the gland on each image. The development of a fully automatic scheme for prostate recognition on transurethral ultrasound scans is reported. A genetic algorithm has been developed to automatically adjust a model of the prostate boundary until an optimum fit to the prostate in a given image is obtained. An analysis of ifs performance on 22 different ultrasound images showed an average error of 6.21 mm. Use of a genetic algorithm and a constrained prostate model have shown to be a robust way to automatically identify the prostate in ultrasound images. The scheme is able to produce approximate prostate boundaries, without any human intervention, on ultrasound scans of varying quality. In addition to soft tissue robotic surgery, the generic algorithm technique is also applicable to a wide range of computer assisted surgical techniques.

  19. Gavrila, DM, "The visual analysis of human movement: A survey," COMPUTER VISION AND IMAGE UNDERSTANDING, vol. 73, pp. 82-98, 1999.

    Abstract:   The ability to recognize humans and their activities by vision is key for a machine to interact intelligently and effortlessly with a human-inhabited environment. Because of many potentially important applications, "looking at people" is currently one of the most active application domains in computer vision. This survey identifies a number of promising applications and provides an overview of recent developments in this domain. The scope of this survey is limited to work on whole-body or hand motion; it does not include work on human faces. The emphasis is on discussing the various methodologies; they are grouped in 2-D approaches with or without explicit shape models and 3-D approaches. Where appropriate, systems are reviewed. We conclude with some thoughts about future directions. (C) 1999 Academic Press.

  20. Behiels, G, Vandermeulen, D, Maes, F, Suetens, P, and Dewaele, P, "Active Shape Model-based segmentation of digital X-ray images," MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, MICCAI'99, PROCEEDINGS, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1679, pp. 128-137, 1999.

    Abstract:   We propose an improved search procedure for Active Shape Model (ASM) based delineation of anatomical structures in digital X- ray images. Whereas the original ASM search method (1) iteratively improves the current estimate of the location of boundary points by a limited least squares adjustment of the pose and shape parameters, our method additionally requires the subsequent changes in shape during the search to be smooth, which is achieved by using a minimum cost path search algorithm. We, compare the two methods oil a database of more than 400 manual segmentations of digital X-ray images of the femur, humerus and calcaneus. We evaluate the accuracy and robustness of both methods using a cross-validation procedure.

  21. Fleute, M, and Lavallee, S, "Nonrigid 3-D/2-D registration of images using statistical models," MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, MICCAI'99, PROCEEDINGS, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1679, pp. 138-147, 1999.

    Abstract:   This paper presents a new algorithm for reconstruction of 3D shapes using a few x-ray views and a statistical model. In many applications of surgery such as orthopedics, it is desirable to define a surgical planning oil 3-D images and then to execute the plan using standard registration techniques and image- guided surgery systems. But the cost, time and x-ray dose associated with standard pre-operative Computed Tomography makes it difficult to use this methodology for rather standard interventions. Instead, we propose to use. a few x-ray images generated from a C-Arm and to build the 3-D shape of the patient bones or organs intra-operatively, by deforming a statistical 3-D model to the contours segmented oil the x-ray views. In this paper, we concentrate on the application of our method to bone reconstruction. The algorithm starts from segmented contours of the bone oil the x-ray images and ail initial estimate of the pose of the 3-D model in the common coordinate system of the set of x-ray projections. The statistical model is made of a few principal modes that are sufficient to represent the normal anatomy. Those modes are built by using a generalization of the Cootes and Taylor method to 3-D surface models, previously published in MICCAI'98 by the authors. Fitting the model to the contours is achieved by using a generalization of the Iterative Closest Point Algorithm to nonrigid 3D/2D registration. For pathological shapes, the statistical model is not valid and subsequent local refinement is necessary. First results are presented for a 3-D statistical model of the distal part of the femur.

  22. Lotjonen, J, Magnin, IE, Reinhardt, L, Nenonen, J, and Katila, T, "Automatic reconstruction of 3D geometry using projections and a geometric prior model," MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, MICCAI'99, PROCEEDINGS, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1679, pp. 192-201, 1999.

    Abstract:   A method has been developed to reconstruct 3D surfaces from two orthogonal X-ray projections. A 3D geometrical prior model, composed of triangulated surfaces, is deformed according to contours segmented from projection images. The contours are segmented by a new method based on free-form deformation. First, virtual X-ray images of the prior model are constructed by simulating real X-ray imaging, Thereafter, the contours segmented from the virtual projections are elastically matched with patient data. Next, the produced 2D vectors are back- projected onto the surface of the prior model and the prior model is deformed using the back-projected vectors with shape- based interpolation. The accuracy of the method is validated by it data set, containing 20 cases. The method is applied to reconstruct thorax and lung surfaces. The average matching error is about 1.2 voxels, corresponding to 5 mm.

  23. Suri, JS, Haralick, RM, and Sheehan, FH, "Linear vs. quadratic optimization algorithms for bias correction of left ventricle chamber boundaries in low contrast projection ventriculograms produced from xray cardiac catheterization procedure," COMPUTER ANALYSIS OF IMAGES AND PATTERNS, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1689, pp. 108-117, 1999.

    Abstract:   Cardiac catheterization procedure produces ventriculogram which have very low contrast in the apical, anterior and inferior zones of the left ventricle (LV). Pixel-based classifiers operating on these images produce boundaries which have systematic positional and orientation bias and have a mean error of about 10.5 mm. Using the IV convex information, comprising of the apex and the aortic valve plane, this pa, per presents a comparison of the linear and quadratic optimization algorithms to remove these biases. These algorithms axe named after the way the coefficients are computed: the identical coefficient and the independent coefficient. Using the polyline metric, we show that the quadratic optimization is better than the linear optimization. We also show that the independent coefficient method performs better than the identical coefficient when the training data is large. The overall mean system error was 2.49 mm while the goal set by the cardiologist was 2.5 mm.

 
2000

  1. Hill, A, Taylor, CJ, and Brett, AD, "A framework for automatic landmark identification using a new method of nonrigid correspondence," IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, vol. 22, pp. 241-251, 2000.

    Abstract:   A framework for automatic landmark indentification is presented based on an algorithm for corresponding the boundaries of two shapes. The auto-landmarking framework employs a binary tree of corresponded pairs of shapes to generate landmarks automatically on each of a set of example shapes. The landmarks are used to train statistical shape models known as Point Distribution Models. The correspondence algorithm locates a matching pair of sparse polygonal approximations, one for each of a pair of boundaries by minimizing a cost function, using a greedy algorithm. The cost function expresses the dissimilarity in both the shape and representation error (with respect to the defining boundary) of the sparse polygons. Results are presented for three classes of shape which exhibit various types of nonrigid deformation.

  2. Bronkorsta, PJH, Reinders, MJT, Hendriks, EA, Grimbergen, J, Heethaar, RM, and Brankenhoff, GJ, "On-line detection of red blood cell shape using deformable templates," PATTERN RECOGNITION LETTERS, vol. 21, pp. 413-424, 2000.

    Abstract:   For the purpose of automating a clinical diagnostic apparatus to quantify the deformability of human red blood cells, we present an automated image analysis procedure for on-line detection of the cell shape based upon the method of parametric deformable templates. (C) 2000 Elsevier Science B.V. All rights reserved.

  3. Brett, AD, and Taylor, CJ, "A method of automated landmark generation for automated 3D PDM construction," IMAGE AND VISION COMPUTING, vol. 18, pp. 739-748, 2000.

    Abstract:   A previous publication has described a method of pairwise three-dimensional (3D) surface correspondence for the automated generation of landmarks on a set of examples from a class of shape (A.D. Brett, A. Hill, C.J. Taylor, A method of 3D surface correspondence for automated landmark generation, in: 8th British Machine Vision Conference, Essex, England, September 1997, pp 709-718). In this paper we describe a set of improved algorithms which give more accurate and more robust results. We show how the pairwise corresponder can be used in an extension of an existing framework for establishing dense correspondences between a set of training examples (A. Hill, A.D. Brett, C.J. Taylor, Automatic landmark identification using a new method of non-rigid correspondence, in: J. Duncan, G. Gindi, (Eds.), 15th Conference on Information Processing in Medical Imaging, Poulteney, VT, Springer, Berlin, 1997, pp. 483-488) to build a 3D Point Distribution Model. The framework relies upon additional algorithms for the production of surface paths between vertices on a polyhedral mesh, and these are described. Example statistical models are shown for both smooth synthetic data and the left lateral ventricle of the brain, a complex biological shape which demonstrates considerable variation between individuals. (C) 2000 Elsevier Science B.V. All rights reserved.

  4. Bowden, R, Mitchell, TA, and Sarhadi, M, "Non-linear statistical models for the 3D reconstruction of human pose and motion from monocular image sequences," IMAGE AND VISION COMPUTING, vol. 18, pp. 729-737, 2000.

    Abstract:   This paper presents a model based approach to human body tracking in which the 2D silhouette of a moving human and the corresponding 3D skeletal structure are encapsulated within a non-linear point distribution model. This statistical model allows a direct mapping to be achieved between the external boundary of a human and the anatomical position. It is shown how this information, along with the position of landmark features such as the hands and head can be used to reconstruct information about the pose and structure of the human body from a monocular view of a scene. (C) 2000 Elsevier Science B.V. All rights reserved.

  5. Egmont-Petersen, M, Schreiner, U, Tromp, SC, Lehmann, TM, Slaaf, DW, and Arts, T, "Detection of leukocytes in contact with the vessel wall from in vivo microscope recordings using a neural network," IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, vol. 47, pp. 941-951, 2000.

    Abstract:   Leukocytes play an important role in the host defense as they may travel from the blood stream into the tissue in reacting to inflammatory stimuli. The leukocyte-vessel wall interactions are studied in post capillary vessels by intraviral video microscopy during in vivo animal experiments. Sequences of video images are obtained and digitized with a frame grabber. A method for automatic detection and characterization of leukocytes in the video images is developed. Individual leukocytes are detected using a neural network that is trained with synthetic leukocyte images generated using a novel stochastic model. This model makes it feasible to generate images of leukocytes with different shapes and sizes under various lighting conditions. Experiments indicate that neural networks trained with the synthetic leukocyte images perform better than networks trained with images of manually detected leukocytes. The best performing neural network trained with synthetic leukocyte images resulted in an 18% larger area under the ROC curve than the best performing neural network trained with manually detected leukocytes.

  6. Zhong, Y, Jain, AK, and Dubuisson-Jolly, MP, "Object tracking using deformable templates," IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, vol. 22, pp. 544-549, 2000.

    Abstract:   We propose a novel method for object tracking using prototype- based deformable template models. To track an object in an image sequence, we use a criterion which combines two terms: the frame-to-frame deviations of the object shape and the fidelity of the modeled shape to the Input image. The deformable template model utilizes the prior shape information which is extracted from the previous frames along with a systematic shape deformation scheme to model the object shape in a new frame. The following image information Is used in the tracking process: 1) edge and gradient information: the object boundary consists of pixels with large image gradient, 2) region consistency: the same object region possesses consistent color and texture throughout the sequence, and 3) interframe motion: the boundary of a moving object is characterized by large interframe motion. The tracking proceeds by optimizing an objective function which combines both the shape deformation and the fidelity of the modeled shape to the current image (in terms of gradient, texture, and interframe motion). The inherent structure in the deformable template. together with region, motion, and image gradient cues. makes the proposed algorithm relatively insensitive to the adverse effects of weak image features and moderate amounts of occlusion.

  7. Huang, CL, Wu, MS, and Jeng, SH, "Gesture recognition using the multi-PDM method and Hidden Markov Model," IMAGE AND VISION COMPUTING, vol. 18, pp. 865-879, 2000.

    Abstract:   This paper introduces a multi-Principal-Distribution-Model (PDM) method and Hidden Markov Model (I-m IM) for gesture recognition. To track the hand-shape, it uses the PDM model which is built by learning p