Документ взят из кэша поисковой машины. Адрес оригинального документа : http://imaging.cs.msu.ru/pub/2011.VCIP.Krylov_Sorokin.KeyP.en.pdf
Дата изменения: Thu Nov 17 11:19:55 2011
Дата индексирования: Mon Oct 1 19:46:14 2012
Кодировка:
Gauss-Laguerre Keypoints Descriptors for Color Images
Andrey S. Krylov and Dmitry V. Sorokin
Laboratory of Mathematical Methods of Image Processing, Faculty of Computational Mathematics and Cybernetics, Lomonosov Moscow State University, http://imaging.cs.msu.ru Email: {kryl, dsorokin}@cs.msu.ru
Abstract--Keypoints descriptors construction algorithm for color images is introduced. The algorithm is based on multiscale Gauss-Laguerre circular harmonic functions expansions of each image color component. The color components are obtained from color invariance model derived from Kubelka-Munk photometric reflectance theory. The matching results of proposed algorithm confirm the importance of color information for keypoints descriptors construction.

complex space and defined as: (, ; ) =


(2 / )



,

I. I NTRO DUCTION The local image feature extraction is an initial step of many complex image processing tasks. The methods of keypoints localization and parametrization are widely presented in literature [1]. There are many approaches to the keypoints detection problem [2], [3], [4]. The problem of keypoints descriptor construction is also widely presented in literature [1], [2], [3]. The target property of the keypoints descriptors is the invariance to a class of projective and photometric transformations. The majority of keypoints extraction methods are based on the analysis of image intensity while color information is also very important. The using of color characteristics allows to achieve better matching rate in the case of photometric transformations caused by variations in illumination and shading. Color-based descriptors construction methods are also presented in literature [5]. The most popular approach for keypoints descriptors construction uses Gaussian filter and its derivatives like in SIFT method [3]. Papers [6], [7] use colorbased methods derived from SIFT approach. In [8] the keypoints extraction method based on GaussLaguerre circular harmonic functions was proposed. The method was constructed for grayscale images. The comparison of its matching rate with SIFT method was presented in [9]. In this paper we present its extension to color images on the basis of color invariants proposed in [10]. The color invariants are derived from the Kubelka-Munk photometric reflectance model. The comparison of color and grayscale Gauss-Laguerre keypoints descriptors is demonstrated in the results section. II. G AU S S -L AGUERRE K EYPOINTS E XTRACTION A. Gauss-Laguerre Circular Harmonic Functions Gauss-Laguerre circular harmonic functions (CHFs) form a family of orthonormal and polar separable functions in

and referenced by integers = 0, 1, ... (referred by radial order) and = 0, ±1, ±2... (referred by angular order). The radial profiles of Gauss-Laguerre CHFs are Laguerre functions: 1 () = /2 -/2 () , !( + +1) where () are Laguerre polynomials [9]. Being linearly related to the 2D Hermite functions [11] that are the eigenfunctions of Fourier transformation, GaussLaguerre CHFs keep their shape under Fourier transformation. This family of functions is also self-steerable, i.e. they can be rotated by the angle using multiplication by the factor . Due to the orthogonality and completeness of family in 2 the observed image (, ) can be expanded in the neighborhood of the analysis point 0 ,0 for fixed in Cartesian system as: ( + 0 , + 0 ) = where
=- =0



,

(0 ,0 ; ) (, ; ) ,



,

2 + 2 , = arctan( ) , (0 ,0 ; ) = ( + 0 , + 0 ) (, ; ) . =
- -

The scale parameter is denoted as and corresponds to the size of the support region of the given point 0 ,0 . The keypoints detection and descriptors construction is performed on the basis of the , coefficients analysis. Complete description of the keypoints detection algorithm and local descriptors construction process can be found in [8], [9]. In this paper a brief review of these algorithms is presented. B. Keypoints Detection Due to the specific property of Gauss-Laguerre CHFs as natural detectors of different image features (like crosses and corners) [4] the keypoints detection is based on the analysis of


3,0 and 4,0 coefficients at different scales. For this purpose the image scalogram is defined: (, ; ) =
3, 0

(, ; )2 +

4, 0

(, ; )2 , { } .

It is constructed on a set of different scales { }. The set of keypoints = (, ; ) is defined as local maxima of the scalogram higher than a selected threshold. C. Keypoints Descriptor Construction Each keypoint is associated with a local descriptor. The descriptor is a complex-valued vector consisted of , for given and range at the keypoint reference scale and 2max scales neighbor to it. The vector is defined as: (, , ) =
, ,

E. Color Extension of Keypoints Descriptors In [8] this Gauss-Laguerre keypoints detection and descriptors construction approach is presented for grayscale images. In this paper the descriptors construction algorithm is extended to color images while the keypoints detection algorithm works with only intensity data. The color Gauss-Laguerre descriptors are constructed concatenating descriptors that were obtained for each color component independently. The procedure of choosing the color components for descriptor construction is described in the next section. III. C OLOR I NVARIANCE M ODEL The color invariance model proposed in [10] is used for our approach to keypoints descriptors construction for color images. This model is based on Kubelka-Munk theory of the reflection spectrum of colored bodies. The main equation of this model is (, - ) = (, - )(1- (- ))2 (, - )+(, - ) (- ) ,


(, ; ) (, ; )
max

- -





,
max

(1) ,

= 0, ..., max , = 1, ...,

, = -

max

, ...,

where is the j-th scale following if > 0, or preceding if < 0 in the { } set of the scales. The normalization makes descriptor stable to the contrast changes. The phase shift - is used to make the descriptors stable to the keypoint pattern orientation, where = arg(1,0 (, ; )) . D. Acceleration of Descriptor Construction The descriptors construction process can be accelerated using 2D Hermite functions image expansion. The details of the acceleration procedure is described in [9]. The main idea of acceleration is presented in this paper. The 2D Hermite functions , (, ; ) form the complete orthonormal system in 2 space and can be defined as: ( ) ( ) 1 , (, ; ) = , 2 1 - () = 2 () , ! 2

(3) where is the wavelength and - is denotes the position of imaging plane. The illumination spectrum is denoted as (, - ) and (- ) is Fresnel reflectance at - . The ma - ) and the reflected terial reflectivity is denoted as (, spectrum in the viewing direction is represented by (, - ). This model is defined for non transparent objects. Another requirement is high enough resolution of imaging to consider image pixels locally planar. Assume that the scene is uniformly illuminated with equal spectral energy. Hence the source illumination (, - ) is constant over the wavelengths and varied over the position and can be denoted as (- ). So the Eq. (3) is transformed: ( - ) = (- ) (1 - (- ))2 (, - )+ (- )) . (,


(4) Differentiating Eq. (4) with respect to twice results: - (, ) where = 0, 1, 2... and () are Hermite polynomials [9]. (5) = (- )(1 - (- ))2 The given image (, ) can be expanded in the analysis point 0 ,0 for fixed into a set of 2D Hermite functions and - 2 (, ) . , : = (- )(1 - (- ))2 (6) 2 Dividing Eq. (5) by Eq. (6) one can get: ( + 0 , + 0 ) = , (0 ,0 ; ), (, ; ) , (, - ) 2 (, - ) =0 =0 = / = 2 where that depends only on the material reflectivity (, - ). Thus component is independent of viewpoint, surface orientation, (+0 , +0 ), (, ; ) . , (0 ,0 ; ) = illumination direction, intensity and Fresnel reflectance coef- - (2) ficient. In the assumption of Eq. (4) let us consider the case of As 2D Hermite functions , are linearly related to the Gauss-Laguerre CHFs [8], [11], the image expansions matte and dull surfaces. So the Fresnel reflectance coefficient - coefficients , and , are also linearly related. The ( ) 0 and Eq. (4) is reduced to: separability of , allows to reduce the calculation of the (, - ) = (- ) (, - ) . (7) double integral in the right part of (2) to the repeated integral. This accelerates the computation up to 3.5 times [9]. The So the partial differences of Eq. (7) with respect to are: - - 2 further acceleration in 1.5 times can be achieved using fast (, ) , = (- ) (, ) . = (- ) Hermite projection method [9], [12]. 2


Thus one can get two more components = / and = / which depend only on material reflectivity (, - ) and are independent of viewpoint, surface orientation, illumination direction and intensity. There are some more invariants can be derived from Kubelka-Munk model with some other assumptions by spatial differentiation. The complete set of the invariants and their properties can be found in [10]. ^^ ^ Spectral differential quotients (, , ) is used as (, , ) components. Spectral differential quotients ^^ ^ (, , ) are obtained from known RGB components us^ ^^ ing the Gaussian color model. The , and components of the Gaussian color model well approximate the CIE 1964 XYZ basis linearly for 0 = 520nm and = 55nm [10]. Meanwhile XYZ basis is linearly related to RGB components. ^ ^^ Thus one can obtain the , and components from RGB image components using the following formula [10]: ^ 0.06 0.63 0.27 = 0.3 0.04 -0.35 . ^ ^ 0.34 -0.6 0.17 All the invariants in [10] that requires spatial differentiation of , , , , and are obtained using convolution with Gaussian derivatives instead of differentiating. The Gauss-Laguerre approach to descriptors construction [8], [9] is based on image extention into a set of Gauss-Laguerre CHFs. But Gauss-Laguerre CHFs are linearly related to the 2D Hermite functions that are the higher order derivatives of Gaussian. Hence using the , , , , and color invariants as the initial image in Gauss-Laguerre method allows to obtain higher order invariants as the components of keypoints descriptors. IV. R ESULTS The precision-recall characteristics of proposed algorithm were evaluated on the images from the dataset freely available on the web, which provides the images and the relating homographies sequences (http://www.robots.ox.ac.uk/ vgg/research/affine/). The keypoints detector parameters were set for each image pair independently to get about 1000 keypoints per image. Only the color images from this dataset were taken. To demonstrate the performance of the proposed descriptors in the case of illumination changes the images from ALOI database [13] were taken. The keypoints detection was performed using just the intensity information. There is a possibility to improve the detection process using color information but in general case the improvement looks to have slight influence on the matching results. Although in some special cases of imaging it looks possible to get significant improvement in matching. So this is the question of future investigation. The comparison of different combinations of color invariants for descriptor construction was performed. The values of parameters max , max and max were also varied to achieve the optimal balance between the resulting vector size and the matching performance. The intensity component

was included into every combination as the most informative component in the general case. The grayscale Gauss-Laguerre descriptors were compared with SIFT descriptors for the values max = 5, max = 5 and max = 2 (denoted as Gauss-Laguerre 5,5,2). It gives the 150 elements in the GaussLaguerre descriptor (see Eq. (1)). Hence the same values were used as initial for intensity channel. Different combinations of color invariants for descriptors construction were tested: (, , ), (, ), (, , ), (, , , ) and (, , , ). For , , , , the max and max were varied from 2 to 5, max was varied from 0 to 2. The most appropriate results were obtained using the combination (, , ). The component does not justify the hopes of being very informative. Moreover the test showed that precision-recall curves changed insignificantly while varying parameters max , max and max for channel. So it is reasonable to use just 1 or 2 coefficients of channel as additional to the main components of descriptors to increase precision slightly in the case of very high recall requirement. The (, , ) combination with max = 5, max = 5 and max = 2 for all components (denoted as color GaussLaguerre 5,5,2-5,5,2) gives the closest curve to the grayscale descriptors results for the low values recall and begins to overcome grayscale results from 0.42 recall value. But in this case the initial size of the descriptor triples. To hold the dimension of the descriptors within appropriate range of about 150 elements the values of max , max parameters for component were reduced. Tests showed that reduction of scaling parameter max leads to decrease in matching performance so only max and max were varied. The best matching performance was achieved using max = 4, max = 4, max = 2 for component (100 elements) and max = 2, max = 2, max = 2 for , components (2 в 30 elements). This combination is denoted as color GaussLaguerre 4,4,2-2,2,2. Thus the overall size of the descriptors is 160 elements that is close to the grayscale analog while the precision-recall characteristic of those descriptors is similar to 450 elements descriptors mentioned above. Even more, this type of descriptors allows to achieve greater value of recall overall. The comparison of precision-recall graphs of defined above keypoints descriptors is illustrated in Fig. 1. The precision-recall graphs for proposed descriptors with fast Hermite projection method acceleration [12], [9] are illustrated in Fig. 2. In Fig. 3 the example of keypoints matching for a pair of images from ALOI database [13] is shown. The overall number of matches for grayscale descriptors is 41 where 4 matches are incorrect. The overall number of matches for color descriptors is 77 where 3 matches are incorrect. The parameters of descriptors matching were the same for both descriptors types.


1 0.95 0.9 Precision 0.85 0.8 0.75 0.7 0.65 0 Color Gauss-Laguerre 5,5,2-5,5,2 Color Gauss-Laguerre 4,4,2-2,2,2 Gauss-Laguerre 5,5,2 SIFT 0.2 0.4 Recall 0.6 0.8 1

Fig. 1.

Precision-recall graphs for graf1-graf2 image pair.

1 0.95 0.9 Precision 0.85

Fig. 3. Example of keypoints matching in the case of illumination changes. The upper image pair was produced using Gauss-Laguerre 5,5,2. The lower image pair was performed using color Gauss-Laguerre 4,4,2-2,2,2.

R EFERENCES
0.8 0.75 0.7 0.65 0 Color Gauss-Laguerre 5,5,2-5,5,2 Color Gauss-Laguerre 4,4,2-2,2,2 Fast Color Gauss-Laguerre 5,5,2-5,5,2 Fast Color Gauss-Laguerre 4,4,2-2,2,2 0.2 0.4 Recall 0.6 0.8 1

Fig. 2. Precision-recall graphs for graf1-graf2 image pair. Comparison of color Gauss-Laguerre descriptors with fast color Gauss-Laguerre descriptors.

V. C ONCLUSION Keypoints descriptors construction algorithm for color images was introduced. The algorithm is based on GaussLaguerre CHFs expansion approach [4], [8], [9]. The matching results of proposed algorithm confirm the importance of color information in matching procedure. The presented comparison with Gauss-Laguerre grayscale descriptors and SIFT algorithm looks promising. The using of color information in keypoints detection process and a comparison with existing descriptors for color images is the question of future investigation. ACKNOWLEDGMENT The work was supported by RFBR grant 09-07-92000-HHC.

[1] K. Mikolajczyk and C. Schmid, "A performance evaluation of local descriptors," IEEE Trans. on Pattern Anal. and Mach. Intell., vol. 27, no. 10, pp. 1615­1630, 2005. [2] C. Harris and M. Stephens, "A Combined Corner and Edge Detection," in Proc. of The Fourth Alvey Vision Conf., 1988, pp. 147­151. [3] D. G. Lowe, "Distinctive Image Features from Scale-Invariant Keypoints," Int. J. of Comp. Vis., vol. 60, no. 2, pp. 91­110, 2004. [4] L. Sorgi, N. Cimminiello, and A. Neri, "Keypoints Selection in the Gauss Laguerre Transformed Domain," in Proc. of BMVC06, 2006, pp. 539­547. [5] G. J. Burghouts and J.-M. Geusebroek, "Performance evaluation of local colour invariants," Comp. Vis. and Im. Understanding, vol. 113, no. 1, pp. 48­62, 2009. [6] A. E. Abdel-Hakim and A. A. Farag, "CSIFT: A SIFT Descriptor with Color Invariant Characteristics," in Comp. Vis. and Pat. Rec., 2006 IEEE Comp. Soc. Conf. on, vol. 2, 2006, pp. 1978­1983. ~ [7] A.Bosch,A.Zisserman, andX.Munoz, "Scene classification via pLSA," in Proc. of ECCV, 2006, pp. 517­530. [8] D. V. Sorokin and A. S. Krylov, "Fast Gauss-Laguerre Keypoints Extraction using 2D Hermite Functions," in Proc. of PRIA-10-2010, 2010, pp. 339­342. [9] D. V. Sorokin, M. M. Mizotin, and A. S. Krylov, "Gauss-Laguerre Keypoints Extraction Using Fast Hermite Projection Method," in Im. Anal. and Rec., ser. LNCS, A. Campilho and M. Kamel, Eds. Springer, 2011, vol. 6753, pp. 284­293. [10] J.-M. Geusebroek, R. van den Boomgaard, A. W. M. Smeulders, and H. Geerts, "Color Invariance," IEEE Trans. Pattern Anal. Mach. Intell., vol. 23, no. 12, pp. 1338­1350, 2001. [11] E. Zauderer, "Complex argument hermite-gaussian and laguerregaussian beams," J. Opt. Soc. Am. A, vol. 3, no. 4, pp. 465­469, 1986. [12] A. Krylov and D. Korchagin, "Fast Hermite Projection Method," in Im. Anal. and Rec., ser. LNCS, A. Campilho and M. Kamel, Eds. Springer, 2006, vol. 4141, pp. 329­338. [13] J.-M. Geusebroek, G. J. Burghouts, and A. W. Smeulders, "The amsterdam library of object images," Int. J. of Comp. Vis., vol. 61, pp. 103­112, 2005.