Abstract
Local feature matching is an essential component of many image and object retrieval algorithms. Euclidean and Mahalanobis distances are mostly used in order to quantify the similarity of two stipulated feature vectors. The Euclidean distance is inappropriate in the typical case where the components of the feature vector are incommensurable entities, and indeed yields unsatisfactory results in practice. The Mahalanobis distance performs better, but is less generic in the sense that it requires specific training data.
In this paper we consider two alternative ways to construct generic distance measures for image and object retrieval, which do not suffer from any of these shortcomings. The first approach aims at obtaining a (image independent) covariance matrix for a Mahalonobis-like distance function without explicit training, and is applicable to feature vectors consisting of partial image derivatives. In the second approach a stability based similarity measure (SBSM) is introduced for feature vectors that are composed of arbitrary algebraic combinations of image derivatives, and likewise requires no explicit training. The strength and novelty of SBSM lies in the fact that the associated covariance matrix exploits local image structure. A performance analysis shows that feature matching based on SBSM outperforms algorithms based on Euclidean and Mahalanobis distances.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
Ashbrook, A.P., Thacker, N.A., Rockett, P.I., Brown, C.I.: Robust recognition of scaled shapes using pairwise geometric histograms. In: BMVC ’95: Proceedings of the 6th British Conference on Machine Vision, Surrey UK, 1995, vol. 2, pp. 503–512. BMVA Press, Bristol (1995)
Bay, H., Tuytelaars, T., Van Gool, L.: Surf: Speeded-up robust features. In: 9th European Conference on Computer Vision, Graz, Austria
Belongie, S., Malik, J., Puzicha, J.: Shape matching and object recognition using shape contexts. IEEE Trans. Pattern Anal. Mach. Intell. 24(4), 509–522 (2002)
Blom, J.: Topological and geometrical aspects of image structure. Ph.D. thesis, University of Utrecht, Department of Medical and Physiological Physics, Utrecht, The Netherlands (1992)
Blom, J., ter Haar Romeny, B.M., Bel, A., Koenderink, J.J.: Spatial derivatives and the propagation of noise in Gaussian scale-space. J. Vis. Commun. Image Represent. 4(1), 1–13 (1993)
Florack, L.M.J.: Image Structure. Computational Imaging and Vision Series, vol. 10. Kluwer Academic, Dordrecht (1997)
Florack, L.M.J., ter Haar Romeny, B.M., Koenderink, J.J., Viergever, M.A.: Scale and the differential structure of images. Image Vis. Comput. 10(6), 376–388 (1992)
Florack, L.M.J., ter Haar Romeny, B.M., Koenderink, J.J., Viergever, M.A.: Cartesian differential invariants in scale-space. J. Math. Imaging Vis. 3(4), 327–348 (1993)
Florack, L.M.J., ter Haar Romeny, B.M., Koenderink, J.J., Viergever, M.A.: General intensity transformations and differential invariants. J. Math. Imaging Vis. 4(2), 171–187 (1994)
Freeman, W., Adelson, E.: The design and use of steerable filters. IEEE Trans. Pattern Anal. Mach. Intell. 13(9), 891–906 (1991)
Gabor, D.: Theory of communication. J. IEEE 93, 429–457 (1946)
Gouet, V., Montesinos, P., Pele, D.: A fast matching method for color uncalibrated images using differential invariants (1998)
Griffin, L.D.: The second order local-image-structure solid. IEEE Trans. Pattern Anal. Mach. Intell. 29(8), 1355–1366 (2007)
Grigorescu, S.E., Petkov, N., Kruizinga, P.: A comparative study of filter based texture operators using mahalanobis distance. ICPR 03, 3897 (2000)
ter Haar Romeny, B.M.: Front-End Vision and Multi-Scale Image Analysis: Multi-Scale Computer Vision Theory and Applications, written in Mathematica. Computational Imaging and Vision Series, vol. 27. Kluwer Academic, Dordrecht (2003)
Harris, C., Stephens, M.: A combined corner and edge detector. In: Proc. 4th Alvey Vision Conf., pp. 189–192 (1988)
Koenderink, J.J.: The structure of images. Biol. Cybern. 50, 363–370 (1984)
Koenderink, J.J., van Doorn, A.J.: Receptive field families. Biol. Cybern. 63, 291–298 (1990)
Laws, K.I.: Rapid texture identification. In: Proc. SPIE Conf. Image Processing for Missile Guidance, pp. 376–380 (1980)
Lazebnik, S., Schmid, C., Ponce, J.: A sparse texture representation using local affine regions. IEEE Trans. Pattern Anal. Mach. Intell. 27(8), 1265–1278 (2005)
Lindeberg, T.: Scale-space for discrete signals. IEEE Trans. Pattern Anal. Mach. Intell. 12(3), 234–245 (1990)
Lindeberg, T.: Scale-Space Theory in Computer Vision. The Kluwer International Series in Engineering and Computer Science. Kluwer Academic, Dordrecht (1994)
Loog, M.: The jet metric. In: Sgallari, F., Murli, A., Paragios, N. (eds.) SSVM. Lecture Notes in Computer Science, vol. 4485, pp. 25–31. Springer, Berlin (2007)
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)
Markussen, B., Pedersen, K.S., Loog, M.: A scale invariant covariance structure on jet space. In: Olsen, O.F., Florack, L.M.J., Kuijper, A. (eds.) DSSCV. Lecture Notes in Computer Science, vol. 3753, pp. 12–23. Springer, Berlin (2005)
Mikolajczyk, K., Schmid, C.: Scale and affine invariant interest point detectors. Int. J. Comput. Vis. 60(1), 63–86 (2004)
Mikolajczyk, K., Schmid, C.: A performance evaluation of local descriptors. IEEE Trans. Pattern Anal. Mach. Intell. 27(10), 1615–1630 (2005)
Olver, P.J.: Classical Invariant Theory. London Mathematical Society Student Texts, vol. 44. Cambridge University Press, Cambridge (1999)
Platel, B., Florack, L.M.J., Kanters, F.M.W., Balmachnova, E.G.: Using multiscale top points in image matching. In: Proceedings of the 11th International Conference on Image Processing, Singapore, pp. 389–392 (2004)
Randen, T., Husøy, J.H.: Filtering for texture classification: a comparative study. IEEE Trans. Pattern Anal. Mach. Intell. 21(4), 291–310 (1999)
Schmid, C., Mohr, R.: Local grayvalue invariants for image retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 19(5), 530–535 (1997)
Unser, M.: Local linear transforms for texture measurements. Signal Process. 11(1), 61–79 (1986)
Witkin, A.P.: Scale-space filtering. In: Proceedings of the International Joint Conference on Artificial Intelligence, Karlsruhe, Germany, pp. 1019–1022 (1983)
Yan, K., Sukthankar, R.: Pca-sift: a more distinctive representation for local image descriptors. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 506–513 (2004)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Open Access This is an open access article distributed under the terms of the Creative Commons Attribution Noncommercial License (https://creativecommons.org/licenses/by-nc/2.0), which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.
About this article
Cite this article
Balmashnova, E., Florack, L.M.J. Novel Similarity Measures for Differential Invariant Descriptors for Generic Object Retrieval. J Math Imaging Vis 31, 121–132 (2008). https://doi.org/10.1007/s10851-008-0079-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10851-008-0079-0