Skip to main content

14.05.2024 | Original Article

Image retrieval by aggregating deep orientation structure features

verfasst von: Fen Lu, Guang-Hai Liu

Erschienen in: International Journal of Machine Learning and Cybernetics

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Aggregating deep features for image retrieval has shown excellent performance in terms of accuracy. However, exploring visual perception properties to activate the dormant discriminative cues of deep convolutional feature maps has received little attention. To address this issue, we present a novel representation, namely the deep orientation aggregation histogram, to image retrieval via aggregating deep orientation structure features. Its main highlights are: (1) A statistical orientation computation model is proposed to detect candidate directions. It will help to use the feature maps to exploit various orientation to provide robust representation. (2) A computed module is proposed to active the discriminative orientation cues hidden in the deep convolutional feature maps. It can boost the representation of deep features with aid of the statistical orientation and their orientation structures. (3) The proposed method can stimulate orientation-selectivity mechanism to provide a strong discriminative yet compact representation. Experimental results on five popular benchmark datasets demonstrated that the proposed method could improve retrieval performance in terms of mAP scores. Furthermore, it outperforms some existing state-of-the-art methods without complex fine-tuning. The proposed method benefits to retrieve the scene images with various color and direction details.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Weitere Produktempfehlungen anzeigen
Literatur
1.
Zurück zum Zitat Geirhos R, Rubisch P, Michaelis C, et al (2019) ImageNet-trained CNNs are biased towards texture; increasing shape bias improves accuracy and robustness. International Conference on Learning Representations, New Orleans, LA, USA. Geirhos R, Rubisch P, Michaelis C, et al (2019) ImageNet-trained CNNs are biased towards texture; increasing shape bias improves accuracy and robustness. International Conference on Learning Representations, New Orleans, LA, USA.
2.
Zurück zum Zitat Liu G-H, Yang J-Y (2023) Exploiting deep textures for image retrieval. Int J Mach Learn Cybern 14(2):483–494CrossRef Liu G-H, Yang J-Y (2023) Exploiting deep textures for image retrieval. Int J Mach Learn Cybern 14(2):483–494CrossRef
3.
Zurück zum Zitat Hubel D, Wiesel TN (1962) Receptive fields, binocular interaction and functional architecture in the cat’s visual cortex. J Physiol 160:106–154CrossRef Hubel D, Wiesel TN (1962) Receptive fields, binocular interaction and functional architecture in the cat’s visual cortex. J Physiol 160:106–154CrossRef
4.
Zurück zum Zitat Marr D, Hildreth E (1980) Theory of edge detection. Proc R Soc Lond B 207(1167):187–217CrossRef Marr D, Hildreth E (1980) Theory of edge detection. Proc R Soc Lond B 207(1167):187–217CrossRef
5.
Zurück zum Zitat Liu G-H, Yang J-Y (2021) Deep-seated features histogram: a novel image retrieval method. Pattern Recogn 116:107926CrossRef Liu G-H, Yang J-Y (2021) Deep-seated features histogram: a novel image retrieval method. Pattern Recogn 116:107926CrossRef
6.
Zurück zum Zitat Rossi LF, Harris KD, Carandini M (2020) Spatial connectivity matches direction selectivity in visual cortex. Nature 588:648–652CrossRef Rossi LF, Harris KD, Carandini M (2020) Spatial connectivity matches direction selectivity in visual cortex. Nature 588:648–652CrossRef
7.
Zurück zum Zitat Liu G-H, Yang J-Y (2013) Content-based image retrieval using color deference histogram. Pattern Recogn 46(1):188–198CrossRef Liu G-H, Yang J-Y (2013) Content-based image retrieval using color deference histogram. Pattern Recogn 46(1):188–198CrossRef
8.
Zurück zum Zitat Liu G-H, Yang J-Y, Li ZY (2015) Content-based image retrieval using computational visual attention model. Pattern Recogn 48(8):2554–2566CrossRef Liu G-H, Yang J-Y, Li ZY (2015) Content-based image retrieval using computational visual attention model. Pattern Recogn 48(8):2554–2566CrossRef
9.
Zurück zum Zitat Hua J-Z, Liu G-H, Song S-X (2019) Content-based image retrieval using color volume histograms. Int J Pattern Recognit Artif Intell 33(9):1940010CrossRef Hua J-Z, Liu G-H, Song S-X (2019) Content-based image retrieval using color volume histograms. Int J Pattern Recognit Artif Intell 33(9):1940010CrossRef
10.
Zurück zum Zitat Liu G-H, Yang J-Y (2019) Exploiting color volume and color difference for salient region detection. IEEE Trans Image Process 28(1):6–16MathSciNetCrossRef Liu G-H, Yang J-Y (2019) Exploiting color volume and color difference for salient region detection. IEEE Trans Image Process 28(1):6–16MathSciNetCrossRef
11.
Zurück zum Zitat Lowe DG (2004) Distinctive image features from scale-invariant key-points. Int J Comput Vision 60:91–110CrossRef Lowe DG (2004) Distinctive image features from scale-invariant key-points. Int J Comput Vision 60:91–110CrossRef
12.
Zurück zum Zitat Bay H, Ess A, Tuytelaars T, Gool LV (2008) Speeded-Up robust features (SURF). Comput Vis Image Underst 110(3):346–359CrossRef Bay H, Ess A, Tuytelaars T, Gool LV (2008) Speeded-Up robust features (SURF). Comput Vis Image Underst 110(3):346–359CrossRef
13.
Zurück zum Zitat Csurka G, Dance C, Fan L et al (2004) Visual categorization with bags of keypoints. European Conference on Computer Vision 1:1–22 Csurka G, Dance C, Fan L et al (2004) Visual categorization with bags of keypoints. European Conference on Computer Vision 1:1–22
14.
Zurück zum Zitat Perronnin F, Liu Y, Sanchez J, Poirier H (2010) Large-scale image retrieval with compressed Fisher vectors. IEEE Conference on Computer Vision and Pattern Recognition, San Francisco, CA, USA, pp: 3384–3391. Perronnin F, Liu Y, Sanchez J, Poirier H (2010) Large-scale image retrieval with compressed Fisher vectors. IEEE Conference on Computer Vision and Pattern Recognition, San Francisco, CA, USA, pp: 3384–3391.
15.
Zurück zum Zitat Jégou H, Perronnin F, Douze M et al (2012) Aggregating local image descriptors into compact codes. IEEE Trans Pattern Anal Mach Intell 34(9):1704–1716CrossRef Jégou H, Perronnin F, Douze M et al (2012) Aggregating local image descriptors into compact codes. IEEE Trans Pattern Anal Mach Intell 34(9):1704–1716CrossRef
16.
Zurück zum Zitat Jégou H, Zisserman A (2014) Triangulation Embedding and Democratic Aggregation for Image Search. IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, pp: 3310–3317. Jégou H, Zisserman A (2014) Triangulation Embedding and Democratic Aggregation for Image Search. IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, pp: 3310–3317.
17.
Zurück zum Zitat Husain SS, Bober M (2017) Improving large-scale image retrieval through robust aggregation of local descriptors. IEEE Trans Pattern Anal Mach Intell 39(9):1783–1796CrossRef Husain SS, Bober M (2017) Improving large-scale image retrieval through robust aggregation of local descriptors. IEEE Trans Pattern Anal Mach Intell 39(9):1783–1796CrossRef
18.
Zurück zum Zitat Alex K, Ilya S, Hinton G (2012) ImageNet classification with deep convolutional neural networks. Proceedings of the 25th International Conference on Neural Information Processing Systems, Red Hook, NY, United States, pp: 1097–1105. Alex K, Ilya S, Hinton G (2012) ImageNet classification with deep convolutional neural networks. Proceedings of the 25th International Conference on Neural Information Processing Systems, Red Hook, NY, United States, pp: 1097–1105.
19.
Zurück zum Zitat Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. International Conference on Learning Representations, San Diego, CA. Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. International Conference on Learning Representations, San Diego, CA.
20.
Zurück zum Zitat Szegedy C, Liu W, Jia Y, et al (2015) Going deeper with convolutions. IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, pp: 1–9. Szegedy C, Liu W, Jia Y, et al (2015) Going deeper with convolutions. IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, pp: 1–9.
21.
Zurück zum Zitat He K, Zhang X, Ren S, Sun J (2018) Deep residual learning for image recognition. IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, pp: 770–778. He K, Zhang X, Ren S, Sun J (2018) Deep residual learning for image recognition. IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, pp: 770–778.
22.
Zurück zum Zitat Huang G, Liu Z, Van Der Maaten L, Weinberger K.Q (2017) Densely Connected Convolutional Networks. IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, pp: 2261–2269. Huang G, Liu Z, Van Der Maaten L, Weinberger K.Q (2017) Densely Connected Convolutional Networks. IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, pp: 2261–2269.
23.
Zurück zum Zitat Yandex A.B, Lempitsky V (2015) Aggregating local deep features for image retrieval. IEEE International Conference on Computer Vision, Santiago, Chile, pp: 1269–1277. Yandex A.B, Lempitsky V (2015) Aggregating local deep features for image retrieval. IEEE International Conference on Computer Vision, Santiago, Chile, pp: 1269–1277.
24.
Zurück zum Zitat Razavian AS, Sullivan J, Carlsson S, Maki A (2016) Visual instance retrieval with deep convolutional networks. ITE Transactions on Media Technology and Applications 4(3):251–258CrossRef Razavian AS, Sullivan J, Carlsson S, Maki A (2016) Visual instance retrieval with deep convolutional networks. ITE Transactions on Media Technology and Applications 4(3):251–258CrossRef
25.
Zurück zum Zitat Tolias G, Sicre R, Jégou H (2016) Particular object retrieval with integral max-pooling of CNN activations. International Conference on Learning Representations, San Juan, Puerto Rico, pp: 1–12. Tolias G, Sicre R, Jégou H (2016) Particular object retrieval with integral max-pooling of CNN activations. International Conference on Learning Representations, San Juan, Puerto Rico, pp: 1–12.
26.
Zurück zum Zitat Arandjelović R, Gronat P, Torii A et al (2018) NetVLAD: CNN architecture for weakly supervised place recognition. IEEE Trans Pattern Anal Mach Intell 40(6):1437–1451CrossRef Arandjelović R, Gronat P, Torii A et al (2018) NetVLAD: CNN architecture for weakly supervised place recognition. IEEE Trans Pattern Anal Mach Intell 40(6):1437–1451CrossRef
27.
Zurück zum Zitat Kalantidis Y, Mellina C, Osindero S (2016) Cross-dimensional weighting for aggregated deep convolutional features. European Conference on Computer Vision 9913:685–701 Kalantidis Y, Mellina C, Osindero S (2016) Cross-dimensional weighting for aggregated deep convolutional features. European Conference on Computer Vision 9913:685–701
28.
Zurück zum Zitat Jiménez A, Alvarez J, Giro X (2018) Class-weighted convolutional features for visual instance search. Proceedings of the 28th British Machine Vision Conference, Newcastle, UK, pp: 1–12. Jiménez A, Alvarez J, Giro X (2018) Class-weighted convolutional features for visual instance search. Proceedings of the 28th British Machine Vision Conference, Newcastle, UK, pp: 1–12.
29.
Zurück zum Zitat Forcén JI, Pagola M, Barrenechea E, Bustince H (2020) Co-occurrence of deep convolutional features for image search. Image Vis Comput 97:103909CrossRef Forcén JI, Pagola M, Barrenechea E, Bustince H (2020) Co-occurrence of deep convolutional features for image search. Image Vis Comput 97:103909CrossRef
30.
Zurück zum Zitat Zhang B-J, Liu G-H, Hu J-K (2022) Filtering deep convolutional features for image retrieval. Int J Pattern Recognit Artif Intell 36(1):2252003CrossRef Zhang B-J, Liu G-H, Hu J-K (2022) Filtering deep convolutional features for image retrieval. Int J Pattern Recognit Artif Intell 36(1):2252003CrossRef
31.
Zurück zum Zitat Lu F, Liu G-H (2022) Image retrieval using contrastive weight aggregation histograms. Digital Signal Processing 123:103457CrossRef Lu F, Liu G-H (2022) Image retrieval using contrastive weight aggregation histograms. Digital Signal Processing 123:103457CrossRef
32.
Zurück zum Zitat Lu Z, Liu G-H, Lu F, Zhang B-J (2023) Image retrieval using dual-weighted deep feature descriptor. Int J Mach Learn Cybern 14(3):643–653CrossRef Lu Z, Liu G-H, Lu F, Zhang B-J (2023) Image retrieval using dual-weighted deep feature descriptor. Int J Mach Learn Cybern 14(3):643–653CrossRef
33.
Zurück zum Zitat Liu G-H, Li Z-Y, Yang J-Y, Zhang D (2024) Exploiting sublimated deep features for image retrieval. Pattern Recogn 147:110076CrossRef Liu G-H, Li Z-Y, Yang J-Y, Zhang D (2024) Exploiting sublimated deep features for image retrieval. Pattern Recogn 147:110076CrossRef
34.
Zurück zum Zitat Garg AK, Li P, Rashid MS, Callaway EM (2019) Color and orientation are jointly coded and spatially organized in primate primary visual cortex. Science 364(6447):1275–1279CrossRef Garg AK, Li P, Rashid MS, Callaway EM (2019) Color and orientation are jointly coded and spatially organized in primate primary visual cortex. Science 364(6447):1275–1279CrossRef
35.
Zurück zum Zitat Lu F, Liu G-H (2023) Image retrieval using object semantic aggregation histogram. Cogn Comput 15:1736–1747CrossRef Lu F, Liu G-H (2023) Image retrieval using object semantic aggregation histogram. Cogn Comput 15:1736–1747CrossRef
36.
Zurück zum Zitat Philbin J, Chum O, Isard M, et al (2007) Object retrieval with large vocabularies and fast spatial matching. IEEE Conference on Computer Vision and Pattern Recognition, Minneapolis, MN, USA, pp: 1–8. Philbin J, Chum O, Isard M, et al (2007) Object retrieval with large vocabularies and fast spatial matching. IEEE Conference on Computer Vision and Pattern Recognition, Minneapolis, MN, USA, pp: 1–8.
37.
Zurück zum Zitat Philbin J, Chum O, Isard M, et al (2008) Lost in quantization: improving particular object retrieval in large scale image databases. IEEE Conference on Computer Vision and Pattern Recognition, Anchorage, AK, USA, pp: 1–8. Philbin J, Chum O, Isard M, et al (2008) Lost in quantization: improving particular object retrieval in large scale image databases. IEEE Conference on Computer Vision and Pattern Recognition, Anchorage, AK, USA, pp: 1–8.
38.
Zurück zum Zitat Jégou H, Douze M, Schmid C (2008) Hamming embedding and weak geometric consistency for large scale image search. Proceedings of the 10th European Conference on Computer Vision: Part I, Marseille, France, pp: 304–317. Jégou H, Douze M, Schmid C (2008) Hamming embedding and weak geometric consistency for large scale image search. Proceedings of the 10th European Conference on Computer Vision: Part I, Marseille, France, pp: 304–317.
39.
Zurück zum Zitat Radenovic F, Iscen A, Tolias G, Avrithis Y, Chum O (2018) Revisiting Oxford and Paris: Large-Scale Image Retrieval Benchmarking. IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, pp: 5706–5715. Radenovic F, Iscen A, Tolias G, Avrithis Y, Chum O (2018) Revisiting Oxford and Paris: Large-Scale Image Retrieval Benchmarking. IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, pp: 5706–5715.
40.
Zurück zum Zitat Siméoni O, Avrithis Y, Chum O (2019) Local Features and Visual Words Emerge in Activations. IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, pp: 11643–11652. Siméoni O, Avrithis Y, Chum O (2019) Local Features and Visual Words Emerge in Activations. IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, pp: 11643–11652.
Metadaten
Titel
Image retrieval by aggregating deep orientation structure features
verfasst von
Fen Lu
Guang-Hai Liu
Publikationsdatum
14.05.2024
Verlag
Springer Berlin Heidelberg
Erschienen in
International Journal of Machine Learning and Cybernetics
Print ISSN: 1868-8071
Elektronische ISSN: 1868-808X
DOI
https://doi.org/10.1007/s13042-024-02172-w