This work explores attention models to weight the contribution of local convolutional representations for the instance search task. We present a retrieval framework based on bags of local convolutional features (BLCF) that benefits from saliency weighting to build an efficient image representation. The use of human visual attention models (saliency) allows significant improvements in retrieval performance without the need to conduct region analysis or spatial verification, and without requiring any feature fine tuning. We investigate the impact of different saliency models, finding that higher performance on saliency benchmarks does not necessarily equate to improved performance when used in instance search tasks. The proposed approach outperforms the state-of-the-art on the challenging INSTRE benchmark by a large margin, and provides similar performance on the Oxford and Paris benchmarks compared to more complex methods that use off-the-shelf representations.Source code is publicly available at https://github.com/imatge-upc/salbow.
Ireland ->
Dublin City University ->
Publication Type = Conference or Workshop Item
Ireland ->
Dublin City University ->
DCU Faculties and Centres = DCU Faculties and Schools: Faculty of Engineering and Computing: School of Computing
Ireland ->
Dublin City University ->
Subject = Computer Science: Artificial intelligence
Ireland ->
Dublin City University ->
Subject = Computer Science: Image processing
Ireland ->
Dublin City University ->
Status = Published
Ireland ->
Dublin City University ->
Subject = Computer Science: Information retrieval
Ireland ->
Dublin City University ->
DCU Faculties and Centres = Research Initiatives and Centres: INSIGHT Centre for Data Analytics
Noel E. O'Connor,
Xavier Giro-i-Nieto,
Kevin McGuinness,
Eva Mohedano