Adapt and Align to Improve Zero-Shot Sketch-Based Image Retrieval

Dong, Shiyin; Zhu, Mingrui; Wang, Nannan; Gao, Xinbo

Computer Science > Computer Vision and Pattern Recognition

arXiv:2305.05144 (cs)

[Submitted on 9 May 2023 (v1), last revised 9 Aug 2023 (this version, v3)]

Title:Adapt and Align to Improve Zero-Shot Sketch-Based Image Retrieval

Authors:Shiyin Dong, Mingrui Zhu, Nannan Wang, Xinbo Gao

View PDF

Abstract:Zero-shot sketch-based image retrieval (ZS-SBIR) is challenging due to the cross-domain nature of sketches and photos, as well as the semantic gap between seen and unseen image distributions. Previous methods fine-tune pre-trained models with various side information and learning strategies to learn a compact feature space that is shared between the sketch and photo domains and bridges seen and unseen classes. However, these efforts are inadequate in adapting domains and transferring knowledge from seen to unseen classes. In this paper, we present an effective ``Adapt and Align'' approach to address the key challenges. Specifically, we insert simple and lightweight domain adapters to learn new abstract concepts of the sketch domain and improve cross-domain representation capabilities. Inspired by recent advances in image-text foundation models (e.g., CLIP) on zero-shot scenarios, we explicitly align the learned image embedding with a more semantic text embedding to achieve the desired knowledge transfer from seen to unseen classes. Extensive experiments on three benchmark datasets and two popular backbones demonstrate the superiority of our method in terms of retrieval accuracy and flexibility.

Comments:	10 pages, 7 figures, 6 tables
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2305.05144 [cs.CV]
	(or arXiv:2305.05144v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2305.05144

Submission history

From: Shiyin Dong [view email]
[v1] Tue, 9 May 2023 03:10:15 UTC (3,522 KB)
[v2] Thu, 18 May 2023 06:06:01 UTC (3,680 KB)
[v3] Wed, 9 Aug 2023 14:12:34 UTC (3,994 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Adapt and Align to Improve Zero-Shot Sketch-Based Image Retrieval

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Computer Science > Computer Vision and Pattern Recognition

Title:Adapt and Align to Improve Zero-Shot Sketch-Based Image Retrieval

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.