TY - JOUR
T1 - SBIR-BYOL
T2 - a self-supervised sketch-based image retrieval model
AU - Saavedra, Jose M.
AU - Morales, Javier
AU - Murrugarra-Llerena, Nils
N1 - Publisher Copyright:
© 2022, The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature.
PY - 2023/3
Y1 - 2023/3
N2 - Sketch-based image retrieval is demanding interest in the computer vision community due to its relevance in the visual perception system and its potential application in a wide diversity of industries. In the literature, we observe significant advances when the models are evaluated in public datasets. However, when assessed in real environments, the performance drops drastically. The big problem is that the SOTA SBIR models follow a supervised regimen, strongly depending on a considerable amount of labeled sketch-photo pairs, which is unfeasible in real contexts. Therefore, we propose SBIR-BYOL, an extension of the well-known BYOL, to work in a bimodal scenario for sketch-based image retrieval. To this end, we also propose a two-stage self-supervised training methodology, exploiting existing sketch-photo pairs and contour-photo pairs generated from photographs of a target catalog. We demonstrate the benefits of our model for the eCommerce environments, where searching is a critical component. Here, our self-supervised SBIR model shows an increase of over 60 % of mAP.
AB - Sketch-based image retrieval is demanding interest in the computer vision community due to its relevance in the visual perception system and its potential application in a wide diversity of industries. In the literature, we observe significant advances when the models are evaluated in public datasets. However, when assessed in real environments, the performance drops drastically. The big problem is that the SOTA SBIR models follow a supervised regimen, strongly depending on a considerable amount of labeled sketch-photo pairs, which is unfeasible in real contexts. Therefore, we propose SBIR-BYOL, an extension of the well-known BYOL, to work in a bimodal scenario for sketch-based image retrieval. To this end, we also propose a two-stage self-supervised training methodology, exploiting existing sketch-photo pairs and contour-photo pairs generated from photographs of a target catalog. We demonstrate the benefits of our model for the eCommerce environments, where searching is a critical component. Here, our self-supervised SBIR model shows an increase of over 60 % of mAP.
KW - Deep-learning
KW - Representation learning
KW - Self-supervision
KW - Sketch-based image retrieval
UR - http://www.scopus.com/inward/record.url?scp=85141216541&partnerID=8YFLogxK
U2 - 10.1007/s00521-022-07978-9
DO - 10.1007/s00521-022-07978-9
M3 - Article
AN - SCOPUS:85141216541
SN - 0941-0643
VL - 35
SP - 5395
EP - 5408
JO - Neural Computing and Applications
JF - Neural Computing and Applications
IS - 7
ER -