TY - JOUR
T1 - Improving pattern spotting in historical documents using feature pyramid networks
AU - Úbeda, Ignacio
AU - Saavedra, Jose M.
AU - Nicolas, Stéphane
AU - Petitjean, Caroline
AU - Heutte, Laurent
N1 - Publisher Copyright:
© 2020
PY - 2020/3
Y1 - 2020/3
N2 - Pattern spotting consists of locating different instances of a given object (i.e. an image query) in a collection of historical document images. These patterns may vary in shape, size, color, context and even style because they are hand-drawn, which makes pattern spotting a difficult task. To tackle this problem, we propose a Convolutional Neural Network (CNN) approach based on Feature Pyramid Networks (FPN) as the feature extractor of our system. Using FPN allows to extract descriptors of local regions of the documents to be indexed and queries, at multiple scales with just a single forward pass. Experiments conducted on DocExplore dataset show that the proposed system improves mAP by 73% (from 0.157 to 0.272) in pattern localization compared with state-of-the-art results, even when the feature extractor is not trained with domain-specific data. Memory requirement and computation time are also decreased since the descriptor dimension used for distance computation is reduced by a factor of 16.
AB - Pattern spotting consists of locating different instances of a given object (i.e. an image query) in a collection of historical document images. These patterns may vary in shape, size, color, context and even style because they are hand-drawn, which makes pattern spotting a difficult task. To tackle this problem, we propose a Convolutional Neural Network (CNN) approach based on Feature Pyramid Networks (FPN) as the feature extractor of our system. Using FPN allows to extract descriptors of local regions of the documents to be indexed and queries, at multiple scales with just a single forward pass. Experiments conducted on DocExplore dataset show that the proposed system improves mAP by 73% (from 0.157 to 0.272) in pattern localization compared with state-of-the-art results, even when the feature extractor is not trained with domain-specific data. Memory requirement and computation time are also decreased since the descriptor dimension used for distance computation is reduced by a factor of 16.
KW - Convolutional neural network
KW - Historical documents
KW - Image retrieval
KW - Pattern spotting
UR - http://www.scopus.com/inward/record.url?scp=85078975619&partnerID=8YFLogxK
U2 - 10.1016/j.patrec.2020.02.002
DO - 10.1016/j.patrec.2020.02.002
M3 - Article
AN - SCOPUS:85078975619
SN - 0167-8655
VL - 131
SP - 398
EP - 404
JO - Pattern Recognition Letters
JF - Pattern Recognition Letters
ER -