Pattern spotting in historical documents using convolutional models

Ignacio Úbeda, Jose M. Saavedra, Stéphane Nicolas, Caroline Petitjean, Laurent Heutte

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

Pattern spotting consists of searching in a collection of historical document images for occurrences of a graphical object using an image query. Contrary to object detection, no prior information nor predefined class is given about the query so training a model of the object is not feasible. In this paper, a convolutional neural network approach is proposed to tackle this problem. We use RetinaNet as a feature extractor to obtain multiscale embeddings of the regions of the documents and also for the queries. Experiments conducted on the DocExplore dataset show that our proposal is better at locating patterns and requires less storage for indexing images than the state-of-the-art system, but fails at retrieving multiple pages containing instances of the query.

Original languageEnglish
Title of host publicationProceedings of the 2019 Workshop on Historical Document Imaging and Processing, HIP 2019
PublisherAssociation for Computing Machinery
Pages60-65
Number of pages6
ISBN (Electronic)9781450376686
DOIs
StatePublished - 20 Sep 2019
Externally publishedYes
Event5th International Workshop on Historical Document Imaging and Processing, HIP 2019, held in conjunction with ICDAR 2019 - Sydney, Australia
Duration: 20 Sep 201921 Sep 2019

Publication series

NameACM International Conference Proceeding Series

Conference

Conference5th International Workshop on Historical Document Imaging and Processing, HIP 2019, held in conjunction with ICDAR 2019
Country/TerritoryAustralia
CitySydney
Period20/09/1921/09/19

Bibliographical note

Publisher Copyright:
© 2019 Association for Computing Machinery.

Keywords

  • Convolutional neural network
  • Historical documents
  • Image retrieval
  • Pattern spotting

Fingerprint

Dive into the research topics of 'Pattern spotting in historical documents using convolutional models'. Together they form a unique fingerprint.

Cite this