Abstract
Pattern spotting consists of searching in a collection of historical document images for occurrences of a graphical object using an image query. Contrary to object detection, no prior information nor predefined class is given about the query so training a model of the object is not feasible. In this paper, a convolutional neural network approach is proposed to tackle this problem. We use RetinaNet as a feature extractor to obtain multiscale embeddings of the regions of the documents and also for the queries. Experiments conducted on the DocExplore dataset show that our proposal is better at locating patterns and requires less storage for indexing images than the state-of-the-art system, but fails at retrieving multiple pages containing instances of the query.
Original language | English |
---|---|
Title of host publication | Proceedings of the 2019 Workshop on Historical Document Imaging and Processing, HIP 2019 |
Publisher | Association for Computing Machinery |
Pages | 60-65 |
Number of pages | 6 |
ISBN (Electronic) | 9781450376686 |
DOIs | |
State | Published - 20 Sep 2019 |
Externally published | Yes |
Event | 5th International Workshop on Historical Document Imaging and Processing, HIP 2019, held in conjunction with ICDAR 2019 - Sydney, Australia Duration: 20 Sep 2019 → 21 Sep 2019 |
Publication series
Name | ACM International Conference Proceeding Series |
---|
Conference
Conference | 5th International Workshop on Historical Document Imaging and Processing, HIP 2019, held in conjunction with ICDAR 2019 |
---|---|
Country/Territory | Australia |
City | Sydney |
Period | 20/09/19 → 21/09/19 |
Bibliographical note
Publisher Copyright:© 2019 Association for Computing Machinery.
Keywords
- Convolutional neural network
- Historical documents
- Image retrieval
- Pattern spotting