Abstract
The Visual Object Retrieval problem consists in locating the occurrences of a specific entity in an image/video dataset. In this work, we focus on discovering new occurrences of an entity by propagating the detection scores of already computed candidates to other video segments. The score propagation follows the edges of a pre-computed Similarity Shot Graph (SSG). The SSG connects video segments that are similar according to some criterion. Four methods for creating the SSG are presented: two based on computing and comparing low-level visual features, one based on comparing text transcriptions, and other based on computing and comparing high-level concepts. The score propagation is evaluated on the INS 2014 dataset. The results show that the detection performance can be slightly improved by the proposed algorithm. However, the performance is variable and depends on the properties of the SSG and the target entity. It is part of the future work to automatically decide the kind of SSG that will be used to propagate scores given a set of detection candidates.
Original language | English |
---|---|
Title of host publication | SLAM 2015 - Proceedings of the 2015 Workshop on Speech, Language and Audio in Multimedia, co-located with ACM MM 2015 |
Publisher | Association for Computing Machinery, Inc |
Pages | 19-22 |
Number of pages | 4 |
ISBN (Electronic) | 9781450337496 |
DOIs | |
State | Published - 30 Oct 2015 |
Externally published | Yes |
Event | 3rd Workshop on Speech, Language and Audio in Multimedia, SLAM 2015 - Brisbane, Australia Duration: 30 Oct 2015 → … |
Publication series
Name | SLAM 2015 - Proceedings of the 2015 Workshop on Speech, Language and Audio in Multimedia, co-located with ACM MM 2015 |
---|
Conference
Conference | 3rd Workshop on Speech, Language and Audio in Multimedia, SLAM 2015 |
---|---|
Country/Territory | Australia |
City | Brisbane |
Period | 30/10/15 → … |
Bibliographical note
Publisher Copyright:© 2015 ACM.