Video shots retrieval using local invariant features

Du, Yuanjia and Shao, Ling (2009) Video shots retrieval using local invariant features. In: IMCE '09 - 1st International Workshop on Interactive Multimedia for Consumer Electronics, 19th - 24th October 2009, Beijing, China.

Full text not available from this repository. (Request a copy)
Official URL:


In this paper, we present an efficient video shots retrieval system based on local feature detection, description and matching. A face tracker is first used to obtain information on faces in different viewpoints. A visual vocabulary is built off-line using an invariant descriptor computed on tracked character face regions in all shots. The vocabulary is refined in two ways to make the retrieval system more efficient. Firstly, the visual vocabulary is minimized by only using facial features selected on face regions which are detected by an accurate face detector. Secondly, three criteria, namely Inverted-Occurrence-Frequency Weights, Average Feature Location Distance and Reliable Nearest-Neighbors, are calculated in advance to make the on-line retrieval procedure more efficient and precise. The proposed system is experimented on the movie "Groundhog Day". The results show that our technique is very effective and efficient on video shots retrieval.

Item Type: Conference or Workshop Item (Paper)
Uncontrolled Keywords: Multimedia retrieval, image and video analysis, bag of visual words
Subjects: G400 Computer Science
Department: Faculties > Engineering and Environment > Computer and Information Sciences
Depositing User: Paul Burns
Date Deposited: 17 Jun 2015 10:56
Last Modified: 13 Oct 2019 00:30

Actions (login required)

View Item View Item


Downloads per month over past year

View more statistics