Dissimilarity Gaussian Mixture Models for Efficient Offline Handwritten Text-Independent Identification using SIFT and RootSIFT Descriptors

Khan, Faraz, Khelifi, Fouad, Tahir, Muhammad and Bouridane, Ahmed (2019) Dissimilarity Gaussian Mixture Models for Efficient Offline Handwritten Text-Independent Identification using SIFT and RootSIFT Descriptors. IEEE Transactions on Information Forensics and Security, 14 (2). pp. 289-303. ISSN 1556-6013

[img]
Preview
Text
Khan et al - Dissimilarity Gaussian Mixture Models AAM.pdf - Accepted Version

Download (925kB) | Preview
Official URL: https://doi.org/10.1109/TIFS.2018.2850011

Abstract

Handwriting biometrics is the science of identifying the behavioural aspect of an individual’s writing style and exploiting it to develop automated writer identification and verification systems. This paper presents an efficient handwriting identification system which combines Scale Invariant Feature Transform (SIFT) and RootSIFT descriptors in a set of Gaussian mixture models (GMM). In particular, a new concept of similarity and dissimilarity Gaussian mixture models (SGMM and DGMM) is introduced. While a SGMM is constructed for every writer to describe the intra-class similarity that is exhibited between the handwritten texts of the same writer, a DGMM represents the contrast or dissimilarity that exists between the writer’s style on one hand and other different handwriting styles on the other hand. Furthermore, because the handwritten text is described by a number of key point descriptors where each descriptor generates a SGMM/DGMM score, a new weighted histogram method is proposed to derive the intermediate prediction score for each writer’s GMM. The idea of weighted histogram exploits the fact that handwritings from the same writer should exhibit more similar textual patterns than dissimilar ones, hence, by penalizing the bad scores with a cost function, the identification rate can be significantly enhanced. Our proposed system has been extensively assessed using six different public datasets (including three English, two Arabic and one hybrid language) and the results have shown the superiority of the proposed system over state-of-the-art techniques.

Item Type: Article
Uncontrolled Keywords: Writer identification, text independent, Gaussian Mixture Model, dissimilarity framework, weighted histograms, SIFT, RootSIFT
Subjects: G400 Computer Science
Department: Faculties > Engineering and Environment > Computer and Information Sciences
Depositing User: Paul Burns
Date Deposited: 14 Jun 2018 09:00
Last Modified: 31 Jul 2021 13:31
URI: http://nrl.northumbria.ac.uk/id/eprint/34530

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year

View more statistics