Adam, Kalthoum, Baig, Asim, Al-Maadeed, Somaya, Bouridane, Ahmed and El-Menshawy, Sherine (2018) KERTAS: dataset for automatic dating of ancient Arabic manuscripts. International Journal on Document Analysis and Recognition (IJDAR), 21 (4). pp. 283-290. ISSN 1433-2833
Text
Adam2018_Article_KERTASDatasetForAutomaticDatin.pdf (1MB) - Published Version. Available under License Creative Commons Attribution 4.0.
Abstract
The age of a historical manuscript can be an invaluable source of information for paleographers and historians. The process of automatic manuscript age detection has inherent complexities, which are compounded by the lack of suitable datasets for algorithm testing. This paper presents a dataset of historical handwritten Arabic manuscripts designed specifically to test state-of-the-art authorship and age detection algorithms. Qatar National Library has been the main source of manuscripts for this dataset, while the remaining manuscripts are open source. The dataset consists of over 2000 images taken from various handwritten Arabic manuscripts spanning fourteen centuries. In addition, a sparse representation-based approach for dating historical Arabic manuscripts is also proposed. There is a lack of existing datasets that provide reliable writing date and author identity as metadata. KERTAS is a new dataset of historical documents that can help researchers, historians and paleographers to automatically date Arabic manuscripts more accurately and efficiently.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | Historical documents dataset, Image processing, Classification, Feature extraction |
Subjects: | G400 Computer Science; G900 Others in Mathematical and Computing Sciences |
Department: | Faculties > Engineering and Environment > Computer and Information Sciences |
Depositing User: | Elena Carlaw |
Date Deposited: | 20 Feb 2019 10:27 |
Last Modified: | 01 Aug 2021 13:04 |
URI: | http://nrl.northumbria.ac.uk/id/eprint/38132 |