Algerian Modern Colloquial Arabic Speech Corpus (AMCASC): regional accents recognition within complex socio-linguistic environments

Djellab, Mouad, Amrouche, Abderrahmane, Bouridane, Ahmed and Mehallegue, Noureddine (2017) Algerian Modern Colloquial Arabic Speech Corpus (AMCASC): regional accents recognition within complex socio-linguistic environments. Language Resources and Evaluation, 51 (3). pp. 613-641. ISSN 1574-020X

Full text not available from this repository.
Official URL: http://dx.doi.org/10.1007/s10579-016-9347-6

Abstract

The Algerian linguistic situation is intricate: ethnic, geographical and colonial influences have led to a complex sociolinguistic environment. As a result of contact between different languages and accents, the Algerian speech community has acquired a distinctive sociolinguistic situation. In addition to the intra- and inter-lingual variation that characterizes the day-to-day linguistic behavior of Algerian speakers, their speech exhibits many linguistic phenomena such as bilingualism and code-switching. Studying automatic regional accent recognition in this type of environment is a new idea in the field of automatic language, dialect and accent recognition, especially since previous studies were conducted using monolingual evaluation data. The effectiveness of the GMM-UBM and i-vector frameworks for accent recognition is assessed using the Algerian Modern Colloquial Arabic Speech Corpus (AMCASC), a linguistic resource collected for this purpose. The assessment shows that these acoustic approaches are degraded not only by mismatches in recording conditions, channels and recording length and by amplitude clipping, but also by language contact phenomena, a further source of perturbation that should be taken into consideration, especially in real-life applications.

Item Type: Article
Uncontrolled Keywords: Regional accents recognition, Acoustic approaches, Complex socio-linguistic environments, Algerian Modern Colloquial Arabic Speech Corpus, Code-switching, Language contact phenomena
Subjects: G400 Computer Science; Q100 Linguistics; Q900 Others in Linguistics, Classics and related subjects
Department: Faculties > Engineering and Environment > Computer and Information Sciences
Depositing User: Nicola King
Date Deposited: 24 Mar 2016 11:01
Last Modified: 13 Sep 2017 10:18
URI: http://nrl.northumbria.ac.uk/id/eprint/26432
