Ketsbaia, Lida, Issac, Biju and Chen, Xiaomin (2020) Detection of Hate Tweets using Machine Learning and Deep Learning. In: The 19th IEEE International Conference on Trust, Security and Privacy in Computing and Communications (IEEE TrustCom 2020), 29 Dec 2020 - 1 Jan 2021, Guangzhou, China. (In Press)
|
Text
Accepted Manuscript_Ketsbaia et al.pdf - Accepted Version Download (475kB) | Preview |
Abstract
Cyberbullying has become a highly problematic occurrence due to its potential of anonymity and its ease for others to join in the harassment of victims. The distancing effect that technological devices have, has led to cyberbullies say and do harsher things compared to what is typical in a traditional face-to-face bullying situation. Given the great importance of the problem, detection is becoming a key area of cyberbullying research. Therefore, it is highly necessary for a framework to accurately detect new cyberbullying instances automatically. To review the machine learning and deep learning approaches, two datasets were used. The first dataset was provided by the University of Maryland consisting of over 30,000 tweets, whereas the second dataset was based on the article ‘Automated Hate Speech Detection and the Problem of Offensive Language’ by Davidson et al., containing roughly 25,000 tweets. The paper explores machine learning approaches using word embeddings such as DBOW (Distributed Bag of Words) and DMM (Distributed Memory Mean) and the performance of Word2vec Convolutional Neural Networks (CNNs) to classify online hate.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Uncontrolled Keywords: | Hate Speech, CNN, Machine Learning, Word2Vec, Doc2Vec |
Subjects: | G400 Computer Science |
Department: | Faculties > Engineering and Environment > Computer and Information Sciences |
Related URLs: | |
Depositing User: | John Coen |
Date Deposited: | 11 Nov 2020 09:07 |
Last Modified: | 02 Jan 2021 03:30 |
URI: | http://nrl.northumbria.ac.uk/id/eprint/44729 |
Downloads
Downloads per month over past year