Detection of Hate Tweets using Machine Learning and Deep Learning

Ketsbaia, Lida, Issac, Biju and Chen, Xiaomin (2020) Detection of Hate Tweets using Machine Learning and Deep Learning. In: The 19th IEEE International Conference on Trust, Security and Privacy in Computing and Communications (IEEE TrustCom 2020), 29 Dec 2020 - 1 Jan 2021, Guangzhou, China. (In Press)

Accepted Manuscript_Ketsbaia et al.pdf - Accepted Version

Download (475kB) | Preview


Cyberbullying has become a highly problematic occurrence due to its potential of anonymity and its ease for others to join in the harassment of victims. The distancing effect that technological devices have, has led to cyberbullies say and do harsher things compared to what is typical in a traditional face-to-face bullying situation. Given the great importance of the problem, detection is becoming a key area of cyberbullying research. Therefore, it is highly necessary for a framework to accurately detect new cyberbullying instances automatically. To review the machine learning and deep learning approaches, two datasets were used. The first dataset was provided by the University of Maryland consisting of over 30,000 tweets, whereas the second dataset was based on the article ‘Automated Hate Speech Detection and the Problem of Offensive Language’ by Davidson et al., containing roughly 25,000 tweets. The paper explores machine learning approaches using word embeddings such as DBOW (Distributed Bag of Words) and DMM (Distributed Memory Mean) and the performance of Word2vec Convolutional Neural Networks (CNNs) to classify online hate.

Item Type: Conference or Workshop Item (Paper)
Uncontrolled Keywords: Hate Speech, CNN, Machine Learning, Word2Vec, Doc2Vec
Subjects: G400 Computer Science
Department: Faculties > Engineering and Environment > Computer and Information Sciences
Related URLs:
Depositing User: John Coen
Date Deposited: 11 Nov 2020 09:07
Last Modified: 02 Jan 2021 03:30

Actions (login required)

View Item View Item


Downloads per month over past year

View more statistics