Enhancing cyberbullying detection with RoBERTa: A transformer-based approach
Book chapter
Azhar, H. and Runa, A. 2024. Enhancing cyberbullying detection with RoBERTa: A transformer-based approach. in: 16th International Conference on Global Security, Safety & Sustainability, ICGS3-24 - Cybersecurity and Human Capabilities through Symbiotic Artificial Intelligence Springer.
Authors | Azhar, H. and Runa, A. |
---|---|
Abstract | This study investigates the effectiveness of transformer-based models, specifically RoBERTa, in detecting cyberbullying on social media platforms compared to traditional machine learning models such as Random Forest (RF) and Long Short-Term Memory (LSTM). Cyberbullying poses significant challenges due to the evolving nature of language and the anonymity provided by digital platforms. The research focuses on fine-tuning RoBERTa for cyberbullying detection and evaluates its performance using a comprehensive real world dataset comprising approximately 48,000 manually annotated tweets, categorised into various forms of cyberbullying, including explicit and subtle abuses related to ethnicity, age, gender, and other characteristics. Results show that RoBERTa achieved the highest accuracy of 83.9%, outperforming LSTM (77.7%) and RF (79.2%). It excelled in the religion category (accuracy: 96.06%, precision: 96.48%) and ethnicity (accuracy: 98.60%, precision: 98.28%). RF led in the age category (accuracy: 98.31%, precision: 94.54%), with RoBERTa closely behind at 97.53% accuracy. LSTM performed lower, especially in the gender category, with an accuracy of 88.45% and precision of 72.12%. Although the results highlight the effectiveness of RoBERTa in recognising subtle forms of cyberbullying, it faced challenges in real-time applications due to slower inference times and higher computational costs. This research highlights the importance of contextual understanding in cyberbullying detection and the potential of transformer-based models to improve accuracy. |
Keywords | Cyberbullying; Transformer models; RoBERTa; Natural Language Processing; LSTM; Random Forest; Social media; Deep learning; Machine learning |
Year | 2024 |
Book title | 16th International Conference on Global Security, Safety & Sustainability, ICGS3-24 - Cybersecurity and Human Capabilities through Symbiotic Artificial Intelligence |
Publisher | Springer |
Output status | In press |
Publication process dates | |
Deposited | 05 Dec 2024 |
Related URL | https://london.northumbria.ac.uk/icgs3-24/ |
https://repository.canterbury.ac.uk/item/99v71/enhancing-cyberbullying-detection-with-roberta-a-transformer-based-approach
6
total views0
total downloads6
views this month0
downloads this month