A benchmark for toxic comment classification on civil comments dataset
In Extraction et gestion des connaissances, EGC 2023, Lyon, France, January 16-20, 2023
In Workshop EGC 2022 DL for NLP
Abstract: Hate speech and toxic comment detection on social media has proven to be an essential issue for content moderation. This paper presents a comparison of Transformer models for hate speech detection, such as HateBERT (a BERT-based model), RoBERTa, and BERTweet (a RoBERTa-based model). These models are evaluated on the Jibes&Delight 2021 Reddit dataset under the same training and testing conditions. Several approaches involving feature extraction and data augmentation are also detailed.