The internet constitutes a society and as in every society there are malicious people so there are users on the internet who victimize other members of the community by making vulgar and provocative comments. Such toxic behaviors in first phase prevent the victims from exercising their right to freedom of speech in the future and in second phase they desert the community. The purpose of this thesis is to investigate and predict the toxicity in comments using various Neural Network architectures. The data set was taken from Kaggle's ‘Jigsaw Unintended Bias in Toxicity Classification’ competition organized by Jigsaw, a Google research team. The architectures, synthesized, are 16 in total: 6 using LSTM, 6 using GRU, 1 using CNN, 1 using BERT, 1 using RoBERTa and 1 using GPT2. Finally, ensemble learning was used, testing various combinations for the 4 best architectures. The best results were shown by the use of all four best architectures ranking this solution in the top 6% of the best solutions of the competition.
-
Notifications
You must be signed in to change notification settings - Fork 0
This is my repository and all the code needed to complete my Bachelor thesis on the detection of toxic comments.
License
CoGian/Detecting-toxic-comments-and-minimizing-of-unintetional-prejudice-using-neural-networks
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
This is my repository and all the code needed to complete my Bachelor thesis on the detection of toxic comments.
Topics
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published