AI sucks at stopping online trolls spewing toxic comments
A group of researchers from Aalto University and the University of Padua found this out when they tested seven state-of-the-art models used to detect hate speech. All of them failed to recognize foul language when subtle changes were made, according to a paper [PDF] on arXiv. Adversarial examples can be created automatically by using algorithms Read more about AI sucks at stopping online trolls spewing toxic comments[…]