New TokenBreak Attack Bypasses AI Moderation with Single-Character Text Changes - The Hacker News
The TokenBreak attack exploits specific tokenization strategies (BPE or WordPiece) in text classification models by introducing single-character changes, bypass...
Read Analysis →