John Graham-Cumming's blog: How to beat an adaptive/Bayesian spam filter (2004)

2023-07-05

How to beat an adaptive/Bayesian spam filter (2004)

That was the title of my talk at the 2004 MIT Spam Conference on January 16, 2004. As I recently recovered the slides I am creating this blog for posterity.

The core of the talk was that it was possible to take one machine learning spam filter and use another identical one to learn the characteristics of the other. That way one machine learning system would fight spam and the other would automatically identify the other's weaknesses. Thus a machine learning algorithm could learn how to write spam that would get through a tuned machine learning spam filter. This is now referred to as "Adversarial Machine Learning".

The talk also point out that spammers were trying a technique dubbed "Word Salad" to include random words to try to evade filtering.

Slides are here as a PDF and embedded below as images.