There have been some negatives here and there (and false positives) but so far, I haven’t read a spam in months! I completely forgot about its existance.
I recently ditched my long standing e-mail address, simply because of the amount of spam I recieved.
I did use POPFile, but it was the fact that every morning when I turned my computer on, it had to download around 1000 e-mails a day, with 99.9% being spam.
These days I’ve got two e-mails address, one super secret one just for family and friends. The other for using in places that are likely to get it on a spam list.
The Markovian filtering would deliver something like 99.99% in this case… rule of thumb is, it cuts the amount of spam that makes it through a bayesian filter in half.
Is the approach similar to Bayesian in that it uses statistical inference, but just with phrases?
Yes. From page 20 of the slide deck (which I still highly recommend – it’s great)