15 Dec 2004, 8:47 p.m.
On Wed, 15 Dec 2004 tallison@tacocat.net wrote:
Bogofilter and SA (when they added Bayesian filtering) both exhibited a rather retarded functionality for the first 100 emails or so. After a bit they began to learn. Given that initial curve... Unless dspam starts with a preloaded wordlist or something else, I can't imagine its success being significantly different at the beginning.
The SA instructions _specifically_ tell you that you must train the Bayes stuff _before_ using it; you have to feed it at least 500 spams and 500 hams, or something along those lines.
bogofilter recommends 1000+ but it's reasonably effective after only 100 of both ham and spam.
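
To make the "train it first" point concrete, here is a rough sketch of a word-count Bayes filter that refuses to score anything until it has seen a minimum number of hams and spams. This is a toy, not how SpamAssassin, bogofilter, or dspam work internally: the class name `ToyBayesFilter`, its methods, the whitespace tokenizer, the add-one smoothing, and the equal-priors assumption are all made up for illustration, and the default thresholds of 100 just echo the figure mentioned above.

```python
import math
from collections import Counter


class ToyBayesFilter:
    """Hypothetical toy filter; abstains until enough mail has been trained."""

    def __init__(self, min_ham=100, min_spam=100):
        # Assumed thresholds; real filters pick their own minimums.
        self.min_ham = min_ham
        self.min_spam = min_spam
        self.word_counts = {"ham": Counter(), "spam": Counter()}
        self.messages = {"ham": 0, "spam": 0}

    def train(self, text, label):
        # Count each distinct word once per message.
        self.messages[label] += 1
        self.word_counts[label].update(set(text.lower().split()))

    def is_ready(self):
        return (self.messages["ham"] >= self.min_ham
                and self.messages["spam"] >= self.min_spam)

    def spam_probability(self, text):
        # Return None instead of guessing while undertrained.
        if not self.is_ready():
            return None
        log_odds = 0.0  # equal priors for ham and spam (assumption)
        for word in set(text.lower().split()):
            # Add-one smoothing so unseen words never give a zero probability.
            p_spam = (self.word_counts["spam"][word] + 1) / (self.messages["spam"] + 2)
            p_ham = (self.word_counts["ham"][word] + 1) / (self.messages["ham"] + 2)
            log_odds += math.log(p_spam / p_ham)
        # Clamp to keep exp() from overflowing on long messages.
        log_odds = max(-50.0, min(50.0, log_odds))
        return 1.0 / (1.0 + math.exp(-log_odds))
```

With the thresholds lowered for a quick try, `spam_probability` returns `None` until both counts are met and only then starts giving numbers, which is roughly the behaviour the SA instructions are steering you toward.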