Hauke Fath wrote:
While this of course depends on your definition of "larger", some people seem to think otherwise:
Not having a Usenix login, I cannot comment on the full paper, but to quote this from the abstract:
Bayesian classification has been able to solve the spam problem for this user population for the present and observable future, with a single wordlist, and with no secondary spam filtering techniques ^^^^^^^^^^^^^^^
The use of a single wordlist is appropriate for limited circumstances. Even in a corporate environment like I manage, there is a very wide definition of what constitutes spam, and a configuration such as described above wouldn't work here. It would work even less in an ISP environment, with widely varied userbase.
This is now veering into Off Topic Territory...
John
-- John Peacock Director of Information Research and Technology Rowman & Littlefield Publishing Group 4501 Forbes Boulevard Suite H Lanham, MD 20706 301-459-3366 x.5010 fax 301-429-5748