Bluehost.com Web Hosting $6.95

Re: Really good spam filtering (Bayesian)

This is a discussion on Re: Really good spam filtering (Bayesian) within the Linux Networking forums, part of the Linux Forums category; "Jem Berkes" <jb@users.pc9.org> wrote in message news:Xns93BC90BBFF2CDjbuserspc9org@205.200.16.73.. . | I ...


Go Back   Usenet Forums > Linux Forums > Linux Networking

FAQ Members List Calendar Search Today's Posts Mark Forums Read
  #1 (permalink)  
Old 07-18-2003
Ken Kauffman
 
Posts: n/a
Default Re: Really good spam filtering (Bayesian)


"Jem Berkes" <jb@users.pc9.org> wrote in message
news:Xns93BC90BBFF2CDjbuserspc9org@205.200.16.73.. .
| I don't know why this project hasn't received more attention. I am
| currently using SpamProbe, a Bayesian filter by Brian Burton
| http://spamprobe.sourceforge.net/
|
| This filter is invoked by procmail. spamprobe assigns a label and scores
| each incoming email. Unlike spamassasin and homegrown techniques, the spam
| score is NOT determined by arbitrary thresholds. With Bayesian filtering,
| each word and phrase in the email is looked up in a database to calculate
| probability of spam based on prior experience with the word combinations.
|
| Bayesian filtering has only two downsides as far as I can tell: first, you
| have to "train" the system to teach what is spam and what is good, along
| with occasional corrections. Second, a rather large database of words is
| required. In my experience this uses around 20 to 30 MB of storage.
|
| On my most "hopeless" email address (the one I post to USENET with ;) I
| have been using spamprobe for many months, and achieving about 98% filter
| accuracy. I'm laughing, seriously.
|
| I have also now released a group of scripts that add a web interface to
| spamprobe, so that you can reclassify mail without having to type in
| console stuff. If you want to check out these bash scripts, visit:
| http://www.pc-tools.net/unix/webfilt/
|
| --
| Jem Berkes
| http://www.pc-tools.net/
| Windows, Linux & UNIX software
|

vipuls razor is utilized by spamassassin if you install it first. the
combination is quite effective with minimal training.

ken k


Reply With Quote
Reply
Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are Off
[IMG] code is Off
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On




All times are GMT +1. The time now is 08:53 AM.


Powered by vBulletin® Version 3.7.3
Copyright ©2000 - 2008, Jelsoft Enterprises Ltd.
Content Relevant URLs by vBSEO 3.0.0