Tuesday, November 14, 2006

AJAX Security

We recently analyzed 200,000 tokens, or words, in the bayesian spam filter on our mail server. We analyzed many factors within this data. The most compelling was the spam to ham (legitimate email) ratios. We compiled a list of over 50 words with the highest spam to ham ratio.

Words like click and here don't show up as high, since they are used often in legitimate email. It also delinerates that words like madam which is rarely found in legitimate email, while readily found in spam email, had very high ratios. Using this method we created a superior list of words found in spam email. The words are ordered from highest to lowest

View the list HERE.

0 Comments:

Post a Comment

<< Home