[geeklog-devel] Google Summer of Code 2009: Geeklog

saurabh gupta saurabhgupta1403 at gmail.com
Thu Mar 12 15:05:36 EDT 2009


On Fri, Mar 13, 2009 at 12:31 AM, Dirk Haun <dirk at haun-online.de> wrote:

> saurabh gupta wrote:

>

>>> My gut feeling is that our users won't be willing to spend a lot of time

>>> training a spam filter. I may be wrong, though

>>

>>Well, what I thought in this part is that the spam filter will work in its

>>own way initially, but in case if sometimes  a post is made *spam* by

>>mistake, then user can mark it as *not spam* (similar to what we have in

>>gmail) and the spam filter should be intelligent enough to adapt to this and

>>vice versa. Training will be done automatically.

>

> You would still have to save all the posts, at least for a while, to be

> able to correct any false negatives. So apart from the technical issues

> (you currently can't save a post marked as spam such that it could be

> posted properly again later), there's also the issue of having a

> (possibly) really long list of spam posts in a queue.

>

> Of course, you could purge that queue on a regular basis, e.g. delete

> all posts older than 24 or 48 hours. We would have to test how that

> works in real life.


All right. I will too think over this again and will discuss it with you.


--
Saurabh Gupta
Senior,
NSIT,New Delhi, India


More information about the geeklog-devel mailing list