[geeklog-devel] Google Summer of Code 2009: Geeklog

saurabh gupta saurabhgupta1403 at gmail.com
Thu Mar 12 15:05:36 EDT 2009


On Fri, Mar 13, 2009 at 12:31 AM, Dirk Haun <dirk at haun-online.de> wrote:
> saurabh gupta wrote:
>
>>> My gut feeling is that our users won't be willing to spend a lot of time
>>> training a spam filter. I may be wrong, though
>>
>>Well, what I thought in this part is that the spam filter will work in its
>>own way initially, but in case if sometimes  a post is made *spam* by
>>mistake, then user can mark it as *not spam* (similar to what we have in
>>gmail) and the spam filter should be intelligent enough to adapt to this and
>>vice versa. Training will be done automatically.
>
> You would still have to save all the posts, at least for a while, to be
> able to correct any false negatives. So apart from the technical issues
> (you currently can't save a post marked as spam such that it could be
> posted properly again later), there's also the issue of having a
> (possibly) really long list of spam posts in a queue.
>
> Of course, you could purge that queue on a regular basis, e.g. delete
> all posts older than 24 or 48 hours. We would have to test how that
> works in real life.

All right. I will too think over this again and will discuss it with you.


-- 
Saurabh Gupta
Senior,
NSIT,New Delhi, India



More information about the geeklog-devel mailing list