[geeklog-devel] Google Summer of Code 2009: Geeklog
saurabhgupta1403 at gmail.com
Thu Mar 12 10:03:37 EDT 2009
On Wed, Mar 11, 2009 at 8:38 AM, saurabh gupta
<saurabhgupta1403 at gmail.com> wrote:
> What I am thinking of is making the spam-handling engine portable.
> Suppose a user has run a site for a long time, and its spam engine has
> evolved and accumulated statistics (such as the counts of spam filters used,
> blocking keywords for detecting spam, and other intelligence). If
> that user or another one then sets up another site using Geeklog, he has the
> option to use the same spam intelligence on the new site. The information
> can be made portable by storing it in a database or an XML file. A nice GUI can
> also be created which simply takes the source site and the destination
> site and merges the spam-engine intelligence of the two.
All right. The idea is more or less the same as SWOT. I have now gone
through the SWOT idea and would like to discuss it with the
community. The implementation mainly consists of parsing XML
files. I have already worked on parsing XML files using
libraries like msxml and nsxml (neither of which is open source,
however) and some of my own wrappers.
Besides the ideas and to-do list given on the wiki, the following
things can also be implemented:
=> Adding one more element to each item in the RSS file to indicate the
validity of that blacklist item. For example, if a blacklist item has
the title "casino", then a new entry under the item for "casino" will
tell how many posts have been recognized as spam by the keyword
"casino". This makes it possible to judge the validity of any
blacklist item: sites subscribing to the RSS feed can decide whether
to import a blacklist item based on the number of posts containing
its keyword. The overhead of this idea is that the RSS feed file has
to be updated for each post that is recognized as spam.
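
To make the idea concrete, here is a rough sketch in Python of how a
subscribing site might filter the feed. It is only an illustration under
assumptions: the <hits> element, the sample feed and the function name
select_blacklist_items are all hypothetical and not part of RSS 2.0 or of
any existing Geeklog code.

```python
import xml.etree.ElementTree as ET

# Hypothetical feed. The <hits> element is an assumed extension that
# carries the number of posts recognized as spam by that keyword.
SAMPLE_FEED = """<rss version="2.0">
  <channel>
    <title>Example blacklist feed</title>
    <item><title>casino</title><hits>128</hits></item>
    <item><title>poker</title><hits>3</hits></item>
    <item><title>lottery</title></item>
  </channel>
</rss>"""


def select_blacklist_items(feed_xml, min_hits):
    """Return (keyword, hits) pairs whose hit count is at least min_hits."""
    root = ET.fromstring(feed_xml)
    selected = []
    for item in root.iter("item"):
        title = item.findtext("title", default="").strip()
        hits_text = item.findtext("hits", default="0").strip()
        # Items without a usable count are treated as having zero hits.
        hits = int(hits_text) if hits_text.isdigit() else 0
        if title and hits >= min_hits:
            selected.append((title, hits))
    return selected


if __name__ == "__main__":
    # Import only keywords that have caught at least 10 spam posts.
    for keyword, hits in select_blacklist_items(SAMPLE_FEED, 10):
        print(keyword, hits)
```

The threshold would be left to each subscribing site, so a cautious site
could import only keywords with a high count.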
Comments, suggestions and feedback are most welcome.
NSIT, New Delhi, India