[geeklog-devel] Google Summer of Code 2009: Geeklog

saurabh gupta saurabhgupta1403 at gmail.com
Mon Mar 16 00:31:59 EDT 2009


On Mon, Mar 16, 2009 at 4:47 AM, Website Master
<websitemaster at cogeco.net> wrote:
> Will there be an API for this so plugins could use it as well?

API for which feature?
 Sorry, I couldn't get for what you are asking.



> -----Original Message-----
> From: geeklog-devel-bounces at lists.geeklog.net
> [mailto:geeklog-devel-bounces at lists.geeklog.net] On Behalf Of saurabh gupta
> Sent: March-15-09 2:39 PM
> To: Geeklog Development
> Subject: Re: [geeklog-devel] Google Summer of Code 2009: Geeklog
>
> On Fri, Mar 13, 2009 at 12:31 AM, Dirk Haun <dirk at haun-online.de> wrote:
>> saurabh gupta wrote:
>>
>>>> My gut feeling is that our users won't be willing to spend a lot of time
>>>> training a spam filter. I may be wrong, though
>>>
>>>Well, what I thought in this part is that the spam filter will work in its
>>>own way initially, but in case if sometimes  a post is made *spam* by
>>>mistake, then user can mark it as *not spam* (similar to what we have in
>>>gmail) and the spam filter should be intelligent enough to adapt to this
> and
>>>vice versa. Training will be done automatically.
>>
>> You would still have to save all the posts, at least for a while, to be
>> able to correct any false negatives. So apart from the technical issues
>> (you currently can't save a post marked as spam such that it could be
>> posted properly again later), there's also the issue of having a
>> (possibly) really long list of spam posts in a queue.
>>
>> Of course, you could purge that queue on a regular basis, e.g. delete
>> all posts older than 24 or 48 hours. We would have to test how that
>> works in real life.
>
> What we can do is to mark the spam comments as spam and group them in
> a spam category (spam queue). To escape the space issue and long list
> of spam comments, we can limit the size of spam list. For example,
> maximum number of spam comments are 30 and whenever new spam comment
> is added, the first one added will be deleted ( first in first out).
> The maximum number of spam comments can also be given as an option to
> the site Admin to set. The benefits of this approach will be :
>
> 1. There will be a second chance for the false positives.
>
> 2. Test Mode (
> http://wiki.geeklog.net/index.php/SoC_spam-x_overhaul#Test_Mode
> ) is implemented in a better way. For example, when any user add a
> regular expression as filter rule, there will be a button to check the
> validity of that expression. It will then check all the comments (or
> some) and according to the latest rule, add the spam comments in the
> spam queue. The user can then see if the expression fed by him is
> proper or not.
>
> 3. Similarly, the *Mass Delete Spam Comments* can be modified and in a
> similar way, the caught spams will be placed in the spam queue.
>
> Other modifications which can be implemented in spam-X plugin are:
>
> 1. While adding an entry in the *Spam-X Personal Blacklist*, it
> doesn't check for the duplicate entries. So, if a user press the
> button *Add Censor list* multiple times, all the entries are added
> multiple times. The same happens with other blacklist entries also.
> So, an API for duplicate check entries can be added in this.
>
> 2. If spam queue is implemented, then whenever, a comment is caught as
> spam, it will be entered in the spam queue and those words will be
> highlighted or underlined (marked) which provided a basis of spam
> detection. For example, whenever a comment is caught as spam because
> its content has the word *xyz*, then this word will be highlighted and
> the comment will be sent to spam queue. This is beneficial to provide
> an overview and idea to the site user to see that which word is
> adulterated and is helpful for the test mode case of spam-X training.
> Another advantage of this will be to implement a use_counter (
> http://wiki.geeklog.net/index.php/SoC_spam-x_overhaul#Use_Counter ) in
> a better way.
>
>
> Comments and suggestions are welcome.
>
>
> --
> Saurabh Gupta
> Senior,
> NSIT,New Delhi, India
> _______________________________________________
> geeklog-devel mailing list
> geeklog-devel at lists.geeklog.net
> http://eight.pairlist.net/mailman/listinfo/geeklog-devel
>
> __________ Information from ESET NOD32 Antivirus, version of virus signature
> database 3937 (20090314) __________
>
> The message was checked by ESET NOD32 Antivirus.
>
> http://www.eset.com
>
>
>
>
> _______________________________________________
> geeklog-devel mailing list
> geeklog-devel at lists.geeklog.net
> http://eight.pairlist.net/mailman/listinfo/geeklog-devel
>



-- 
Saurabh Gupta
Senior,
Electronics and Communication Engg.
NSIT,New Delhi, India



More information about the geeklog-devel mailing list