May 26, 2008

Mozilla/5.0 (MrCarlito-0.1 http://www.mrcarlito.com/spider.html)

bad-behavior
403 Required header 'Accept' missing
Agent: Mozilla/5.0 (MrCarlito-0.1 http://www.mrcarlito.com/spider.html)
64.237.57.194 64-237-57-194.reliableservers.com

MrCarlito-0.1 is an experimental spider that collects header & link information from web pages. The spider is written in PERL (Practical Extraction and Report Language), and uses the LWP::UserAgent Class. Currently this spider does not delve into websites, it simply obtains the headers & hostnames contained in your web page index.


Humm you had better fix this broken bot if you plan on using it for a real website.
Your blocked because you were detected loading webpages not headers.

Mozilla/5.0 (compatible; zermelo; +http://www.powerset.com) [email:paul@page-store.com-crawl@powerset.com

Mozilla/5.0 (compatible; zermelo; +http://www.powerset.com) [email:paul@page-store.com-crawl@powerset.com


blocked by bad-behavior
403 Required header 'Accept' missing
Agent: Mozilla/5.0 (compatible; zermelo; +http://www.powerset.com) [email:paul@page-store.com-crawl@powerset.com]
67.202.57.133 ec2-67-202-57-133.compute-1.amazonaws.com


Another broken bot running on amazonaws.com

www.radian6.com/crawler

Wow another corp snoop bot. see http://www.radian6.com/crawler/


r6_feedfetcher(www.radian6.com/crawler)
r6_commentreader(www.radian6.com/crawler)
142.166.3.122
142.166.170.93
142.166.170.92

It does not follow robots.txt file so you have to email someone to tell them to stop buring up your bandwidth. Hu?

I really hate these corp PR snoops that think you have to sever content to them.
I wonder if they ever thought about the fact that taking my content and serving it up to subscribers (charging for it) without my permission is a criminal copyright violation.

May 15, 2008

IncrediBILL's Random Rants: Impact On Your Bandwidth Will Be Minimal My Ass

This just about sums up where trafic is going today.

IncrediBILL's Random Rants: Impact On Your Bandwidth Will Be Minimal My Ass

skweezer.net open proxy service

Post UPDATED::

See orginal post here This site skweezer.net
is a proxy for moble content. Will allow users that you have banned to bypass your ban and use this site as a proxy.

bad behavior blocks no longer blockes this site. And its domain has changed.
Also no longer inserts adverts around your content but strips out your adverts.

Old ones
65.38.160.138 hugehosting.com
65.38.160.162 hugehosting.com
65.38.160.160 hugehosting.com
New ones so far
65.38.160.138 gwc05.gwcorp.net
65.38.160.156 gwc14.gwcorp.net

Likely more.

Add the domain name to the domain ban file of MMAUTOBAN
hugehosting.com,proxy
gwcorp.net,proxy


And add the ip block to your htaccess file.
deny from 65.38.160.0/24

May 10, 2008

yandex.ru bot

yandex/1.01.001 (compatible; win16; h)
Last Hit From walrus020.yandex.ru 77.88.22.115
First Hit From walrus085.yandex.ru 77.88.22.151


Violates robots file see http://www.braemoor.co.uk

67.202.15.206 compute-1.amazonaws.com www.powerset.com

This company powerset.com says "we employ a small army of PhDs" But they know nothing about building bots. The blog they run won't even take comments without giving a error page.

bad-behavior 403 Required header 'Accept' missing
Agent: Mozilla/5.0 (compatible; zermelo; +http://www.powerset.com) [email:paul@page-store.com-crawl@powerset.com]
67.202.15.206 ec2-67-202-15-206.z-1.compute-1.amazonaws.com

amazonaws.com keeps showing up in my logs. It looks like this is a web hosting div of amazon so we may be able to ban it without banning amazon.

May 1, 2008

List of hacker servers

I ran into this site that is keeping a list of the sites hosting the scripts used to attack your site. The user tries to get your site to run a script located on one of these sites and once it does he can take over your site.
The hacking is explained here http://www.whyron.com/http.htm

List is here http://www.whyron.com/http0.htm

You should add the domains from this list to the hackers.txt file in MMAUTOBAN to users attempting to inject these scripts on your server.

Free submit script for your website.

Ran into this its a free submit form. I dont use it since wrote my own perhaps when I have the time I will make a free version of it.

This one looks like it works just dont use the reply to user options. Since you should never have a form reply to someone because it can be used to relay spam via your server.

GBCF-v3 Secure & Accessible Form Script


While your at it never take input from a form and use that input to create a message headers like To: and subject: always hard code the headers and put the inputed fields inside the body of the message.