|
Contributions to this area are especially welcome, particularly if you are willing to provide the code or a link for other people to use the page. The "no damage" rule applies especially in this area, though it is pretty difficult to do damage via HTML. I have some annoyance pages and ideas of my own: Javascript Trap In this demo, it will abort after 15 clicks, but you will get the idea. try it This is the simple version. In my working version, if I know a name or email of the spammer I will send them an email suggesting they might like to visit www.otherdomain/sales/default.asp?ID=spammer-email and when they get trapped it displays "spammer-email" on the page and sends them an email. I also get an email every time someone gets caught. If there is a value in "spammer-email", I receive that so I know when it works :o) download: simple version (4KB) ASP version (5KB) Email Harvester Clogger Upper top This is the technical name for a piece of code modified from a CGI script that I have forgotten the name of. The premise is that email harvesters work their way through pages looking for email addresses to record. They also follow links looking for more email addresses. On my other domain, I have several email addresses that are only used on web pages, as you probably have already worked out, they are regular spam targets. This page is written in ASP and is pretty lean for what it does. It has 30,000 esoteric words in the database - the original db had almost 300,000 words, but the db was huge, so I kept every 10th word and deleted the rest. The db also has a list of all country codes (.au, .uk, etc) and a list of top level domains (.com, .net,.org etc). The page has to look something like a real web page so the harvester will take it seriously. So I generate a random number < 50 and write that many random words from the db - the text on the page. view sample page with notations Then I generate another random number < 20 and build that many email addresses using randomword@randomword.randomtld.randomcountry so it generates something like solstice@euphamism.mil.nz - The idea of the exotic words is that hopefully it won't actually build a real email address. Then there is a bit more "text" to fill the page. Now, if it just grabbed the email addresses and moved on it would have (say) 15 addresses, so I build some URLs using the same process ending up with something like www.cataclysm.net.il - the trick is that the actual URL they point to is the same page. i.e. the page keeps linking back to itself and every time it loads, it is completely different. On occasions the harvester has gone into a feeding frenzy looping back gobbling up fake email addresses. A couple of refinements are that the URL has the record ID attached as a querystring so it looks like default.asp?ID=1234 - the idea being that the harvester will see the URL as being different to the one that it is on right now and will therefore follow it. I am currently learning ASP.NET and am going to set it up so that it builds URL pointing to a non-existant page e.g. photogenic.asp and then when the page isn't found, it is automatically redirected back to the original page. I get an email everytime it is hit, including the ID number if it has followed the dummy links. The trick is to fool the bot into thinking it has found a new page every time. Refinements to this page are encouraged and more than welcome. A couple of other things to do:
Due to space limitations, I won't be posting the database here. If you are serious about implementing this, drop me a line and I will send it to you. download: source code (3KB) |