Fighting back against spam
SpamDope is inspired by spampoison. It identifies a bad crawler and, once that crawler has been identified as a spam bot, keeps serving useless data for every further request it makes to the same server, regardless of the request method or URL. This edition is PHP based, so the information below assumes you are using the software stack commonly associated with PHP. Please visit the spamdope wiki for more information on the methodology.
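The flag-then-feed behaviour described above could be sketched roughly as follows. This is a minimal illustration and not the spamdope source: the function names, the file-based flag storage keyed by client address, and the `$looksLikeBot` input are all assumptions, and how a crawler is actually identified is left to the methodology on the wiki. The name and domain lists are passed in as plain non-empty arrays here; in the real project the domains would come from the imported database.

```php
<?php
declare(strict_types=1);

// Hypothetical helper: marker file path for one client address.
function flagPath(string $dir, string $ip): string
{
    return $dir . '/' . sha1($ip) . '.flag';
}

// Remember that this client was identified as a spam bot.
function markAsSpamBot(string $dir, string $ip): void
{
    if (!is_dir($dir)) {
        mkdir($dir, 0700, true);
    }
    touch(flagPath($dir, $ip));
}

// True once the client has been marked by markAsSpamBot().
function isFlagged(string $dir, string $ip): bool
{
    return is_file(flagPath($dir, $ip));
}

// Build a short, plausible-looking list of useless addresses.
// Both arrays must be non-empty. The list is kept small on purpose.
function decoyPage(array $names, array $domains, int $count = 5): string
{
    $items = '';
    for ($i = 0; $i < $count; $i++) {
        $name = $names[array_rand($names)];
        $domain = $domains[array_rand($domains)];
        $email = strtolower($name) . '@' . $domain;
        $items .= '<li>' . htmlspecialchars($name) . ' &lt;'
            . htmlspecialchars($email) . "&gt;</li>\n";
    }
    return "<ul>\n" . $items . "</ul>\n";
}

// Returns decoy HTML for flagged clients, or null for everyone else.
// The request method and URL are deliberately not consulted.
function handleRequest(
    string $dir,
    string $ip,
    bool $looksLikeBot,
    array $names,
    array $domains
): ?string {
    if ($looksLikeBot) {
        markAsSpamBot($dir, $ip);
    }
    return isFlagged($dir, $ip) ? decoyPage($names, $domains) : null;
}
```

In a directory index this might be called once per request with `$_SERVER['REMOTE_ADDR']` as the address; a `null` result means the visitor has not been flagged and the normal page should be rendered. Keying on the IP address alone is a simplification chosen for the sketch.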
Assuming you have "AllowOverride" enabled in your server config, the supplied .htaccess file takes care of re-routing all requests to the directory index as specified. If you do not like this, feel free to change it, but consult the spamdope wiki first about why it is done this way. Just pull the spamdope source into your test server docroot and test as you see fit. The data is a compressed (tar.gz) SQL dump for import into a MySQL database and is located in the src folder.
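For orientation, a re-routing .htaccess of the kind described above might look roughly like the fragment below. It is a sketch only and the file shipped with spamdope may differ; the index file name index.php is an assumption. Apache needs AllowOverride to permit at least FileInfo (for the rewrite directives) and Indexes (for DirectoryIndex) before it will honour these lines.

```apache
# Sketch only: the .htaccess shipped with spamdope may differ.
# index.php is an assumed name for the directory index.
DirectoryIndex index.php

<IfModule mod_rewrite.c>
    RewriteEngine On
    # Skip requests that already target the index, to avoid a rewrite loop.
    RewriteCond %{REQUEST_URI} !/index\.php$
    # Send every other request, whatever its path, to the directory index.
    RewriteRule ^ index.php [L]
</IfModule>
```

Note that a catch-all rule like this also captures requests for static files such as the image and CSS, so a real setup would need either an exemption for those or an index script that serves them.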
If you already have the spamdope database, just ignore the spamdope.tar.gz file for a quicker clone or pull. Change the details in cfg/database.json to match your own database. The supplied data comes from various sources that freely publish the domains currently sending unsolicited email, the same kind of information used by leading tools such as "spamassassin". Please note that the data supplied here is meant for testing; if you want to be more effective, update your own data from a reliable source at regular intervals, at least once a month.
Change the supplied image and CSS to suit your needs. The point is to make it look like a "normal" site, so that it is harder for the spam bot programmer to identify the served information as "fake". Also note that serving larger lists of data increases the chance of it being identified as "fake", so try to keep it as close to a "real" site as possible, where you will usually not find thousands of names and email addresses on one page. Refer to the wiki for more info ;)
By using this you are helping to fight back against spam: you feed spammers useless information that points to excessively spam-rated and blacklisted domain names, effectively giving them the chance to spam each other and rendering large portions of their data useless. This results in fake lead generation and discourages bulk advertising organizations from buying leads from useless data sources.
Please feel free to modify the code, suggest changes, etc.