Thread: Block Web Content Scrapers and Downloaders

  1. #41
    TopDogger is offline Über Hund
    Join Date
    Jan 2009
    Posts
    1,343
    Thanks
    201
    Thanked 575 Times in 403 Posts
    You will find hundreds of new spiders with bot-trap.
    Last edited by TopDogger; 27 May, 2011 at 13:31 PM.
    You can have it fast, good or cheap. Pick any two.

  2. #42
    dgswilson is offline siteowner
    Join Date
    May 2011
    Location
    N.E. Texas
    Posts
    2
    Thanks
    0
    Thanked 0 Times in 0 Posts
    ## this line won't be read as a command (or at all) so it won't create an error. But, if you get a hash mark in front of some code - then...

    Does anyone know where I can find an up-to-date, comprehensive list of search engine (SE) user agents? I need the names for a whitelist, not IP addresses.

  3. #43
    TopDogger
    Just do a search for 'search engine user agents'. Lots of sites come up.

    List of User-Agents (Spiders, Robots, Browser)

    It seems like recently an entire new group of spiders has been unleashed. I'm currently trapping dozens of new spiders every day.
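
    For anyone wanting to whitelist by name rather than by IP, here is a minimal sketch of how a user-agent whitelist might look in .htaccess. It assumes Apache 2.2-style access control with mod_setenvif enabled. The three search engine names are only examples, not a comprehensive list, and the `good_bot` and `bad_bot` variable names and the HTTrack rule are hypothetical stand-ins for whatever blocking you already have.

    ```apache
    # Tag requests whose User-Agent contains a known search engine name
    # (case-insensitive match). Example names only, not a complete list.
    SetEnvIfNoCase User-Agent "Googlebot" good_bot
    SetEnvIfNoCase User-Agent "bingbot" good_bot
    SetEnvIfNoCase User-Agent "Slurp" good_bot

    # Hypothetical block rule standing in for your existing deny rules.
    SetEnvIfNoCase User-Agent "HTTrack" bad_bot

    # With "Order Deny,Allow", a request that matches both a Deny and an
    # Allow is let through, so the whitelist overrides the block rules.
    Order Deny,Allow
    Deny from env=bad_bot
    Allow from env=good_bot
    ```

    Keep in mind that a User-Agent string is sent by the client and can be spoofed, so a name-based whitelist will also admit any scraper that claims to be one of these engines.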
    Last edited by TopDogger; 30 May, 2011 at 12:27 PM.

  4. #44
    TopDogger
    If you like Blekko, add these IP ranges to your Allow list.

    Allow from 38.99.96.0/24
    Allow from 64.13.159.0/24
    Allow from 199.87.248.0/21
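
    The `Allow from` lines above are Apache 2.2 syntax; on Apache 2.4 they still work if mod_access_compat is loaded. As a rough sketch only, the same idea in native 2.4 syntax might look like the block below. It assumes your block rules set an environment variable, here given the hypothetical name `bad_bot`, and that you want the Blekko ranges admitted regardless of that flag.

    ```apache
    # Apache 2.4 (mod_authz_core / mod_authz_host) sketch.
    # A request is granted if ANY of the following is satisfied.
    <RequireAny>
        # Blekko ranges from the post above: always allowed.
        Require ip 38.99.96.0/24
        Require ip 64.13.159.0/24
        Require ip 199.87.248.0/21

        # Everyone else is allowed unless flagged by the (hypothetical)
        # bad_bot environment variable set by your block rules.
        <RequireAll>
            Require all granted
            Require not env bad_bot
        </RequireAll>
    </RequireAny>
    ```

    Avoid mixing the old `Order`/`Allow`/`Deny` directives with `Require` in the same context; pick one style for a given .htaccess file.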

  5. Thanked by:

    Will.Spencer (26 March, 2012)
