Cyveillance, redux: a number of readers have pointed to this comment by Brian Murray, VP of client services at Cyveillance:
http://news.spamcop.net/pipermail/spamcop-help/2003-June/034004.html
Brian writes:
"In terms of the other concerns expressed in this Forum relating to how Cyveillance gathers information from the Internet, we set the highest standards for our online activities... and we take great pains to ensure that our crawlers minimize the load on other sites servers"
My response:
Brian-
I read your comments on 'setting high standards' and 'taking pains to minimize your crawlers' load on servers' with great interest.
If the 'bot that usually identifies itself as some flavor of IE operating on IP addresses 63.148.99.xxx is indeed yours (as is widely reported on the Net), I would be grateful for your comments on the behavior I and others have noted in our server's logs. Here's a GREP of my Apache Web server's access log, Dec. 15 to present:
http://www.gulker.com/music_industry/63_148_99_log.txt
Please note that the 'bot in question connects repeatedly to long directories and downloads files sequentially without pausing - sometimes more than a hundred in a row - as fast as my relatively modest 144K net connection will allow. A number of other Webmasters have written me with similar experiences.
While this 'bot is connected, my server is all but inaccessible to others, and we at gulker.com are unable to access external sites easily. This 'bot is not well-behaved: it also ignores robots.txt.
So is it yours? If so, when will you apply your stated policy, and fix the darn thing?
Chris Gulker
http://www.gulker.com/
PS. This is probably redundant, since your firm specializes in knowing what's happening on the Net, but there is a category, complete with RSS feed, of information about the behavior of this 'bot:
http://www.gulker.com/categories/cyveillancebot/
Here's an essay I wrote describing my experience with this creature:
http://www.gulker.com/stories/2003/05/06/whatToThinkAboutCyveillanc
And the column I wrote for London's Independent 2 weeks ago:
http://news.independent.co.uk/digital/features/story.jsp?story=408191
And the article about same on Slashdot:
http://yro.slashdot.org/article.pl?sid=03/05/07/0120237
Awaiting a response with great interest...
Comments
9:07:18 AM
|