logo_text_trans.gif
Click to see the XML version of this web page.
Saturday, May 10, 2003

Interesting: Cyveillance's 'bot ignores robots.txt (it hasn't accessed that file on gulker.com in nearly 1000 visits), but Cyveillance publishes a robots.txt on their site. Hmmm... so what's good for the goose is not good for us ganders?
Comments [ ] 11:55:16 PM    

Cyveillance will analyze your online brand for free: go to this page and enter a brand that you want to analyze (I tried my name) and it will tell you how many domains you have, how many pages you show up on (and breaks them out by conventional, violence, porn etc.), how often you turn up in links, etc.

Interesting... but I sense something's missing... here are the results of 3 searches on names in both Cyveillance and Google and the number of hits they return:

Search NameDave WinerHilary RosenChris Gulker
Cyveillance2526022
Googlebot130,00031,00030,500

Hilary Rosen is CEO of RIAA, and much in the news lateley. Dave Winer is a Berkman Fellow at Harvard, and a much-read observer of technology. So, I'm either misunderstanding what Cyveillance means by 'home pages' or the universe that Cyveillancebot crawls is miniscule compared to Googlebot's. Googlebot touches gulker.com about 1000 times more often than Cyveillancebot: if that's any measure of the relative crawl-power of the two 'bots, then you'd expect to see 1000x fewer hits on a given name on Cyveillance, which is very roughly the case...
Comments [ ] 7:05:56 PM    


Photo District News has an interview with Brian Walski, former LA Times photog who doctored a photo. He's not happy with himself... and showing as much class as a guy in his shoes could be expected to...
Comments [ ] 3:25:05 PM    

Another interesting (and amusing) referrer:

209.86.105.35 - - [07/May/2003:11:50:41 -0700] "GET /graphics/left_bg.jpg HTTP/1.1" 200 15741 "http://www.gulker.com/graphics/left_bg.jpg" "Mozilla/8.0 (compatible; If trivial copyright encryption is a digital electrical fence, the DMCA is digital piss.)"

Amazing the media people use to communicate these days....
Comments [ ] 3:19:05 PM    


Phil Ringnalda comments that some 'bots treat robots.txt as a menu, not a fence. I've set gulker.com's version to help study this behavior...
Comments [ ] 2:09:05 PM    

Cyveillancebot found the first part of its 'admirers' site:

63.148.99.232 - - [09/May/2003:16:32:35 -0700] "GET /stories/2003/05/06/whatToThinkAboutCyveillanc HTTP/1.1" 200 23742 "http://archipelago.phrasewise.com/" "Mozilla/4.0 (compatible; MSIE 5.05; Windows NT 5.0)"

63.148.99.232 - - [09/May/2003:16:32:46 -0700] "GET /2003/05/05.html HTTP/1.1" 200 27713 "http://archipelago.phrasewise.com/" "Mozilla/4.0 (compatible; MSIE 5.05; Windows NT 5.0)"

63.148.99.232 - - [09/May/2003:16:50:38 -0700] "GET /2003/05/05.html HTTP/1.1" 200 27713 "http://archipelago.phrasewise.com/" "Mozilla/4.0 (compatible; MSIE 5.05; Windows NT 3.51)"

63.148.99.232 - - [09/May/2003:16:50:42 -0700] "GET /stories/2003/05/06/whatToThinkAboutCyveillanc HTTP/1.1" 200 23742 "http://archipelago.phrasewise.com/" "Mozilla/4.0 (compatible; MSIE 5.05; Windows NT 5.0)"

63.148.99.232 - - [09/May/2003:21:31:28 -0700] "GET /music_industry/cyveillancebot.html HTTP/1.1" 200 5989 "http://www.blogue.com/howto/" "Mozilla/4.0 (compatible; MSIE 5.05; Windows NT 5.0)"

Be very interesting to see what it does next... it still hasn't found the full site...
Comments [ ] 12:59:50 PM    




Top of page | Home | About gulker.com | About Chris Gulker

Updated 6/1/03; 5:39:11 PM

Chris Gulker's view from Silicon Valley - in words and pictures



Updated 6/1/03; 5:39:11 PM

May 2003
Sun Mon Tue Wed Thu Fri Sat
        1 2 3
4 5 6 7 8 9 10
11 12 13 14 15 16 17
18 19 20 21 22 23 24
25 26 27 28 29 30 31
Apr   Jun




Dotcom Garden
Picture Weblog
Random Access (soon)
Search
Venture News
Weblog Metrics

gulker.com Cam
gulker.com Cam



Natalie d'Arbeloff
Azeem Azhar
Ken Bereskin
Blogging Ecosysytem
Blogging Network
BlogStreet
Boing Boing
Tim Bray
Matt Croydon
DaveNet
Rael Dornfest
Esther Dyson
Dave Farber's IP
Dave Fitch
David Galbraith
William Gibson
Dan Gillmor
James Gleick
Bernie Goldbach
Meg Hourihan
Joi Ito
Xeni Jardin
Jeff Jarvis
Linux Journal
Mitch Kapor
Kuro5hin
Gunnar Langemark
Joshua Levy
Scott Loftesness
Macintouch
Ross Mayfield
Hans Moravec
Rafe Needleman
Nonsense Verse
OS Opinion
Tim Porter
Recommended Reading
Reverse Cowgirl
Glenn Reynolds
Roger Ridey
Phil Ringnalda
John Robb
Scott Rosenberg
Anita Rowland
Brent Simmons
Robert Scoble
Doc Searls
Gavin Sheridan
Shifted Librarian
Stefan Smalla
Bruce Sterling
Scripting News
Slashdot
Dan Shafer
John Tringham
Jon Udell
Moicho Umeda
Kevin Werbach
Amy Wohl

Click here to visit the Radio UserLand website.

Subscribe to "www.gulker.com - words and pictures from Silicon Valley" in Radio UserLand.






Google