Of spooks and spam: What to do when the ŒFBIı spams your Web site

 

By Chris Gulker

 

The FBI came Œknockingı the other day here at gulker.com. They wanted me to know that my Weblog, an expression of free speech protected by the U.S. Constitutionıs First Amendment was now included on their Œsuspectsı watch list.

 

Or, at least thatıs what I initially made of a reference in my Web serverıs access log. Web servers, the 21st Centuryıs version of Gutenbergıs press, if you didnıt already know, create a record of everybody who visits.

 

And a Weblog, also for those who donıt know, is a personal online diary of things that interest me, maintained in reverse chronological order. Mine is served by an aging Macintosh running Apache, the Open Source Web server software. Apache records, among other things, the IP address of every Web visitor, as well as something called the Œreferrerı.

 

The referrer is a line that, in theory, tells me who sent the visitor my way. For example, if you clicked a link on the Independentıs Digital Web page that sent you to www.gulker.com, the access log would record a line that would include the following:

 

209.220.11.66 ³GET index.html² ³http://news.independent.co.uk/digital²

 

Translated, this means someone whose computer was using the IP address 209.220.11.66 accessed the home page (Œindex.htmlı), which they got to by clicking a referring link on the Independentıs Digital page.  So, imagine my surprise when these Œreferrerı lines appeared:

 

http://homeland.fbi.gov/Watchlists/suspect/view.jsp?record=895754

 

http://homeland.fbi.gov/Watchlists/suspect/view.jsp?record=948082

 

FBI? Watchlists? Suspect? Uh, oh Houston, I think we have a problem, hereŠ

 

A nerdy Sherlock would infer from these lines that 2 pages of my Web site had been recorded in a database maintained on a server named Œhomelandı belonging to the FBI.  Were shadowy figures lurking in a spooky government facility perusing my Œsuspectı Web pages?

 

But things arenıt always what they seem to be. The pages that homeland.fbi.gov had supposedly viewed were 2 rather dry, technical treatises, not some of my more outspoken rants expressing deep reservations about my nationıs current foray into Iraq. Curious, I thought.

 

A quick check of the worldıs Domain Name Server records, showed no entry for Œhomeland.fbi.govı. However, a Google search revealed some 200 pages containing  Œhomeland.fbi.govı. 

 

Diving into those pages, it was apparent that dozens of Weblogs had seen the same thing. Brent Simmons, a Seattle-based programmer had seen them on his utterly apolitical Weblog.

 

Brentıs Weblog allows visitors to leave comments: many of those comments, left by  other programmers proclaimed the whole affair a hoax.  The same opinion was offered by a number of computer scientists on an email list where my experience was posted.

 

It turns out that itıs easy to spoof the Œreferrerı line: a programmer with only modest skills could write a short program called a script that would cause the entries seen at  gulker.com and elsewhere.

 

So, hoaxed againŠ but it then occurred to me that the hoaxer had chosen a very unusual medium through which to perpetrate this mischief. A Web serverıs access log is hardly email or a Web page.

 

But the hoaxer succeeded, knowing that I, like other Webloggers periodically scan these logs to see whoıs been visiting.  Many have even observed Œreferrer spamı in which a site records a sudden surge of hits.  When the curious victim clicks the referring link they get a page advertising the usual spammer dross.

 

Indeed one firm touts its ŒReferrer Advertisingı services, but they may be ruing the day.  Turns out a miffed programmer wrote and posted a script that proved popular: when the firmıs software visits a Weblog with a spam link, the script returns a referrer link of its own, replete with a lengthy, and highly unpublishable admonition.