When I first came to Canada I worked at SoftQuad. SoftQuad was one of the first SGML companies, well known (in some circles, anyway) for its President, Yuri Rubinsky. And well known in many other circles for its HTML editor, HoTMetaL. The Surrey office did most of the development work on HoTMetaL and it was my main focus for quite some time. So it was with a certain amount of nostalgia that I saw HoTMetaL listed on eWeek’s Jim Rapoza Picks the Top Web Technologies of All Time — gone but not forgotten, as they say. Thanks to Kim for sending me the link.
Category: Technology
Google Cloaking
The aff sites are still framing my site, and the number of hits and amount of bandwidth is certainly not decreasing with time (details at previous posts, if you want to catch up on the story). I’m still not entirely sure what they’re doing, but the script they use (if you fetch the pages with a command-line tool) has specific instructions for Google and other search engines, so there’s obviously some reason for that.
As far as I can tell, this is a classic cloaking attack, and, to quote Wikipedia as of the time I read the article, “major search engines consider cloaking for deception to be a violation of their guidelines, and therefore, they delist sites when deceptive cloaking is reported”. So I figured that was worth a try and filled out the form at Report a Spam Result (for your entertainment, the search query I put in was “adul riendfinder.com”, which illustrates the problem nicely).
Has anyone else ever tried this and have it work? Any hints? I submitted the form over a week ago, and have seen no results yet. I thought I’d try with Google first since they generally are quick at updating their indices (they were certainly quicker at flushing the hacking results than Yahoo).
Privacy in the Internet Age
Darren Barefoot has some interesting thoughts about privacy in the internet age and the way in which today’s north american teenagers are growing up posting everything about their lives on the internet. Up till now, most of the discussion I’ve read on the subject has revolved around the effects on future careers of posting potentially embarrassing stuff on the web. Derek Miller points out that bosses will also have embarrassing stuff up on the web, although there will still be a generation gap there for some years until those future bosses become bosses (assuming that most bosses will still continue to be older than many of the people they employ).
We’re starting to discern the outlines of some likely effects of this now. For example, if I get an interesting email from someone I haven’t heard of, I’ll look them up in Google or Yahoo search, or LinkedIn. I don’t necessarily ignore the email if I don’t find any information about the person, but I can see that happening in the future — if you don’t exist in search engines, is that going to be considered weird?
Finding people I’ve lost touch with is getting easier every year, as long as they haven’t changed their name. I’ve managed to track down old friends, and others (who did change their name after marriage) have managed to track me down. Mind you, I’m relatively easy to find.
One effect I’m wondering about is on politicians: currently politicians either have to be squeaky clean, or good at hiding things the electorate might not like to hear about. Rudy Giuliani’s personal life includes three marriages and gay friends, all well-documented; in previous years this would have made a presidential campaign basically impossible. Now it just makes it more difficult, or maybe it’s just discussed more; in future years when more information is available about everyone on the internet, and hiding these things is going to be impossible, will voters be more accepting?
One interesting aspect to this is how little information is available about Google’s founders — and more than a little ironic, given how easy they’ve made it to find information about other people. An article on Mother Jones, via Bruce Schneier, has more.
Hacked!
On top of being framed (and yes, they’re still there), my site was recently hacked. Somehow someone managed to edit a post, adding a script and a bunch of porn keywords to two posts. And managed thereby to elevate their site to the front page of Google searches on those strings, in some cases the number one hit, so it’s clear why they did it. I found these while browsing through the search engine strings (teen porn keywords are not usually searches that find my site), found the posts and stripped out the offending divs. It’s not obvious to me how they got in, but since the WordPress development blog has been warning of security exploits, I assume it’s one of them. So I upgraded to the latest version, 2.1, and would advise anyone else running WordPress to do the same.
Between the AFF people and these hackers, I do sometimes wonder whether blogging is worthwhile for someone like me, who doesn’t blog a lot. Sort of takes the fun out of it.
They’re Back!
The aff people, that is. I am too, after some time spent in Australia with little access to the internet. Enough to notice my stats, but I had no inclination to actually do anything about them.
Looking at the logs, the aff people went away on December 18 and started sending referrers my way again on December 23. Nothing seems to have changed about what they’re doing, and the explanation put forth by commenters to the original post that it’s likely an attempt to scam the affiliate program of an adult site seems the most likely explanation. I’m still not sure what (if anything) to do about this; if the bandwidth usage becomes excessive I’ll use one of the methods suggested by the commenters to my original post.
In the meantime, I guess I’d better post some more interesting content…
Post results
The light shone on that dark corner of the internet (see my Framed! post, as well as Tim’s Framing Lauren linked post) and it brought results. If you haven’t read my posting, go ahead; there’s too much information to usefully summarize here and the comments are good too.
After reading all the comments that came in (including some private email), Tim and I chatted a bit with Paul Hoffman, noted IETF/IMC heavy. A likely explanation (as pointed out by many commenters) is that the purveyors of the “aff” sites were probably trying to run an affiliate scam on the Adult Friend Finder site. They probably chose my site to frame because Tim has a high Page Rank and often links to my site, and thus my Page Rank is also reasonably high. Paul checked and found that they were doing their own name serving, hiding themselves quite effectively from us (well, given more time and effort I’m sure Paul could have dug up more information, but it didn’t seem worth it).
I posted my piece on Saturday and Tim pointed to it not long afterwards. By Sunday afternoon the “aff” sites were noticeably slower and seemed to be going off the air. The last referrer I had to my site was from site 23 on Monday December 18th at 10 am local time; pinging the address shows it’s still there but there is no longer an http server attached to it. The site is still in the Google index but I expect that to go away at some stage as well. So they reacted quickly; I expect we will never know the entire story. I’ve learned two things though: keep a closer eye on my access logs, and post about things that look weird.