Even by analytics standards...Bastards.
A privacy researcher has revealed the evil genius behind a for-profit web analytics service capable of following users across more than 500 sites, even when all cookie storage was disabled and sites were viewed using a browser's privacy mode. The technique, which worked with sites including Hulu, Spotify and GigaOm, is …
Since they are exploiting HTTP headers (as "xlq" explains very well below), they can technically be any resource at the site. You could even just make it look like a nice safe image, object or other url and then transparently use server-side processing (think Apache mod_rewrite style url rewriting) to pass it to a server "script" or module to do the rest.
Assuming Kissmetrics jealously guard their server-side processing so that it is all conducted on servers from that one main domain, then this could be blocked by domain or hostnames. However, if they either use other domains or the server script can be shared or even if *any* website you visit decides to transparently redirect resource urls to one of Kissmetric's domains, then potentially you could never even realise they are doing it.
If you are truly paranoid, short of disabling all the headers mentioned by xlq and probably significantly slowing your Internet connection since almost nothing would be cached client-side, then the only way to prevent this is as mentioned at the bottom of the article: "block all cookies and clear the browser cache after each site visited".
If this were to become widespread, it pretty much undermines every existing notion of browser privacy control since it directly abuses the HTTP protocol. Truly unpleasant!
I'm mixed about this. Often, one is browsing content for free - which is basically funded by advertisers.
Advertisers pay a specific amount, based on what they can expect to get in return. Think of it simply as:
(a) Random ad image
(b) Targeted ad image that should be something the visitor MIGHT be interested in
(c) Focused ad that definitely is "up the alley" of the visitor.
A website (Reg, Ars, etc) must pull in enough money to survive. (a) ads pay the least, (b) more since users are more likely to click, and (c) the most.
I'm on the fence. I have trained myself to ignore virtually all ads (and, yes, when I see the rapid spastic "Click The Monkey", I adblock or RIP it). However, I do click on some - if they are things I'm genuinely interested in.
Like with TiVo; I have often reviewed to watch something that looked interesting. Often, movie trailers that I merely add yo my Netfix queue :)
The problem is that the adverts get more and more intrusive, and more and more irritating, so people end up writing tools to get rid of them. This ends up throwing the unobtrusive ads out with the awful popunders and flash animations that trot out into the middle of the screen and strobe at you; everyone loses out in the end.
I'd like more sites to offer optional subscriptions. I'm a grown up now with a salary; I'm prepared to part with some of my hard-earned if it turns my favourite bits of the internet back into readable, useable sites.
How much do el reg expect to earn from each reader using ads?
Funding "free" content with advertising is the site owner's choice and theirs alone. If I choose to filter the ads, that is my choice too. There is absolutely no moral obligation on my part to accomodate a web site's chosen business model any more than I am obliged to sit through the ads on a taped TV show (yah, I know I'm dating myself here). If a site can't survive because people block their ads or tracking bugs, obviously their business model is rubbish.
Number 6: Where am I?
Number 2: In the Village.
Number 6: What do you want?
Number 2: We want information.
Number 6: Whose side are you on?
Number 2: That would be telling. We want information... information... information.
Number 6: Who are you?
Number 2: The new Number 2.
Number 6: Who is Number 1?
Number 2: You are Number 6.
Number 6: I am not a number, I am a free man.
Cue demonic laughing from Number 2!
I'm not against tracking site visitors, I do it myself, but when you track people over so many sites such that you can build a pretty accurate profile of the person, presumably to chuck relevant ads at them, well that's just creepy.
And resurrecting data when the user has tried to consciously delete it isn't just creepy, it's wrong.
This is where that sodding EU cookie law goes wrong; a site owner having a look at anonymous metrics on one site that they own is a far cry from this sort of thing, and it can't all be lumped together in one piece of legislation.
Not so simple. The IP range in question is Amazon WS which I doubt you want to block completely. I have kissmetrics.com in the squidGuard blacklists already but that's trivial to circumvent if they start hosting that script locally or adding A records pointing to that host on the client DNS.
Bit of a bugger, really. I'm tempted to create a ClamAV signature matching that script's content and use Squid's Clam redirector. That would stop it dead - until they change the script. Snort might also come in handy...
What needs to happen is for NoScript to be able to detect where page markup is being created by document.write (possibly by using regexes to search/replace instances of document.write followed by literal strings and replace them with NULL, or by parsing variables only where used in conjunction with document.write) and converting them back to raw HTML markup without running any other script on the page. Something along the lines of ~= s/document\.write(['|"]//g; perhaps.
Why has the internet turned into a fucking warzone for the greedy and unscrupulous? Why must we constantly be waging an endless arms race to defend our right not to be tracked, spied on, and exploited?
"Why has the internet turned into a fucking warzone for the greedy and unscrupulous? Why must we constantly be waging an endless arms race to defend our right not to be tracked, spied on, and exploited?"
Because people use it, and that is what people do. If you were greedy and unscrupulous, and you had an idea for making money using the Internet, you would put it into practice - that's what it means to be greedy and unscrupulous.
..... doesn't make it clear how users go about opting out tracking."
heck, as far as I am concerned, I don't even need to know about KISSmetrics, the website that I am _visiting_ should be the one that give me the option to opt-out.
All of these parasites are the same - their business model relies on it being 'opt-out' because nobody in their right mind would opt into it.
This sort of thing should be made a criminal offence under international law - in my opinion, data gathering of this sort, which as you point out, is akin to stalking, violates the human right to a private life. Like piracy on the high seas, these pathetic excuses for human beings should be shot on sight.
The ETag (entity tag) value is part of the HTTP protocol and is used for caching. It represents the version of a particular resource. On the first request, the browser stores the ETag value it received. On a subsequent request, the browser will send an If-None-Match header with the old ETag value, to avoid downloading the page again if the ETag value is the same.
All you have to do is use a unique identifier for the entity tag and the browser will later return it, just like a cookie. This isn't new. It's one of the methods evercookie (http://samy.pl/evercookie/) uses.
There are a few Firefox add-ons that you can use to prevent this. One that I use is "Modify Headers", which can be set to filter the If-Match, If-None-Match, If-Modified-Since, If-Unmodified-Since, etc. headers. (Yes, the last modification date can also be used for tracking.)
as a way of not continuously downloading the same image/... but getting a new version if it changes. So disabling ETAGs effectively makes the Internet run more slowly for you since your browser won't cache so well.
What I really dislike about this is the cross site tracking. I can accept a site remembering me while I visit it but don't want the next site to know anything about what I did elsewhere.
I said that I use the "Modify Headers" add-on to prevent this.
I found out today that the Modify Headers add-on doesn't actually work with cache-related headers like If-None-Match because Firefox inserts those headers before the add-on has a chance to filter them. That'll teach me not to check things!
Now I've installed and configured privoxy to filter those headers instead. It definitely works now.
Just wanted to point that out, so as not to leave misinformation in my name.
Adblock, Cookie Monster, Better Privacy, flashblock, maybe NoScript (I don't bother).
Set your browser to flush the cache on exit.
Evercookie doesn't work against this setup. I see very few ads on the net. If a site needs session cookies to work I can enable them temporarily or permanently as needed. If a site I trust (el reg) wants them I can enable them.
I can stop facebook logos loading when I'm not on facebook.com, kill scripts that slow everything down unnecessarily, generally make the internet a nicer place to be. If a site wants to track me then they can. I'm just not going to let my browser help them.
From Soltani's own website, near the top of the page linked - "Hulu and KISSmetrics have both ceased respawning as of July 29th 2011."
So was this article supposed to have been published a month ago or is it just scaremongering? And, what happened to the death of the Reg icon?
I know it's not addressing the root problem, but if they track you to display ads at you, why not just install adblock and block the ads?
Of course, there are more sinister purposes that tracking can be used for, so see the above posters ideas. I'd also chuck in blocking the relevant KISS hosts at the firewall - all ports, incoming or outgoing.
Agree with other - HTF can this be opt-out, when you don't even know if you're being stalked? They can GTF with that...
I'm not taking sides, but just pointing something out.
Once you get over the notion that web browsing is something happening in your home (where you do have a legal right to privacy), and realize that you (your data at least) is leaving the house and visiting public servers which have every right to track you while visiting the sites.
Analogy: If you go to the grocery store, can you honestly complain when they ask you to remove the ski mask from your face so the security camera can get a good look at you? If you don't agree with the advertising or tracking, do not use the site, it's that simple. (I don't have a Google+ for this reason) If I run a website, I have every right as a private businessperson to run that site in any way I see fit providing I comply with sales and content laws, and of course disclose certain things.
Getting back to the grocery store analogy, how is the information that gets stored via Internet any different than what the grocery store sees on their security camera? You pull up in the same car (browser), waddle in to purchase your case of twinkies (browse content), and pay with cash (don't log in) so that nobody will know that YOU are the 400lb guy with curly black hair and a Ford Taurus who likes twinkies.
To be honest, I'd rather have ads that I might actually *like* appear on my preferred sites, instead of the recent influx of ads having to do with being Mormon. I'm going to see advertising anyway, at least this way, I get something that might be interesting.
Does the grocery store do this?
* Have someone write down your license plate number when you arrive.
* inventory what you buy, as well as what products you seem to look at.
* catalog those results and store them for later analysis.
How about this?
* share that information with someone at your local pub (oh look, they Ford Taurus with plate XYZW1234 is here, from his shopping habits at the store, he's a family man, but today he picked up a box of tampons, so let's try to sell him an additional beer).
You are correct about not going to the site if you disagree with its tracking policy. Here's the rub: they're NOT telling you what they're doing.
AC, even though I'm beginning to doubt it will do me any good :-)
"* inventory what you buy, as well as what products you seem to look at.
* catalog those results and store them for later analysis."
Actually, yes, they appear to do that, at least here in Merka. Except for the "what products you seem to look at" (as far as I know, wouldn't be surprised...).
If you pay with a credit card, or, worse, use one of those "loyalty cards" things that give "discounts" (i.e. remove the artificial increase in price).
It's easy to know: go return/exchange a product to, say, Target or Apple. If you paid with CC, all they need is your CC to accept it, you don't need a receipt. It happened to me recently: my 6th gen nano (crap, but got as a gift from the GF...) broke the other day. I went to the Apple Store to exchange it, and had no receipt (but since the thing was released less than year ago, it must be under warranty). The guy got the serial number, and got me a new one. The receipt had my GF's name written on it, date it was bought, etc. When I told her that, she mentioned she had the same happen to her at Target. So, yeah, they do inventory what you buy, and I'm sure they use it later, and they don't ask for permission to collect nor keep the data -- at least I haven't been asked to sign anything.
Of course it's easy to not use either, pay cash... much easier to circumvent the disgusting web tracking those guys are doing.
"Analogy: If you go to the grocery store,..."
In this particular case, the problem isn't that the grocery store knows what you buy... It's more analogous to that annoying neighbor that you try to avoid -- the one that always buttonholes you with a new "sure thing" that he's always trying to get you to sign up for -- that knows what you bought at the grocery store, the bed-n-bath store, the pharmacy, the newsstand, and that "club" in the next town that you go to on Saturday nights.
I don't see where I have any obligation to give him any of that info.
Hmmm... I'm not a programmer but, OTOH, AppleScript has a random number generator... I may have to dust off my scripting and see if I could set up one that tells the browser to write a random number to that line in the cache every three minutes... Something to think about in what I laughingly refer to as my free time...
In continental Europe there's this thing called Privacy written into various national constitutions and actually European Human Rights directives, which says even stuff which is apparently public is still subject to privacy rules. The exception is if you can show an explicit public interest. A shop tracking a customer does not have a public interest defence - the only way allowed in certain European countries is for the customer to have agreed to the data collection (explicit opt-in). Even if you think the information is public. Without the opt-in the shop is not allowed to do it. The principle is that organisations/businesses hold the minimum information. Information being public is not a defence.
As you're in America, I'll give you some time to pick your lower jaw off the floor.
surely weather they are more targeted or not you can just ignore them? the technique only has effect if multiple sites are using the same technique (obv.) so information obtained (unless your buying WMDs or something, then you should use TOR) would only ever be relevant to advertisers... Just ignore the adverts, targeted or otherwise!!
Yes, you can ignore ads, even 'targeted' ads.
However, browser history can contain more telling personal information that can be used in more pernicious ways. Suppose an employer buys the browser history of their employees. They have layoffs coming up. Lets see now. Worker A has been browsing speed shoppes for racing bicycle parts. Worker B has been searching for homeopathic cancer remedies. I wonder which one the accountants would recommend to be laid off?
This needs to stop.
Excerpt from http://www.scroogle.org/doctorow.html
"He should have seen it coming, of course. The U.S. government had lavished $15 billion on a program to fingerprint and photograph visitors at the border, and hadn't caught a single terrorist. Clearly, the public sector was not equipped to Do Search Right.
The DHS officer had bags under his eyes and squinted at his screen, prodding at his keyboard with sausage fingers. No wonder it was taking four hours to get out of the god damned airport.
"Evening," Greg said, handing the man his sweaty passport. The officer grunted and swiped it, then stared at his screen, tapping.
[ . . . ]
"Tell me about your hobbies. Are you into model rocketry?"
"No," Greg said, "No, I'm not." He sensed where this was going.
The man made a note, did some clicking. "You see, I ask because I see a heavy spike in ads for rocketry supplies showing up alongside your search results and Google mail."
Greg felt a spasm in his guts. "You're looking at my searches and e-mail?" He hadn't touched a keyboard in a month, but he knew what he put into that search bar was likely more revealing than what he told his shrink.
"Sir, calm down, please. No, I'm not looking at your searches," the man said in a mocking whine. "That would be unconstitutional. We see only the ads that show up when you read your mail and do your searching. I have a brochure explaining it. I'll give it to you when we're through here."
"But the ads don't mean anything," Greg sputtered. "I get ads for Ann Coulter ring tones whenever I get e-mail from my friend in Coulter, Iowa!"
The man nodded. "I understand, sir. And that's just why I'm here talking to you. Why do you suppose model rocket ads show up so frequently?"
[ . . . ]
It amuses me endlessly to see the lengths to which marketers will go in pursuit of maybe, just possibly, once in a very long while, a sale by one of those using their services to advertise.
Only speaking for myself, but I use adblock so I see few ads, and those I do see I pay no attention to.
The con is really that marketers claim that targeted ads improve sales. That's not true. Today I may be interested in ginormous nipple rings, tomorrow in an antiquated book on Latin grammar, and the day after in Dog only knows what.
Or to use a more prosaic example, suppose I'm looking for underwear. I have a very clear idea what I want, I know exactly which brand and model will fill the bill, and any adverts to the contrary are just so much wasted effort. What *will* influence me are the web pages that give full, objective information and are clear about sizing, fabric, country of origin, colors, styles, price, and availability. But once I've bought my gaunch, that's it. Throwing more ads at me does nothing, because I have enough rags to shelter my ever lovin' bod from the lust-filled gaze of onlookers, and need no more.
Then there's ebay: in my pursuit of the perfect undies, I found the brand and model, and set up a moderately complex search string to find ebay listings for those and no others. Ebay then, in its blind pursuit of money, altered their search facility so it returned not just what I was looking for, but all sorts of other brands and models, I s'pose with the subliminal message "Maybe these are what you really want?" An intelligent company would have recognized that the more specific a search, the less likely it is that the searcher has interest in other things, particularly when the search string takes steps to exclude other makes and models.
As ebay, so marketing in general: they think their ads actually work, but it's highly questionable whether they do anything other than annoy netizens.
I have my Asus TomatoUSB router blocking about 75,000 domains compiled from various ad/adult/tracking block lists. I was happy to see kissmetrics.com listed in there when this news came out. They can have their unkillable cookie on my systems ... just as long as it can't dial home! :)
Biting the hand that feeds IT © 1998–2019