Over The Counter Culture

Staring at the sun
Latest Posts »
Popular »
» Getting a cutting edge Android smartphone for £85
» Vast EU research grant fraud uncovered, millions lost
» Stewart Brand, on viruses and the scale of things
» UK government amends data protection and cookies law
» Adam Curtis Greencine interview on media elitism, the US and the UK
» NSFW: Oklahoma judge used penis pump during trials
» The Fred Wilson Effect: the benefits of open conversations online
» The Facebook Data Protection Act letter
« Self-replicating, open source 3D printers
A song to make your spine tingle »

Is Google using your brain as you browse?

I just stumbled across a research paper published by a Google employee and a Microsoft employee entitled “A Case for Usage Tracking to Relate Digital Objects“. I have no idea who Elin Rønby Pedersen is but she’s published both on this and on Google’s much vaunted foray into organising your health data.

The paper highlights an interesting idea, potentially just as important to Future Google as Pagerank has been to Google so far. It’s not groundbreaking – you see it on, for example, Amazon. But it’s worth thinking about, applied to the whole web.

The idea is that related objects – and I use the term extremely loosely here – can be identified because you looked at them during a session of Internet browsing; you started with one, and your later browsing takes you to related objects – blog posts or news articles on the same or related subject; similar videos; etc. Your brain does the hard work of deciding what objects you’re looking for; average that with other similar datasets and Google has a pretty damn good idea of what objects on the web are related, no matter what format the object has (could be visual, textual, a flash game, a picture – they could all be related in some way that a machine has no way of ever being able to decipher the way a brain can) – the beauty of this is, the Google machine doesn’t HAVE to understand.

Evidently, there’s a lot of ‘noise’ in the data since people can be quite random when browsing, or visit an unrelated page, etc. The answer to noisy datasets is to aggregate more datasets and average them. Google definitely has access to a lot of data – just through google.com, but also the emails you send through Gmail, through content you share through Google Opensocial apps, by registering your IP each time you view their ads on any of the sites you visit, by monitoring the sites visited by anyone with a Google toolbar – etc.

This is more top-down “semantics”, and only a few companies have the capability of tracking all Internet users around the web; Google is fairly unique because it has so much share of search, email and ads (you could argue that the doubleclick merger approval really missed the significance of the move, with huge privacy and antitrust concerns going unnoticed). Two additional categories of players present themselves: your browser, and your OS. The OS could (controversially) monitor websites you visit. As could your browser. I see huge potential in Mozilla Weave – if, when I send it my web visit history data (at the moment i do that so it syncs my data between my computers), with my approval it processed the data (looking at what I did during my browsing sessions) and pooled it with that of others, it could infer relationship between objects and recommend it within a sidebar.

Technorati Tags: Google,search engines,tracking,web 3.0,implicit web,semantics,data portability,relational web
Bookmark/Share:

Related:

The semantic elephant in the room – Google will settle the "top down vs. bottom up" debate for us
The fundamental principle of semantifying data is that information becomes more easily found and understood by computers. Mix that with AI and you've got some very, very powerful, useful tools for information gathering, processing and decision making! So why is Google - the information lynchpin of the Internet, and thus, of modern society - not THE focus of attention in all this hubris about Web3.0? Here's why it should be: (...)...
Google Friend Connect – part 2: The largest Social Network ever built
Having originally assumed that the reason Facebook, Hi5 and LinkedIn (FHL), amongst others, were involved in the Google Friend Connect (GFC) service, I initially wanted to write this post to argue that this was the biggest strategic mistake of their lives. Turns out, Google is involving them whether they like it or not – using [...]...

Related posts brought to you by Yet Another Related Posts Plugin.

This entry was posted on Saturday, April 12th, 2008 at 6:58 pm and is filed under Musings. You can follow any responses to this entry through the RSS 2.0 feed. You can leave a response, or trackback from your own site.

  • Home
  • About
  • List all posts
  • Current Reading
  • Search

Over The Counter Culture is proudly powered by WordPress
Entries (RSS) and Comments (RSS).