Wednesday, August 19, 2009

Visualizing your online persona

Ever been curious about your online identity? Or, more specifically, how a computer algorithm with all its inherent limitations might perceive your online identity?

Enter Personas, a project by the Social Media Group from the MIT Media Lab:
"[Personas] uses sophisticated natural language processing and the Internet to create a data portrait of one's aggregated online identity. In short, Personas shows you how the Internet sees you.

"Enter your name, and Personas scours the web for information and attempts to characterize the person - to fit them to a predetermined set of categories that an algorithmic process created from a massive corpus of data. The computational process is visualized with each stage of the analysis, finally resulting in the presentation of a seemingly authoritative personal profile."

Okay, so this doesn't work for me because I am a virtual unknown with no online identity to speak of. I am an Internet Nobody, sad but true.

But I tested this out instead with one of my favorite Internet Somebodies: Richard Feynman (full disclosure: he was my longest lasting imaginary crush through high school). The results are quite funny...


Click the image to enlarge (sorry for the low res).

I have to say, I would never guess this was the online persona for a Nobel Prize winning physicist... The big sports component (yellow on the left) is to me most amusing. Run the algorithm and you'll see that it's due to a misinterpretation of the words "series" (in this case "lecture series" not sports series) and "player" (from the oft-reported fact that Feynman was a "bongo player" - really).

Of course, the smart kids at MIT were aware of this all along:
"Personas demonstrates the computer's uncanny insights and its inadvertent errors, such as the mischaracterizations caused by the inability to separate data from multiple owners of the same name. It is meant for the viewer to reflect on our current and future world, where digital histories are as important if not more important than oral histories, and computational methods of condensing our digital traces are opaque and socially ignorant."

Nifty stuff.