Wednesday, July 11, 2012

The story of data

I'm a big believer in Wikileaks, a publisher of raw leaked data about corporate and government corruption.
Wikileaks publishes raw data and hands it to the press, which sorts and summarizes it.

When data was scarce, the press could easily summarize it, write up the general idea and spread it to the world.
Now there is so much data that I believe the press should stop working on summarizing it, and concentrate its efforts on writing up the general idea and spreading it.

From Wikileaks on the Syria leak:
The database comprises 2,434,899 emails from the 680 domains. There are 678,752 different email addresses that have sent emails and 1,082,447 different recipients. There are a number of different languages in the set, including around 400,000 emails in Arabic and 68,000 emails in Russian.

How can you ask a journalist to summarize so much data and grasp the general idea?

Data tells a story, but we can no longer read that story without computers: there is just too much data.
There is a missing piece between Wikileaks and the press. The world needs data mining and data visualization experts.
Until now, these jobs were only accessible through research labs and big companies. They were the only places where we could find enough data and enough processing power.
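
To give an idea of what a first pass over such a dump could look like, here is a minimal Python sketch that computes the same kind of figures Wikileaks quotes above: distinct senders, distinct recipients and the busiest sender domains. The record layout (dicts with `from` and `to` header strings) and the `summarize` function are my own assumptions for illustration, not the actual format of the leak.

```python
from collections import Counter
from email.utils import getaddresses


def summarize(messages):
    """Count distinct senders/recipients and the most active sender domains.

    `messages` is assumed to be an iterable of dicts holding raw 'from'
    and 'to' header strings (a hypothetical layout, not the real dump).
    """
    senders, recipients = set(), set()
    sender_domains = Counter()
    for msg in messages:
        # getaddresses() splits a header into (display name, address) pairs.
        for _, addr in getaddresses([msg.get("from", "")]):
            if addr:
                addr = addr.lower()
                senders.add(addr)
                sender_domains[addr.rpartition("@")[2]] += 1
        for _, addr in getaddresses([msg.get("to", "")]):
            if addr:
                recipients.add(addr.lower())
    return {
        "senders": len(senders),
        "recipients": len(recipients),
        "top_sender_domains": sender_domains.most_common(3),
    }


# Tiny made-up sample, just to show the shape of the input.
sample = [
    {"from": "Alice <alice@example.org>", "to": "bob@example.com, carol@example.org"},
    {"from": "alice@example.org", "to": "bob@example.com"},
    {"from": "Dave <dave@example.com>", "to": "Alice@example.org"},
]
print(summarize(sample))
```

On a few million emails the loop stays the same; the hard part is turning counts like these into something a journalist can actually read.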

Thanks to advances in cloud computing, processing power is now accessible to any independent developer. And Wikileaks gives us the data to play with.
This field will be a new playground for developers.
