UPDATED 16:56 EDT / DECEMBER 09 2009

NEWS

Clean vs. Dirty Data: Data as the New Developer Kit [Twitter and Facebook]

Facebook is racing to open up their privacy settings. Why? Their $15 billion valuation and future depend on it.

The faster Facebook opens up the data, the faster a few things happen. First, Facebook can start implementing and then scaling a search offering. Second, Facebook can start rolling out new ad products for advertisers that can command massive premiums thanks to that once-locked data. This is huge; more on that later.

Third, Facebook can start fostering a healthy and profitable ecosystem of third-party developers to build much-needed new applications and tools that provide better user experiences. Facebook can’t do it all on their own, and they want to have a developer ecosystem. If you’re interested in Facebook’s engineering vision, read this interview that I did with their VP of Engineering, Mike Schroepfer.

The New Developer Kit – DATA

I just posted my Angle on the Twitter Firehose Myth: What you need to know about Twitter’s APIs.

Twitter’s big developer focus is mainly based upon their accidental success with developers due to their clean (or unstructured) data. I think that the Twitter data is a big win for developers, and I’m glad to see them reinforce their position as “friendly” to developers. In this cloud- and social-media-infested market of innovation, all creative developers love access to data, and tons of it.

One thing that isn’t being talked about in Twitter’s announcement about their firehose is the quality of the data. From a developer’s perspective, there is clean data and there is dirty data. Let me elaborate.

Clean Data: Twitter

Twitter data is easy to work with. We have seen massive innovation and new venture creation around the data on Twitter. This is in direct contrast to Facebook. Facebook is moving fast to change this (and rightly so) with today’s announcement that they are going to open up their privacy settings. Translation: Facebook’s data, although huge, has been closed and hence messy for developers. Twitter’s data is huge and very open, hence great for creative developers.

Dirty Data: Facebook

Facebook data is massive. At Supernova it was said Facebook has over 10 billion shares per day, and that’s just the sharing. However, Facebook’s own success (an invite-only social graph) has been their biggest Achilles’ heel. Developers have had a hard time dealing with their data due to all the privacy settings. I’ve also heard that the privacy settings have prevented Facebook from really “killing it” in deploying search and selling huge, scalable advertising deals (not CPM deals but data-based deals).

We can see the evidence of this from the success (or lack thereof) of the Facebook platform. Frankly, it has been problematic (just ask Scott Rafer and others). That is why we are seeing Facebook shift quickly to pushing and expanding on Facebook Connect. Facebook Connect is a much cleaner value proposition for developers and users. Frankly, it is a better move for Facebook.

Having open data and clean data is a wonderful thing for developers. Let’s hope Facebook can get there fast.

