State of Data Last Week – Aug 08
August 9, 2010
Cool Numbers – It’s depressing!
- 300 Millions Tweets (also) indicate Thursday (evening) is the nation’s Unhappiest Day
- Statistically, flights numbered 191 are the one to avoid (most number of crashes).
- Not an excuse to skip it though! Raw ingredients in Breakfast have increased in price by 25% in last 90 days (while labor force fell by 0.6M).
- Graphite for fuel? A 2B pencil can write almost 1100 miles of text.
Analysis or Dare?
(a) People who swat flies have a thing for US Today?
(b) Believers in Alien abduction are more likely than nonbelievers to drink Pepsi?
(c) People who cut their sandwiches diagonally are more likely to wear RayBan?
(e) E.R. care accounts for less than 3% of healthcare spending?
Talking of analysis, offloading Predictive Analysis to Google API – Machine Learning as a Service, all you need to know.
Ben Horowitz on “Taking the mystery out of Scaling a company” – a lot applies to scaling technology as well. e.g., replace “people” with “server” in the snippet –
“when adding people server into the company feels like more work than the work that you can offload to the new employees servers”
Bit of nostalgia for those who’d worked with “large, 12MB databases” (or before) – Oracle 5 Installation Live
NOSQL Patterns – finally!
CouchDB has started losing data or, at least, started admitting it.
“ once the bad code path is triggered, subsequent writes to the database are never committed. This means there is potential data-loss for users of 1.0.0.”
Architecture of the Month – LinkedIn Data Infrastructure
Cocktail party cheat-sheet –Even the cheapest SSD (Solid State Disk) is 5-7x faster at TPC than usual drives (for PostGreSQL)