State of Data #59

#analysis – What metrics should you track for Web marketing effectiveness (say, for a newsletter)? How many receive the email (Delivery Rate), how many view it (Open Rate), and why that differs from how many click through to view it (Click-to-deliver Rate), etc. Avinash Kaushik describes how to measure effectiveness across three dimensions: Acquisition, Behavior, Outcomes.
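To make the three rates concrete, here is a minimal sketch in Python. It assumes the commonly used definitions (delivered = sent minus bounced; opens and clicks are both divided by delivered emails), which may differ in detail from what your email tool reports. The names `CampaignCounts` and `email_metrics` are made up for illustration.

```python
from dataclasses import dataclass


@dataclass
class CampaignCounts:
    """Raw counts for one newsletter send (hypothetical structure)."""
    sent: int
    bounced: int
    opened: int
    clicked: int


def email_metrics(counts: CampaignCounts) -> dict:
    """Compute delivery, open, and click-to-deliver rates as fractions.

    Assumed definitions:
      delivery_rate         = (sent - bounced) / sent
      open_rate             = opened / delivered
      click_to_deliver_rate = clicked / delivered
    """
    delivered = counts.sent - counts.bounced
    if counts.sent <= 0 or delivered <= 0:
        raise ValueError("need at least one sent and one delivered email")
    return {
        "delivery_rate": delivered / counts.sent,
        "open_rate": counts.opened / delivered,
        "click_to_deliver_rate": counts.clicked / delivered,
    }


if __name__ == "__main__":
    # Made-up numbers, purely for illustration.
    example = CampaignCounts(sent=1000, bounced=50, opened=190, clicked=38)
    for name, value in email_metrics(example).items():
        print(f"{name}: {value:.1%}")
```

Dividing both opens and clicks by delivered emails (rather than by sent emails) keeps a bad mailing list from masking how the content itself performed.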


It’s hard to avoid articles titled – ‘Is NoSQL Lady Gaga of Database world?’. In context, replace Lady Gaga with NoSQL below –


“You know, there’s a difference between not liking someone’s music and not recognizing their talent. If you can’t recognize the fact that Lady GaGa is, in fact, extremely talented in many ways, then you may want to try to look at her with less of a bias. There’s plenty of artists I can’t stand, but still respect their talent.”

How a Cornell team dug out “Fake Reviews” with non-human “Classifiers” (PDF) and beat humans at it handily. The basic premise was that Truth = Informative writing; Deception = Imaginative Writing.

Now, if only we had an API to run it on that restaurant with thousands of five-star ratings on Yelp…

Speaking of Yelp, it now has 20M reviews (a cool visualization from them). Bi-Rite Creamery in San Francisco is the business with the most reviews (3903, as of writing).

#conference – What is self-service BI, what infrastructure is needed, and how to take your organization towards it: Focus Roundtable on August 9, 9:30-10:30 AM

Coming to a Server near you soon — No more Reboot after a systems update – Oracle acquires Ksplice

#learning – Read between the lines – Lymbix offers a sentiment-analysis (of, say, your boss’s email feedback) API returning JSON or XML scoring attributes like sadness, humiliation, dominant_emotion, affection etc. Amusement Quotient: 100!



#visualization – Compelling metaphor to illustrate the difference between Data and Information




  • Go SQLite

  • Mobile is now 2% of global GDP: “worldwide mobile industry should bring in $1.3 trillion in 2011 and will represent about 2 percent of global gross domestic product”

  • Cloud coming home to roost – Microsoft suggests ‘Data Furnaces’ to heat your home. 400 CPUs can heat a single-family home. The full paper (PDF) is an interesting read too.

  • United States of Netflix – Visualization of the month does not cost $6 more to see.

  • Metalog – Catalog of Data Catalogs from Governments (and some spam taking advantage of openness) across the world

About Nilendu Misra
I love to learn, create and coach. Things that I do well are - Communicating ideas - verbally or through words and diagrams; Problem Solving - Logical or Abstract; Very Large Scale Systems; think about 'Frighteningly Simple' approach first. Things that I intend to do better are - Establishing Stringent Process; Exchanging Tough Feedback; Keeping up with my reading or To-Do list to be able to completely relax.
