State of Data Last Week -#35

#analysis – Why conversational idioms (like #dontyouhate) are like a ‘sugar rush’ that peaks and wears off fast, while contentious themes (like #Egypt) take longer to catch on, as an analysis of the 500 most popular hashtags and 3B tweets reveals – NY Times version; the actual paper (PDF)

#architecture – Why you should be glad if “214-748-3647” is NOT the most popular phone number in your data store – here is a hint.
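For anyone who wants the hint spelled out: 2147483647 is 2^31 - 1, the largest value a signed 32-bit integer can hold. Below is a minimal sketch of how that number could end up as everyone's phone number, assuming (my assumption, not something the linked hint confirms) that phones are stored in a signed 32-bit integer column and that the store clamps out-of-range values instead of rejecting them. The helper names and sample numbers are hypothetical.

```python
INT32_MAX = 2**31 - 1  # 2147483647, the largest signed 32-bit integer


def store_as_int32(phone: str) -> int:
    """Mimic a signed 32-bit INT column that clamps oversized values (assumed behaviour)."""
    digits = int("".join(ch for ch in phone if ch.isdigit()))
    return min(digits, INT32_MAX)


def format_phone(number: int) -> str:
    """Render a stored integer back as a 10-digit, dash-separated phone number."""
    s = f"{number:010d}"
    return f"{s[:3]}-{s[3:6]}-{s[6:]}"


# Hypothetical sample numbers, made up for illustration only.
phones = ["415-555-0123", "650-555-0199", "212-555-0147"]
stored = [format_phone(store_as_int32(p)) for p in phones]
print(stored)  # ['214-748-3647', '214-748-3647', '212-555-0147']
```

Any number whose ten digits exceed 2147483647 collapses to the same value, while smaller ones survive, which is why a text column is usually the safer choice for phone numbers.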

#big_data – Super-prestigious scientific journal founded by the likes of Graham Bell and Edison dedicates its latest issue to data challenges and opportunities. In a rare gesture, the whole collection is free (registration needed). “Big” looks really BIG after reading ‘typical adult brain has ~150 trillion synapses’.

‘Date of birth, gender and ZIP code alone are enough to personally identify 87% of the U.S. population’. Who feels unlucky to be in the 13% left out?
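To make the idea concrete, here is a rough sketch of how one might measure this on one's own data: count how many records are the only ones with their particular (date of birth, gender, ZIP) combination. The records below are entirely made up for illustration and say nothing about the 87% figure itself; the function name is hypothetical.

```python
from collections import Counter

# Entirely made-up toy records: (date_of_birth, gender, zip_code).
records = [
    ("1975-03-14", "F", "94110"),
    ("1975-03-14", "F", "94110"),  # shares all three fields with the row above
    ("1982-11-02", "M", "94110"),
    ("1990-07-21", "F", "10001"),
    ("1968-01-30", "M", "60614"),
]


def unique_fraction(rows):
    """Fraction of rows whose field combination appears exactly once."""
    counts = Counter(rows)
    unique = sum(1 for row in rows if counts[row] == 1)
    return unique / len(rows)


fraction = unique_fraction(records)
print(f"{fraction:.0%} of these toy records are unique on the three fields")
```

In this toy set, three of the five records are unique on those three fields, so anyone holding a second data set with names attached could single those people out.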

#DBMS – A whole day of ‘conversations’ with three of the world’s most respected ‘Real World Performance Experts’ – ‘dealing with TBs of data, hundreds of thousands of concurrent users – LIVE’ – highly recommended if you are in San Diego on February 21. (‘answer to every question on performance is – “it depends”’. Love it!)

#learning – ‘Data Source Handbook’ – a ‘compact guide book to most useful APIs and bulk data sets’ – now available in Safari – e.g., the controversial Rapleaf API takes an email as input and provides age, gender, interests, and even income level.

#visualization – A lifetime of Data Visualization experience in 162 concise recommendations – the checklist, cheat-sheet style ‘Communicating Data Clearly Presentation’ (PDF) ranges from the Gestalt Law of Connectedness to ‘acute angles are underestimated’ to ‘only thing worse than a pie chart is several of them’ to (personal favorite) ‘graphs are for the forest; tables are for the trees’.



About Nilendu Misra
I love to learn, create and coach. Things that I do well are - Communicating ideas - verbally or through words and diagrams; Problem Solving - Logical or Abstract; Very Large Scale Systems; thinking about the 'Frighteningly Simple' approach first. Things that I intend to do better are - Establishing Stringent Process; Exchanging Tough Feedback; Keeping up with my reading and To-Do list to be able to completely relax.

2 Responses to State of Data Last Week -#35

  1. The checklist/ cheat-sheet discussed under #visualization was actually a list of the key points made during a tutorial I delivered. Each was illustrated by an example. Contact Naomi at nbr-graphs dot com for information about the tutorial.

  2. doctordata says:


    Thanks for the clarification. The tutorial sounds very exciting. I will possibly be reaching out to you offline about a possible redo at our site.

