State of Data #93
April 8, 2012 Leave a comment
#1Read_this_Week – Feltron Report 2010/11
“Philip K. Dick claimed that “a person’s authentic nature is a series of shifting, variegated planes that establish themselves as he relates to different people; it is created by and appears within the framework of his interpersonal relationships.”
The Feltron 2010/2011 Biennial Report explores this notion by overlapping facets of Nicholas’ behavior to visualize how his personality varies based on location and company.”
#analysis – Logging Best Practices (learnt about two cool tools - logstash and Graylog2)
#architecture – Achieving Rapid Response times in Large Online Services from Jeffrey Dean, Google Fellow
#big_data – ‘Data Mining You’
“He wrote a piece for Wired magazine on a super-secret, $2 billion, one-million-square-foot data center the NSA is building in Bluffdale, Utah. Focused on data mining and code-breaking and five times the size of the U.S. Capitol, it is expected to house information beyond compare, “including the complete contents of private emails, cell phone calls, and Google searches, as well as all sorts of personal data trails — parking receipts, travel itineraries, bookstore purchases, and other digital ‘pocket litter.’”
#Data_Science – How to use ‘I.Q. math’ to solve more murders (or, Chess, Intelligence and Winning Arguments)
“It is also known that the mean IQs of murderers and policemen are 87 and 102, respectively. So successfully solving murders is a puzzle then the “a” coefficient is 0.041, and each IQ point difference is worth 7.2 ELO points. A 3 fold advantage could be had with a 28 point gap between cops and killers. In other words some 31% of outstanding murders could be solved if the USA selected its policemen to have an average IQ of 125 i.e. to be as smart as an average lawyer.”
#DBMS – Count a billion objects using Only 1.5Kb of memory
#idea – How to manipulate Amazon.com recommendation with one single-click
#learning – How to test Random Number Generators from John D Cook, ‘Beautiful Testing’
#visualization – 10 Things You Can Learn from New York Times’ Data Visualizations, including restraint. Don’t forget to check the amazing portfolios at the bottom –:
4. Marije Rooze’s thesis collection
#etc
- Small math errors that caused HUGE disasters
- Tips ‘n Tricks - ‘My Favorite Excel Things’ (pdf)
- Importance of measuring – prisoners put on weight ‘because they do not wear belts’ and ‘miss the little signals of weight gain’
- Convert wind into data art on the wall