State of Technology #56



#architecture All the interesting problems are scalability problems; Alan Kay agrees – ‘99% or even 99.9% of the effort that goes into creating a large software system is not productive’

All Cheat Sheets for a Web Developer – HTML, CSS, PHP, MySQL, JavaScript, jQuery, mod_rewrite


#design How doodling has become a key design tool for premier innovative companies


#essay 10-minute primer on Good vs. Bad Strategy


#mobile Suite of Design Principles and Mobile Tools from Nokia

– Which sites display content in the ‘masonry’ (vertical bars) style, like Pinterest?



#service What is (a) on a website vs. (b) what people look for


#social – ‘When you get a product free, you ARE the product’ – How apps sell you on Facebook, e.g., apps that get your ‘Religion and Politics’


#tool For labeling XY chart data points in Excel


#tweaks n’ hacks The camera that does not take pictures; it describes them.


“Success is dangerous because often you don’t understand why you succeeded. You almost always know why you’ve failed” – Mark Pincus

State of Data #96

#1Read_this_Week Numerator-only Data – mobile event and gesture tracking with a few lines of code

#architecture Why ‘average user rating’ is a flawed concept

#big_data IBM’s recent acquisitions and their Big Data Strategy on a page

#Data_Science Evaluating the Design of the R Language (original paper)

#DBMS Single SQL statement to create a calendar in multiple languages

#idea Tim Berners-Lee asks you to demand your data from Facebook

Telling stories with Data (pdf); 6 min video (3rd from top)


#visualization The first ever pie chart (1801)



State of Data #95

#1Read_this_Week How a San Diego Physicist used data and analysis to reduce energy consumption by 5x


#analysis E-reading/E-books data


Download the entire text of Wikipedia and use it offline in ZIM packages


#big_data Analysis of 60B pieces of music listening data across 200 cities, from 2003 to date, reveals very interesting network effects in music: namely, the theory that city size benefits innovation or earning does not hold for music.


“Montreal seems to lead North America in indie music listening habits and the leader for hip hop is Atlanta. In Europe, Paris leads for indie music whereas Oslo leads for music as a whole.”


#Data_Science CEOs’ vacations correlated with corporate news disclosures (pdf) – vacation was evaluated by merging {corporate jet flight histories} with {real estate records}


“I find that companies tend to disclose favorable news just before CEOs leave for vacation and then hold over subsequent news announcements until they return to headquarters.  During periods when CEOs are away from the office, stock prices behave quietly with sharply lower volatility than usual.  Volatility increases immediately when CEOs return to work.”

MySQL in 2012 – a detailed progress report – comparing the deployments at Pinterest and Craigslist

#idea ‘When Data Goes Bad’ – all about spotting ‘corrupted’ or ‘falsified’ data, with engaging examples

Big Data in Finance (via O’Reilly Radar)

#visualization Miso Project – funded by the Bill and Melinda Gates Foundation, built by Guardian data scientists – is an open-source toolkit for visualizing data


State of Technology #54


#architecture Java Tuning in a Nutshell – Part 1

#code Mathematics for Computer Science (pdf) – latest full course material/book from MIT

#design 60 Examples of Responsive Website Design


#essay Is Facebook making us lonely?

“If you use Facebook to communicate.. – by using the “like” button, commenting on friends’ posts, and so on – it can increase your social capital. Personalized messages, or what Burke calls “composed communication,” are more satisfying than “one-click communication” – the lazy click of a like. “People who received composed communication became less lonely, while people who received one-click communication experienced no change in loneliness.””


#mobile Where is energy spent inside my app – eprof (a fine-grained energy profiler) design details (pdf)

“Eprof sheds lights on internal energy dissipation of these apps and exposes surprising findings like 65%-75% of energy in free apps is spent in third-party advertisement modules”


#saas – Ideaschema takes your ideas (on product, sales, support, engineering) and converts them into comics. Very high value!

#service Is Twitter just a modern day complaint department?

The ‘blog’ from the 17th century suggests the infrastructure lifecycle is about a decade


#tool Collection of Software Engineering Tools from Microsoft Research



#tweaks n’ hacks A lightweight reusable water bottle that can be folded




‘Believe you are the baddest ass in town-and you suck!..Be able to keep two completely contradictory ideals alive and well inside of your heart and head at all times. If it doesn’t drive you crazy, it will make you strong. And stay hard, stay hungry and stay alive.’ – Bruce Springsteen, SXSW, 2012

State of Data #94

#1Read_this_Week NASA generates a ‘Van Gogh’ with the World’s Ocean Currents


#analysis Evaluative words for Wine

“incredibly 93.3846
sensational 93.375
finest 93.2727
fantastic 92.8846
fantastically 92.8421
mignon 92.6154”

#architecture Netflix Recommendation System – Part 1; from Netflix


#big_data Comprehensive list of Publicly available Large Data Sets for Database Research


#Data_Science How to work with Google n-gram data sets in R using MySQL

Twitter open sources its MySQL work at GitHub

On-the-road statistics that prevent accidents

Everything You Wanted to Know about Data Mining but were Afraid to Ask

#visualization National Heat Map shows heat generated from UK buildings

  • Portrait of a Twitter user
  • ..and of a LinkedIn user
  • FT Graphic World with David McCandless
  • QWERTY Effect: “because of the QWERTY keyboard’s asymmetrical shape (more letters on the left than the right), words dominated by right-side letters “acquire more positive valences” – that is to say, they become more likable.”

State of Data #93

#1Read_this_Week – Feltron Report 2010/11

“Philip K. Dick claimed that “a person’s authentic nature is a series of shifting, variegated planes that establish themselves as he relates to different people; it is created by and appears within the framework of his interpersonal relationships.”

The Feltron 2010/2011 Biennial Report explores this notion by overlapping facets of Nicholas’ behavior to visualize how his personality varies based on location and company.”


#analysis – Logging Best Practices (learnt about two cool tools – logstash and Graylog2)

#architecture – Achieving Rapid Response Times in Large Online Services from Jeffrey Dean, Google Fellow


#big_data – Data Mining You

“He wrote a piece for Wired magazine on a super-secret, $2 billion, one-million-square-foot data center the NSA is building in Bluffdale, Utah. Focused on data mining and code-breaking and five times the size of the U.S. Capitol, it is expected to house information beyond compare, “including the complete contents of private emails, cell phone calls, and Google searches, as well as all sorts of personal data trails – parking receipts, travel itineraries, bookstore purchases, and other digital ‘pocket litter.’””


#Data_Science – How to use ‘I.Q. math’ to solve more murders (or, Chess, Intelligence and Winning Arguments)

“It is also known that the mean IQs of murderers and policemen are 87 and 102, respectively. So if successfully solving murders is a puzzle, then the “a” coefficient is 0.041, and each IQ point difference is worth 7.2 ELO points. A 3 fold advantage could be had with a 28 point gap between cops and killers. In other words some 31% of outstanding murders could be solved if the USA selected its policemen to have an average IQ of 125 i.e. to be as smart as an average lawyer.”
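
A quick way to sanity-check the arithmetic in that quote is the sketch below. It assumes the “a” coefficient is the slope of the standard Elo expected-score curve once it is rewritten in base e per IQ point; that reading is my assumption, not something the linked piece confirms, and the function names are made up for illustration. It only checks the 0.041 and “3 fold” figures, not the 31% claim.

```python
import math

# Figure taken from the quoted passage.
ELO_PER_IQ_POINT = 7.2


def logistic_coefficient(elo_per_iq_point=ELO_PER_IQ_POINT):
    """Slope per IQ point of the Elo curve 1 / (1 + 10 ** (-d / 400)) in base e.

    Assumption: this is what the quote calls the "a" coefficient.
    """
    return elo_per_iq_point * math.log(10) / 400


def odds_ratio(iq_gap, elo_per_iq_point=ELO_PER_IQ_POINT):
    """Odds in favour of the higher-IQ side for a given IQ gap."""
    elo_gap = iq_gap * elo_per_iq_point
    return 10 ** (elo_gap / 400)


def win_probability(iq_gap, elo_per_iq_point=ELO_PER_IQ_POINT):
    """Elo expected score for the higher-IQ side."""
    odds = odds_ratio(iq_gap, elo_per_iq_point)
    return odds / (1 + odds)


if __name__ == "__main__":
    print(f"a coefficient: {logistic_coefficient():.4f}")
    print(f"odds at a 28 point gap: {odds_ratio(28):.2f}")
    print(f"win probability at a 28 point gap: {win_probability(28):.2f}")
```

Under these assumptions the slope rounds to 0.041 and a 28 point gap gives odds of a little over 3 to 1, which matches the two figures in the quote.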

#DBMS – Count a billion distinct objects using only 1.5KB of memory

#idea – How to manipulate recommendations with a single click

#learning – How to test Random Number Generators, from John D. Cook, in ‘Beautiful Testing’

#visualization – 10 Things You Can Learn from New York Times’ Data Visualizations, including restraint. Don’t forget to check the amazing portfolios at the bottom:

1. Jonathan Corum’s portfolio
2. Graham Roberts’ portfolio
3. Kevin Quealy’s portfolio
4. Marije Rooze’s thesis collection
5. Small Labs Inc collection


State of Technology #52


#architecture Could your HTTP servers handle 500,000 requests/sec? It is the new normal.

#code Absolutely delightful – “Learning to Code: The Roadmap I wish I had been Given”

#design #1 DVD cover of ‘The Girl with the Dragon Tattoo’ has a faux-pirated look – a theme consistent with the trilogy.

#2 Spaghetti Fork

#essay Short note on time scales and ensuing discussion

“So when someone says that Solid State Disks are awesome, it’s because they’re able to turn that slow, unpredictable old sailing ship into a streamlined steam-powered vessel. A good SSD can often make the voyage in less than a week, sometimes in little more than a day. It can also make many thousands more quests for information per year.”


#mobile Highlight – the most talked-about app at SXSW, but was it a huge success?

#service 11 of the Best Customer Service Stories Ever; and a brand new one (perhaps not entirely ethical)

140 Best Twitter Feeds of 2012 from Time is out


#tool Cable Guys

#tweaks n’ hacks Periodic Table Table


#parting_thought ‘Several people have told me they like the iPad because it lets them bring the Internet into situations where a laptop would be too conspicuous. In other words, it’s a hip flask.’ – Paul Graham