State of Data Last Week – #38

#analysis – A 3 TB data set of 133M blog posts and 231M social media feeds, collected between Jan 13 and Feb 14, 2011. The accompanying Data Challenge, culminating at ICWSM in Barcelona this summer, is to ‘locate significant posts in the collection which are relevant to the revolutions in Tunisia and Egypt’.

#api – Google App Engine to support SQL

#architecture – Stack Overflow architecture lowdown – how it deals with 800 HTTP requests/second, with “some raw SQL” in the data access layer.

#big_data – MongoDB – apparently works great when the entire data set fits in memory; otherwise it could take ‘up to 17 sec for 30,000 reads’
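To put that quoted figure in perspective, here is a quick back-of-the-envelope sketch. It assumes the 17 seconds is the total wall-clock time for all 30,000 reads and that the reads cost roughly the same each; neither assumption comes from the linked post, and the variable names are just for illustration.

```python
# Figures quoted in the item above; nothing here is a new measurement.
total_seconds = 17.0   # 'up to 17 sec'
total_reads = 30_000   # 'for 30,000 reads'

# Assumption: reads are uniform, so a simple average is meaningful.
ms_per_read = total_seconds / total_reads * 1000
reads_per_second = total_reads / total_seconds

print(f"{ms_per_read:.2f} ms per read")
print(f"{reads_per_second:.0f} reads per second")
```

Under those assumptions the worst case works out to roughly 0.57 ms per read, or about 1,765 reads per second.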

Big Data is Big Business – Teradata buys Aster Data for $263M

#learning – Why ‘most benchmarks are seriously broken’: because ‘complexity and performance model quality are inversely related’ – a great talk on ‘Performance Anxiety’ at Devoxx 2010.

#visualization – RStudio – new IDE for R – got rave reviews and many endorsements from the community




About Nilendu Misra
I love to learn, create and coach. Things that I do well are - Communicating ideas - verbally or through words and diagrams; Problem Solving - Logical or Abstract; Very Large Scale Systems; think about 'Frighteningly Simple' approach first. Things that I intend to do better are - Establishing Stringent Process; Exchanging Tough Feedback; Keeping up with my reading or To-Do list to be able to completely relax.
