State of Data Last Week – #38

#analysis – 133M blog posts, 231M social media feeds. 3TB data set collected between Jan 13 and Feb 14, 2011. Data Challenge – culminating in ICWSM @ Barcelona this summer – ‘locate significant posts in the collection which are relevant to the revolutions in Tunisia and Egypt’.

#api – Google App Engine to support SQL

#architecture – StackOverflow Architecture lowdown – how it deals with 800 HTTP requests/second. “Some raw SQL” in data access layer.

#big_data – MongoDB – apparently works great when entire data fits in the memory; otherwise it could be ‘up to 17 sec for 30,000 reads’


#DBMS –
Big Data is Big Business – TeraData buys Aster Data for $263M

#learning – Why ‘most benchmarks are seriously broken’ because ‘complexity and performance model quality are inversely related’  - a great talk on ‘Performance Anxiety’ at Devoxx 2010.

#visualization – RStudio – new IDE for R – got raving reviews and many endorsements from community

#etc

 

About Nilendu Misra
I love to learn, create and coach. Things that I do well are - Communicating ideas - verbally or through words and diagrams; Problem Solving - Logical or Abstract; Very Large Scale Systems; think about 'Frighteningly Simple' approach first. Things that I intend to do better are - Establishing Stringent Process; Exchanging Tough Feedback; Keeping up with my reading or To-Do list to be able to completely relax.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

Follow

Get every new post delivered to your Inbox.

%d bloggers like this: