Spark excels at processing in-memory data.  We are going to look at various caching options and their effects, and (hopefully) provide some tips for optimizing Spark memory caching.


Read More

Hadoop and HBase (especially HBase) are very picky about DNS
entries.  When setting  up a Hadoop cluster one doesn’t
always have access to a DNS server.  So  here is ‘poor
developers’ guide to getting DNS correct.


Read More

So you have setup a new Hbase cluster, and want to ‘take it for a spin’.  Here is how, without writing a lot of code on your own.

Read More

Some handy classes for using Hadoop / Map Reduce / Hbase

Read More

This is a tutorial on how to run a map reduce job on Hbase.  This
covers version 0.20 and later.


Read More

These are some handy scripts to launch / monitor / manage Amazon EMR jobs.

The code for all this is at GitHub:  https://github.com/sujee/amazon-emr-beyond-basics

Read More

Membase is a NOSQL database.  It is designed to be a persistant
storage behind MEMCACHED – a popular in memory caching
tool.    Membase is relatively new but has found solid
footing in high performance NOSQL world.

This is a quick tutorial on using Membase database from Java.


Read More

Update : This article is now featured at Three20 website! (linky).

Three20 is an open source
iPhone/iPad library that packs a lot of cool
features.   Three20 is popular for its UI components
such as TTNavigator,  TTWebBrowser,
TTPhotoViewer.


Read More

Here are some tips about
setting up cronjobs in rails app.


Read More

Each iPhone and iPad has a unique device ID (just like MAC address) that
uniquely identifies that device.  This UDID as it is called, is
used by many iPhone programs.  Some examples could be


Read More


1 2 3
Copyright 2015 Sujee Maniyam (