I am a believer in and evangelist for open source. The following are some of my open source contributions.

My GitHub: https://github.com/sujee/

Open Source Labs for Learning Hadoop

We created these open source labs to help people learn Hadoop. There are 30+ labs covering HDFS, MapReduce, Pig, Hive, and HBase.



HBase project home

Spark Job Server

Spark Job Server facilitates running multiple jobs in Spark and sharing cached data across jobs.

I submitted multiple patches to the project.

Spark workshop at SVCodeCamp


Hadoop DNS Checker

Hadoop and HBase are very particular about DNS. This tool verifies that DNS is working correctly in a Hadoop cluster.
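To give a sense of what such a check can look like, here is a minimal sketch in Python. It is not the tool's actual code; the function name `check_dns` and its parameters are hypothetical, chosen for illustration. It assumes the usual expectations for cluster nodes: a hostname should resolve to a non-loopback address, and a reverse lookup on that address should return the same hostname.

```python
import socket


def check_dns(hostname, forward=socket.gethostbyname, reverse=socket.gethostbyaddr):
    """Return a list of DNS problems for hostname; an empty list means it looks consistent.

    `forward` and `reverse` are injectable (hypothetical parameters) so the
    check can be exercised without touching a real resolver.
    """
    problems = []

    # Forward lookup: hostname -> IP address.
    try:
        ip = forward(hostname)
    except OSError as exc:  # socket.gaierror is a subclass of OSError
        return [f"forward lookup failed for {hostname}: {exc}"]

    # A node that resolves to loopback is unreachable from other nodes.
    if ip.startswith("127."):
        problems.append(f"{hostname} resolves to loopback address {ip}")

    # Reverse lookup: IP address -> hostname.
    try:
        reverse_name = reverse(ip)[0]
    except OSError as exc:  # socket.herror is a subclass of OSError
        problems.append(f"reverse lookup failed for {ip}: {exc}")
        return problems

    # Compare names case-insensitively, ignoring a trailing dot.
    if reverse_name.rstrip(".").lower() != hostname.rstrip(".").lower():
        problems.append(
            f"reverse lookup of {ip} returned {reverse_name}, expected {hostname}"
        )
    return problems


if __name__ == "__main__":
    # Check the local machine's fully qualified name.
    name = socket.getfqdn()
    issues = check_dns(name)
    print(f"{name}: {'OK' if not issues else '; '.join(issues)}")
```

Running a check like this on every node, against every other node's hostname, is one way to catch misconfigured entries before starting cluster services.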

Amazon EMR scripts

Scripts to launch and monitor EMR jobs.


Other Projects


