I am a believer and evangelist for open source.  Following are some of my open source contributions.

My github : https://github.com/sujee/

Open Source Labs for Learning Hadoop

We created these open source labs to help people learn Hadoop.   There are  30+ labs that cover HDFS, MapReduce, Pig, Hive and HBase.

https://github.com/elephantscale/HI-labs

HBase

HBase project home

Spark Job Server

Spark Job Server facilitates running multiple jobs in Spark and sharing cached data across jobs.

submitted multiple patches to the project.

Spark workshop at SVCodeCamp

github

Hadoop DNS Checker

Hadoop & HBase are very particular about DNS.  This tool verifies DNS is working correctly in a Hadoop cluster

Amazon EMR scripts

scripts to launch and monitor EMR jobs

https://github.com/sujee/amazon-emr-beyond-basics

Other Projects

 

Copyright 2015 Sujee Maniyam (