Securing Hadoop with Kerberos

Lately I was working on making our 10 node Hadoop cluster secure behind Kerberos. The cluster consists of CentOS 6.4 servers with CDH 5.0 distribution. Instead of using our company wide Active Directory running on Windows (didn’t want to mess up data for 32,000 users), I decided to run Kerberos Key Distriburtion Center (KDC) and realm local […]

Uninstalling CDH5 cluster

While implementing Kerberos security on a small CDH5 cluster (each machine running CentOS 6.4) recently, I ended up making quite a few configuration changes manually on few of the nodes in my cluster. The changes I made resulted in my cluster becomming unstable with few nodes having different configuration (from how I had set up […]

Running Traditional Applications on Hadoop

         I work on a cloud-based SaaS business analytics software that transforms multi-dimensional disparate enterprise data into actionable insights through user-friendly infographics, charts, and dashboards. There has been an exponential growth in the amount of data generated across the enterprise. and we’ve re-engineered our product to run on Big Data technologies powered (Hadoop […]