Category : Hadoop

Configure Kerberos Authentication in Hortonworks Hadoop HDP 2.2

This is quick and short tutorial to install and configure Kerberos authentication in hortonworks Hadoop cluster hdp2.2.   Here is my setup environment:   Kerberos Server: kerberos.crazyadmins.com Kerberos Client: myclient.crazyadmins.com Test Hadoop Hortonworks 2.2 Cluster: myclient.crazyadmins.com   Prerequisites:   Please ensure that Kerberos server and Client/Hadoop cluster should have each other’s entry in /etc/hosts file

Read More →

Setting up Hortonworks Hadoop cluster in AWS

In this article we will discuss how to set up Hortonworks Hadoop cluster in AWS (Amazon Web Services). Assuming you have a valid AWS login let us get started with: Launching an Amazon instance Pre-requisites for setting up Hadoop cluster in AWS Hadoop cluster Installation (via Ambari)   1. Launching an Amazon instance   a)

Read More →

Install multinode cloudera hadoop cluster cdh5.4.0 manually

This document will guide you regarding how to install multinode cloudera hadoop cluster cdh5.4.0 without Cloudera manager.   In this tutorial I have used 2 Centos 6.6 virtual machines viz. master.hadoop.com & slave.hadoop.com.   Prerequisites:   CentOS 6.X jdk1.7.X is needed in order to get CDH working. If you have lower version of jdk, please

Read More →

Apache Ranger installation and Configuration in HDP2.2

Apache Ranger installation and Configuration in HDP2.2   In this tutorial I am going to cover how to install and configure Ranger on hortonworks hadoop platform 2.2.   What is Ranger?   It provides central security policy administration in a Hadoop environment. It covers 3 aspects:   Authentication : by the Apache Knox Gateway via

Read More →

Install and Configure Transparent Data Encryption in hadoop HDP 2.2

Hey Guys hope you all are doing well today I’m going to explain you how to install and configure transparent data encryption in hadoop – HDP2.2.   Why do we need Transparent Encryption in HDFS?   When we want to encrypt only selected files/directories in HDFS to save on overhead and protect performance – now this

Read More →

1 2 3 4