Importing Data using Sqoop

Sqoop is an Apache Hadoop top-level project and designed to move data between Hadoop and RDBMS. Sqoop is a collection of related tools. To use Sqoop, you specify the tool you want to use and the arguments that control the tool.

In this post, we will cover how to […]

Introduction to Sqoop and Installation

To process and analyze data in Hadoop, it requires loading data into Hadoop file system that is present on Application server and databases. Sqoop is a tool designed to transfer data between Hadoop and relational databases or mainframes. You can use Sqoop to import data from a relational database management […]

Big Data – Hadoop Interview Questions

To crack an interview, it is must that your basic concepts about frameworks are pretty clear. On request of many of our students, we’ve put together a comprehensive list of questions to help you get through your Big Data – Hadoop interview. We’ve made sure that the most probable questions […]

Install Hive with embedded metastore

Hive package comes with derby as default embeded metastore. Follow below mentioned steps to install Hive with embedded metastore: 1. Download the latest version of Hive from here. 2. Uncompress the package on linux:

3. Add following to ~/.bash_profile sudo nano ~/.bash_profile

Where hduser is the user name […]

Zookeeper Standalone Installation

Pre-requisites Before starting with Zookeeper standalone installation, make sure that the node have the following pre-requisites: a)  Supported Platforms: GNU/Linux, Win32, MacOSX, FreeBSD and Sun Solaris. This blog describes the installation steps for Linux. b)  Sun Java 1.6 or above should already be installed. To install Java, you can refer to the installation steps mentioned in the blog. […]