To crack an interview, it is must that your basic concepts about frameworks are pretty clear. On request of many of our students, we’ve put together a comprehensive list of questions to help you get through your Big Data – Hadoop interview. We’ve made sure that the most probable questions […]
Hive
Install Hive with local metastore
Being a data-warehousing framework, a single session for Hive is not preferred. To solve this limitation of Embedded Metastore, a support for Local Metastore was developed. A separate database service runs as a process on same or remote machine. The Metastore service still runs in the same JVM within hive […]
Install Hive with embedded metastore
Hive package comes with derby as default embeded metastore. Follow below mentioned steps to install Hive with embedded metastore: 1. Download the latest version of Hive from here. 2. Uncompress the package on linux:
1 2 |
tar –xzvf apache-hive-0.13.1-bin.tar.gz |
3. Add following to ~/.bash_profile sudo nano ~/.bash_profile
1 2 3 |
export HIVE_HOME=/home/hduser/hive-0.13.1 export PATH=$PATH:$HIVE_HOME/bin |
Where hduser is the user name […]
Hive Metastore
Hive Metastore Introduction Hive Metastore is a central repository for Hive metadata. It has 2 components: A Service to which the Hive Driver connects to and queries for the database schema. A backing database to store the metadata. Currently Hive supports 5 backend databases: Derby, MySQL, MS SQL Server, Oracle […]
Apache Hive Introduction
Hive Introduction Hive is an Apache software foundation project originated at Facebook. It is a data warehousing system build on top of Hadoop to analyse big data using SQL like query language. This blog covers an overview of Hive architecture and its design goals. The RDBMS and NoSql databases failed to fulfil […]