Nick Pasko: Clone existing (stopped) Hive cluster

1. Find a stopped cluster you want to be cloned and Clone it
1.1. Provide as many machines as you think should be minimal: it's easy to add machines later, harder to remove them,

2. Install s3cmd (SUSE):
cd /etc/yum.repos.d
sudo wget http://s3tools.org/repo/RHEL_6/s3tools.repo
sudo yum install s3cmd
2.1. create ~/s3cfg file, copy content from any EC2 machine you have S3 set up on
(or check here and do it from scratch)

3. Install aws-cli
sudo yum install aws-cli
3.1. create AWS credentials file
mkdir ~/.aws
cd ~/.aws
nano credentials

[default]
aws_access_key_id=Your Access Key ID
aws_secret_access_key=Your Secret Access Key
region=Optional, the default region to use for this profile

4. "s3cmd get" table creation scripts, run them, then run hive and you're good.

Nick Pasko

Friday, April 24, 2015

Clone existing (stopped) Hive cluster

0 Comments:

About Me

Previous Posts