Nick Pasko

Programmer goes entrepreneurship.

Friday, April 24, 2015

Clone existing (stopped) Hive cluster

1. Find the stopped cluster you want to clone and clone it.
1.1. Start with the minimum number of machines you think you need: it's easy to add machines later and harder to remove them.

2. Install s3cmd (yum-based distro, RHEL 6 repo):
cd /etc/yum.repos.d
sudo wget http://s3tools.org/repo/RHEL_6/s3tools.repo
sudo yum install s3cmd
2.1. Create the ~/.s3cfg file and copy its content from any EC2 machine where you already have S3 set up
(or check here and do it from scratch)
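
If no machine with a working config is at hand, a minimal ~/.s3cfg can also be written by hand. The sketch below assumes only the two credential keys are needed and leaves every other s3cmd setting at its default. The key values are placeholders, and S3CFG_PATH is a hypothetical variable used here only to make the target path easy to change.

```shell
# Target file; s3cmd reads ~/.s3cfg by default.
# S3CFG_PATH is a hypothetical helper variable, not something s3cmd itself looks at.
S3CFG_PATH="${S3CFG_PATH:-$HOME/.s3cfg}"

# Create the file and restrict it to the owner before writing secrets into it.
# Note: this overwrites an existing file at that path.
touch "$S3CFG_PATH"
chmod 600 "$S3CFG_PATH"

cat > "$S3CFG_PATH" <<'EOF'
[default]
access_key = YOUR_ACCESS_KEY_ID
secret_key = YOUR_SECRET_ACCESS_KEY
EOF
```

Running "s3cmd --configure" instead asks for the same values interactively, and "s3cmd ls" afterwards is a quick check that the keys work.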

3. Install aws-cli
sudo yum install aws-cli
3.1. Create the AWS credentials file
mkdir ~/.aws
cd ~/.aws
nano credentials
[default]
aws_access_key_id=Your Access Key ID
aws_secret_access_key=Your Secret Access Key
region=Optional, the default region to use for this profile
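
The same setup can be scripted instead of typed into nano, which helps when you clone clusters often. This is a sketch under a few assumptions: the key values are placeholders, us-east-1 is only an example region, AWS_DIR is a hypothetical helper variable, and the region is put in ~/.aws/config, which is the other file the AWS CLI reads by default.

```shell
# Directory the AWS CLI reads by default; AWS_DIR is a hypothetical helper variable.
# If you point it elsewhere, the CLI needs AWS_SHARED_CREDENTIALS_FILE and
# AWS_CONFIG_FILE set to find the files.
AWS_DIR="${AWS_DIR:-$HOME/.aws}"
mkdir -p "$AWS_DIR"

# Restrict the credentials file to the owner before writing secrets into it.
# Note: this overwrites existing files at these paths.
touch "$AWS_DIR/credentials"
chmod 600 "$AWS_DIR/credentials"

cat > "$AWS_DIR/credentials" <<'EOF'
[default]
aws_access_key_id = YOUR_ACCESS_KEY_ID
aws_secret_access_key = YOUR_SECRET_ACCESS_KEY
EOF

# Default region for the profile (example value, change to your own).
cat > "$AWS_DIR/config" <<'EOF'
[default]
region = us-east-1
EOF
```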

4. Use "s3cmd get" to download your table creation scripts, run them, then start hive and you're good.
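
For step 4, a small helper along these lines might save some typing. It is only a sketch: run_ddl_from_s3 is a hypothetical function name, and it assumes each table creation script is a single Hive file stored in S3 that can be run as-is with "hive -f".

```shell
# Download one Hive table creation script from S3 and run it.
# $1: S3 URL of the script (the bucket and file name are whatever you use).
run_ddl_from_s3() {
  s3_url="$1"
  # Save the script in the current directory under its own file name.
  local_file="$(basename "$s3_url")"

  # --force overwrites a stale local copy left over from an earlier run.
  s3cmd get --force "$s3_url" "$local_file" || return 1

  # hive -f executes the statements in the file and then exits.
  hive -f "$local_file"
}
```

Call it once per script, for example "run_ddl_from_s3 s3://my-bucket/hive/create_tables.hql" (a made-up path), and then start hive to check that the tables are there.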
