Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  1. follow instruction http://archive.cloudera.com/docs/ec2.html
    1. Item 2.1 . I uploaded 3 tar files: 'client script', 'boto', and 'simplejson'.
      • Un-tarred all 3: tar vzxf ....
      • execute twice , in 'boto' and 'simplejson' directories
        Code Block
        sudo python setup.py install 
      • move hadoop-ec2 to permanent place & add to path (for easier use)
        Code Block
        
        tar vzxf cloudera-for-hadoop-on-ec2-py-0.3.0-beta..tar.gz
         sudo mv cloudera-for-hadoop-on-ec2-py-0.3.0-beta /opt/
         export HADOOP_EC2_HOME=/opt/cloudera-for-hadoop-on-ec2-py-0.3.0-beta
         export PATH=$PATH:$HADOOP_EC2_HOME
        
      • exported by hand environment variables
        AWS_ACCESS_KEY_ID - Your AWS Access Key ID
        AWS_SECRET_ACCESS_KEY - Your AWS Secret Access Key
      • create a directory called ~/.hadoop-ec2 w/ file ec2-clusters.cfg with content:
        Code Block
        [my-hadoop-cluster]
        ami=ami-6159bf08
        instance_type=c1m1.mediumsmall
        key_name=janAmazonKeyjanAmazonKey2
        availability_zone=us-east-1c
        private_key=/home/training/
        ssh_options=-i %(private_key)s -o StrictHostKeyChecking=no
        
      • filer a cluster of 1 server+2 nodes
        cd ~/Desktop/cloudera-for-hadoop-on-ec2-py-0.3.0-beta
        ./hadoop-ec2 launch-cluster my-hadoop-cluster 2

...