Skip to content

Latest commit

 

History

History
76 lines (54 loc) · 3.22 KB

README.md

File metadata and controls

76 lines (54 loc) · 3.22 KB

accumulo

Apache Accumulo in AWS

This work is inspired by:

  1. Exogeni Recipies
  2. Accumulo in Docker

What Is Apache Accumulo?

Apache Accumulo is a key/value store based on the design of Google's BigTable. Accumulo stores its data in Apache Hadoop's HDFS and uses Apache Zookeeper for consensus. While many users interact directly with Accumulo, several open source projects use Accumulo as their underlying store.

See official documentation for more information.

Accumulo Cluster

Cluster

How to use this cloudformation?

Pre-requisites

  1. User must have AWS account with privilges to create/delete IAMRole, IAMPolicy and IAMProfile
  2. Key pair has been created
  3. Change the value of ACCUMULO_PASSWORD in setupaccumulo.sh. Default value is 'secret'

Create a Accumulo Stack

Create a stack on AWS Cloudformation service by using accumuloCloudFormation.json.

Logon to AWS Console and Search for Cloudformation service

Cloudformation

Click Create

Cloudformation

Choose accumuloCloudFormation.json and click Next

Cloudformation

Specify the Stack name and KeyPair and click Next

Cloudformation

Click Next

Cloudformation

Ensure the checkbox for IAMRole warning is checked and click Create

Cloudformation

Stack creation will begin and status will be displayed as below

Cloudformation

namenode instance: NameNode Web UI on port 50070

NameNode: http://[PublicIPv4 of Instance]:50070/dfshealth.html#tab-datanode

NameNode

resourcemanager instance: ResourceManager Web UI on port 8088

ResourceManager: http://[PublicIPv4 of Instance]:8088

ResourceManager

accumulomaster instance: Accumulomaster Web UI on port 9995

Accumulomaster: http://[PublicIPv4 of Instance]:9995

Accumulomaster

worker instance: Worker Web UI on port 9995

Worker: http://[PublicIPv4 of Instance]:9995

Worker

Test Cluster

NOTE: Assumes the cluster is running as configured.

A script named usertable-example.sh will create a sample usertable in Accumulo using 100 randomly generated user entries.

This script should be executed on accumulomaster console as root user.

Accumulo stack hacks

Launch Accumulo with one worker

  1. Edit accumuloCloudFormation.json and remove the section for worker2 line 600-705
  2. Replace worker2 with empty string in accumuloCloudFormation.json

Launch Multiple Accumulo stacks

  1. Make copy of [accumuloCloudFormation.json](.../master/accumuloCloudFormation.json]
  2. Replace bucket name i.e. cometbucket in accumuloCloudFormation.json to a different name in one of the files