CloudxLab

Sunday, December 1, 2013

Falcon Onboarding

Here will start will Falcon first look, as we having been using for our actual production operations for data pipelines.

Will go through how to configure and onboard a pipeline on Falcon..

http://falcon.incubator.apache.org/docs/InstallationSteps.html
http://falcon.incubator.apache.org/docs/FalconArchitecture.html
http://falcon.incubator.apache.org/docs/OnBoarding.html
http://falcon.incubator.apache.org/docs/EntitySpecification.html
http://falcon.incubator.apache.org/docs/FalconCLI.html

First get your hadoop clusters, with oozie and activemq ready, yes clusters.. we will have two hadoop setups for out activity..

keep watching..

Tuesday, August 27, 2013

Cloud Technologies

..here will be posting on hadoop infrastructure and data pipeline engineering...

  • Get Docs
    • http://hadoop.apache.org/docs/stable/hdfs_design.html
    • http://falcon.incubator.apache.org/index.html
    • http://hadoop.apache.org/docs/current/hadoop-yarn/hadoop-yarn-site/YARN.html

  • Get Hadoop installed
    • http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-single-node-cluster/
    • http://www.opensourceclub.net/hadoop/cloudera-hadoop-single-node-cluster-pseudo-distributed-mode-on-mac-os-x-lion/ 
    • http://www.cloudera.com/content/cloudera-content/cloudera-docs/CDH4/4.2.0/CDH4-Quick-Start/cdh4qs_topic_3_3.html