Here will start will Falcon first look, as we having been using for our actual production operations for data pipelines.
Will go through how to configure and onboard a pipeline on Falcon..
http://falcon.incubator.apache.org/docs/InstallationSteps.html
http://falcon.incubator.apache.org/docs/FalconArchitecture.html
http://falcon.incubator.apache.org/docs/OnBoarding.html
http://falcon.incubator.apache.org/docs/EntitySpecification.html
http://falcon.incubator.apache.org/docs/FalconCLI.html
First get your hadoop clusters, with oozie and activemq ready, yes clusters.. we will have two hadoop setups for out activity..
keep watching..
CloudxLab
Sunday, December 1, 2013
Tuesday, August 27, 2013
Cloud Technologies
..here will be posting on hadoop infrastructure and data pipeline engineering...
- Get Docs
- http://hadoop.apache.org/docs/stable/hdfs_design.html
- http://falcon.incubator.apache.org/index.html
- http://hadoop.apache.org/docs/current/hadoop-yarn/hadoop-yarn-site/YARN.html
- Get Hadoop installed
- http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-single-node-cluster/
- http://www.opensourceclub.net/hadoop/cloudera-hadoop-single-node-cluster-pseudo-distributed-mode-on-mac-os-x-lion/
- http://www.cloudera.com/content/cloudera-content/cloudera-docs/CDH4/4.2.0/CDH4-Quick-Start/cdh4qs_topic_3_3.html
Subscribe to:
Posts (Atom)