Monthly Archives: August 2014

Oozie : Pig Action – Pig Jobs using oozie

Hi All, In my upcoming posts, lets discuss about oozie and how to implement various actions using it… What are OOzie and Pig??   Apache Oozie is a system for running workflows of dependent jobs. It is composed of two main parts: a workflow … Continue reading

Posted in oozie, Uncategorized | Tagged , , , , , , , , , | Leave a comment

Sqoop : Incremental Imports using Last-Modified mode

As discussed in my previous post, Sqoop is a tool designed to transfer data between Hadoop and relational databases. Incremental imports mode can be used to retrieve only rows newer than some previously-imported set of rows. Why & When Last-Modified … Continue reading

Posted in sqoop | Tagged , , , , | 3 Comments

Sqoop : Incremental Imports using Append mode

As you all know, Sqoop is a tool designed to transfer data between Hadoop and relational databases. Incremental imports mode can be used to retrieve only rows newer than some previously-imported set of rows. Why Append mode ?? works for numerical data … Continue reading

Posted in sqoop | Tagged , , , , , | 7 Comments