Hadoop - Oozie (Job Scheduler)

About

Oozie is a scheduler for Apache Hadoop jobs.

It's a Java Web application.

job

  • Apache MapReduce,
  • Apache Pig,
  • Apache Hive,
  • Apache Sqoop
  • System job, like Java programs or shell scripts.

Process

Oozie combines multiple jobs sequentially into one logical unit of work.

Documentation / Reference

Task Runner