Updated on 2022-09-14 GMT+08:00

Application Scenarios

Oozie is a workflow engine for managing Hadoop jobs. An Oozie workflow is defined as a directed acyclic graph (DAG) of actions. Oozie supports multiple workflow modes and several mechanisms for triggering workflows on a schedule. It is easy to extend and maintain, highly reliable, and works closely with the other components in the Hadoop ecosystem.
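
To make the DAG idea concrete, the following is a minimal Python sketch, not Oozie's own definition format or API. It models a hypothetical workflow as a mapping from each action to the actions it depends on, and derives one valid execution order. The action names and the execution_order helper are assumptions made for illustration only.

```python
from graphlib import CycleError, TopologicalSorter

# Hypothetical workflow: each action maps to the set of actions it depends on.
# "aggregate" and "report" both branch off "clean" and rejoin at "publish".
workflow = {
    "ingest": set(),
    "clean": {"ingest"},
    "aggregate": {"clean"},
    "report": {"clean"},
    "publish": {"aggregate", "report"},
}


def execution_order(graph):
    """Return one valid run order, or raise ValueError if the graph has a cycle."""
    try:
        return list(TopologicalSorter(graph).static_order())
    except CycleError as exc:
        # The second element of args holds the nodes that form the cycle.
        raise ValueError(f"not a DAG: {exc.args[1]}") from exc


order = execution_order(workflow)
print(order)
```

Because the graph must be acyclic, every action has a well-defined set of predecessors, so an engine can always decide what to run next. A definition that contains a cycle is rejected instead of looping forever.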

Oozie jobs are classified into three types:

  • Workflow: describes a complete basic service flow.
  • Coordinator: is built on workflows. It triggers workflows on a schedule or when specified conditions are met.
  • Bundle: is built on coordinators. It centrally schedules, controls, and manages coordinators.
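
As a rough illustration of scheduled triggering by a coordinator, the Python sketch below computes the times at which a workflow would be started, given a start time, an end time, and a fixed frequency. It is a conceptual model only, not Oozie code. The trigger_times function is hypothetical, and treating the end time as exclusive is an assumption of this sketch.

```python
from datetime import datetime, timedelta


def trigger_times(start, end, frequency):
    """Return every scheduled trigger time from start up to (not including) end."""
    if frequency <= timedelta(0):
        raise ValueError("frequency must be positive")
    times = []
    current = start
    while current < end:  # assumption: the end time itself does not trigger a run
        times.append(current)
        current += frequency
    return times


# Hypothetical schedule: one run every two hours over a six-hour window.
times = trigger_times(
    datetime(2022, 9, 14, 0, 0),
    datetime(2022, 9, 14, 6, 0),
    timedelta(hours=2),
)
for t in times:
    print(t.isoformat())
```

In this model, a bundle would simply be a group of such schedules that are started, suspended, or stopped together.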

Oozie provides the following features:

  • Supports workflow modes such as distribution (fork), aggregation (join), and selection (decision).
  • Works closely with the other components in the Hadoop ecosystem.
  • Supports parameterized workflow variables.
  • Supports scheduled workflow triggering.
  • Supports high availability (HA).
  • Provides a built-in web console that allows users to monitor workflows and view logs.
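
The idea behind parameterized workflow variables can be sketched in Python as follows. Oozie definitions refer to variables with the ${name} syntax and resolve them from job properties at submission time. This sketch only imitates plain variable substitution using Python's string.Template and does not cover Oozie's expression functions. The property names, paths, and the resolve helper are hypothetical.

```python
from string import Template

# Hypothetical job properties supplied at submission time.
properties = {
    "nameNode": "hdfs://example-cluster",
    "inputDir": "/user/demo/input",
    "outputDir": "/user/demo/output",
    "runDate": "2022-09-14",
}

# Hypothetical parameterized values as they might appear in a workflow definition.
templates = {
    "input": "${nameNode}${inputDir}",
    "output": "${nameNode}${outputDir}/${runDate}",
}


def resolve(templates, properties):
    """Substitute ${name} placeholders; raise KeyError if a variable is undefined."""
    return {key: Template(value).substitute(properties) for key, value in templates.items()}


resolved = resolve(templates, properties)
for key, value in resolved.items():
    print(f"{key} = {value}")
```

Keeping environment-specific values in the properties rather than in the definition lets the same workflow run unchanged against different clusters or dates.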