

Other similar projects include Luigi, Oozie and Azkaban.Īirflow is commonly used to process data, but has the opinion that tasks should ideally be idempotent (i.e., results of the task will be the same, and will not create duplicated data in a destination system), and should not pass large quantities of data from one task to the next (though tasks can pass metadata using Airflow's XCom feature). When the DAG structure is similar from one run to the next, it clarifies the unit of work and continuity.

Rich command line utilities make performing complex surgeries on DAGs a snap.
.png)
The Airflow scheduler executes your tasks on an array of workers while following the specified dependencies. Use Airflow to author workflows as directed acyclic graphs (DAGs) of tasks. When workflows are defined as code, they become more maintainable, versionable, testable, and collaborative. Apache Airflow (or simply Airflow) is a platform to programmatically author, schedule, and monitor workflows.
