Oozie: Does oozie generate output-events? Oozie: Does oozie generate output-events? hadoop hadoop

Oozie: Does oozie generate output-events?


The official Oozie documentation for Oozie Coordinator is not very clear on the exact purpose of the output-events element. However, the book "Apache Oozie: The Workflow Scheduler for Hadoop" mentions the following:

During reprocessing of a coordinator, Oozie tries to help the retry attempt by cleaning up the output directories by default. For this, it uses the <output-events> specification in the coordinator XML to remove the old output before running the new attempt. Users can override this default behavior using the –noCleanup option.

So, in summary:

  • No, files specified in output-events are not automatically created by Oozie, you need to create those files in your Oozie workflow actions.
  • The output-events configuration is for giving Oozie information on what files will be created by your Oozie workflow actions, which Oozie would use to cleanup files when rerunning/reprocessing a coordinator.


Always the actions generate the data, these settings are just for control.You'll find some examples here