EmrStartNotebookExecutionOperator

Amazon

An operator that starts an EMR notebook execution.

View on GitHub

Last Updated: Mar. 2, 2023

Access Instructions

Install the Amazon provider package into your Airflow environment.

Import the module into your DAG file and instantiate it with your desired params.

Parameters

editor_idThe unique identifier of the EMR notebook to use for notebook execution.
relative_pathThe path and file name of the notebook file for this execution, relative to the path specified for the EMR notebook.
cluster_idThe unique identifier of the EMR cluster the notebook is attached to.
service_roleThe name or ARN of the IAM role that is used as the service role for Amazon EMR (the EMR role) for the notebook execution.
notebook_execution_nameOptional name for the notebook execution.
notebook_paramsInput parameters in JSON format passed to the EMR notebook at runtime for execution.
tagsOptional list of key value pair to associate with the notebook execution.
waiter_max_attemptsMaximum number of tries before failing.
waiter_delayNumber of seconds between polling the state of the notebook.
waiter_countdownTotal amount of time the operator will wait for the notebook to stop. Defaults to 25 * 60 seconds. (Deprecated. Please use waiter_max_attempts.)
waiter_check_interval_secondsNumber of seconds between polling the state of the notebook. Defaults to 60 seconds. (Deprecated. Please use waiter_delay.)

Documentation

An operator that starts an EMR notebook execution.

See also

For more information on how to use this operator, take a look at the guide: Start an EMR notebook execution

Was this page helpful?