EmrStartNotebookExecutionOperator
Amazon
An operator that starts an EMR notebook execution.
Access Instructions
Install the Amazon provider package (apache-airflow-providers-amazon) into your Airflow environment.
Import the operator into your DAG file and instantiate it with the parameters you need.
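The two steps above can be sketched as follows. This is a minimal illustration, not a definitive setup: every identifier value (the notebook ID, cluster ID, role name, notebook path, task_id, and tag) is a hypothetical placeholder, and the Key/Value shape used for tags is an assumption based on the tag format the EMR API accepts.

```python
import json

# Hypothetical placeholder values; substitute identifiers from your own AWS account.
NOTEBOOK_EXECUTION_ARGS = {
    "task_id": "start_notebook_execution",
    "editor_id": "e-EXAMPLE1234567890",
    "relative_path": "reports/daily.ipynb",
    "cluster_id": "j-EXAMPLE12345",
    "service_role": "EMR_Notebooks_DefaultRole",
    "notebook_execution_name": "daily-report",
    # notebook_params is passed as a JSON-formatted string.
    "notebook_params": json.dumps({"run_date": "2024-01-01"}),
    # Assumed tag shape: a list of Key/Value dictionaries.
    "tags": [{"Key": "team", "Value": "analytics"}],
}


def make_start_notebook_task():
    """Instantiate the operator; call this inside a `with DAG(...)` block."""
    # Imported inside the function so the argument dict above can be
    # inspected even where the Amazon provider package is not installed.
    from airflow.providers.amazon.aws.operators.emr import (
        EmrStartNotebookExecutionOperator,
    )

    return EmrStartNotebookExecutionOperator(**NOTEBOOK_EXECUTION_ARGS)
```

Keeping the arguments in a plain dictionary is only a convenience for this sketch; passing them directly as keyword arguments in the DAG file works equally well.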
Parameters
editor_id: The unique identifier of the EMR notebook to use for notebook execution.
relative_path: The path and file name of the notebook file for this execution, relative to the path specified for the EMR notebook.
cluster_id: The unique identifier of the EMR cluster the notebook is attached to.
service_role: The name or ARN of the IAM role that is used as the service role for Amazon EMR (the EMR role) for the notebook execution.
notebook_execution_name: Optional name for the notebook execution.
notebook_params: Input parameters in JSON format passed to the EMR notebook at runtime for execution.
tags: Optional list of key-value pairs to associate with the notebook execution.
waiter_max_attempts: Maximum number of tries before failing.
waiter_delay: Number of seconds between polling the state of the notebook.
waiter_countdown: Total amount of time the operator will wait for the notebook to stop. Defaults to 25 * 60 seconds. (Deprecated. Please use waiter_max_attempts.)
waiter_check_interval_seconds: Number of seconds between polling the state of the notebook. Defaults to 60 seconds. (Deprecated. Please use waiter_delay.)
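When moving off the deprecated waiter_countdown and waiter_check_interval_seconds parameters, the total time budget has to be expressed as a number of polling attempts. The helper below is a hypothetical sketch of one reasonable mapping (total wait divided by the polling interval, rounded up); it is not taken from the provider's source, and the function name is an illustration only.

```python
import math


def migrate_waiter_settings(waiter_countdown=25 * 60, waiter_check_interval_seconds=60):
    """Translate the deprecated settings into waiter_max_attempts and waiter_delay.

    The defaults mirror the documented defaults of the deprecated parameters.
    """
    if waiter_check_interval_seconds <= 0:
        raise ValueError("waiter_check_interval_seconds must be positive")

    # The polling interval maps directly onto waiter_delay.
    waiter_delay = waiter_check_interval_seconds
    # Round up so the total wait is never shorter than the old countdown,
    # and always allow at least one attempt.
    waiter_max_attempts = max(1, math.ceil(waiter_countdown / waiter_delay))

    return {"waiter_max_attempts": waiter_max_attempts, "waiter_delay": waiter_delay}
```

With the documented defaults (25 * 60 seconds total, 60 seconds between polls) this yields 25 attempts at a 60-second delay; the resulting dictionary can be unpacked into the operator's keyword arguments.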
Documentation
An operator that starts an EMR notebook execution.
See also
For more information on how to use this operator, take a look at the guide: Start an EMR notebook execution