SageMakerTrainingOperatorAsync

Astronomer Providers

SageMakerTrainingOperatorAsync starts a model training job and polls for the status asynchronously. After training completes, Amazon SageMaker saves the resulting model artifacts to an Amazon S3 location that you specify.
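The asynchronous polling described above can be pictured with a small asyncio sketch. This is not the operator's actual implementation: `wait_for_training_job` and `make_fake_status_source` are hypothetical names, and the status source is a stub standing in for a call to SageMaker. It only illustrates how a coroutine can check a job status at an interval without blocking while it waits.

```python
import asyncio
import time

# Terminal values of TrainingJobStatus as reported by SageMaker.
TERMINAL_STATES = {"Completed", "Failed", "Stopped"}


async def wait_for_training_job(get_status, check_interval=30, max_ingestion_time=None):
    """Poll get_status() until the job reaches a terminal state.

    get_status is a hypothetical async callable that returns the status string.
    """
    start = time.monotonic()
    while True:
        status = await get_status()
        if status in TERMINAL_STATES:
            return status
        elapsed = time.monotonic() - start
        if max_ingestion_time is not None and elapsed > max_ingestion_time:
            raise TimeoutError(f"Training job still {status} after {elapsed:.0f}s")
        # Hand control back to the event loop instead of blocking while waiting.
        await asyncio.sleep(check_interval)


def make_fake_status_source(statuses):
    """Stub that replays a fixed list of statuses (stands in for a real API call)."""
    remaining = iter(statuses)

    async def get_status():
        return next(remaining)

    return get_status


fake_source = make_fake_status_source(["InProgress", "InProgress", "Completed"])
final_status = asyncio.run(wait_for_training_job(fake_source, check_interval=0.01))
print(final_status)
```

The `check_interval` and `max_ingestion_time` arguments here mirror the operator parameters of the same names, but their handling in this sketch is an assumption made for illustration.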

Access Instructions

Install the Astronomer Providers package in your Airflow environment.

Import the operator into your DAG file and instantiate it with the parameters you need.
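A minimal sketch of these two steps follows. It assumes the package is installed with `pip install "astronomer-providers[amazon]"` and that the operator is importable from `astronomer.providers.amazon.aws.operators.sagemaker`; verify both against the version you use. The job name, role ARN, image URI, and S3 path are hypothetical placeholders, and `create_dag` is an illustrative helper name.

```python
from datetime import datetime

# Hypothetical placeholder values; the keys follow SageMaker.Client.create_training_job.
TRAINING_CONFIG = {
    "TrainingJobName": "example-training-job",
    "AlgorithmSpecification": {
        "TrainingImage": "123456789012.dkr.ecr.us-east-1.amazonaws.com/example:latest",
        "TrainingInputMode": "File",
    },
    "RoleArn": "arn:aws:iam::123456789012:role/example-sagemaker-role",
    "OutputDataConfig": {"S3OutputPath": "s3://example-bucket/output/"},
    "ResourceConfig": {
        "InstanceCount": 1,
        "InstanceType": "ml.m5.large",
        "VolumeSizeInGB": 10,
    },
    "StoppingCondition": {"MaxRuntimeInSeconds": 3600},
}


def create_dag():
    """Build a DAG containing a single asynchronous SageMaker training task."""
    # Imported inside the function so the config above can be inspected
    # even where Airflow is not installed.
    from airflow import DAG
    from astronomer.providers.amazon.aws.operators.sagemaker import (
        SageMakerTrainingOperatorAsync,
    )

    with DAG(
        dag_id="example_sagemaker_training_async",
        start_date=datetime(2023, 1, 1),
        catchup=False,
    ) as dag:
        SageMakerTrainingOperatorAsync(
            task_id="train_model",
            config=TRAINING_CONFIG,
            aws_conn_id="aws_default",
        )
    return dag
```

In a real DAG file, assign the result at module level (`dag = create_dag()`) so the scheduler can discover it.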

Parameters

config: The configuration necessary to start a training job (templated). For details of the configuration parameter, see SageMaker.Client.create_training_job.
aws_conn_id: The AWS connection ID to use.
print_log: Whether the operator should print the CloudWatch log during training.
check_interval: When the operator waits for the job to complete, the time interval, in seconds, at which it checks the status of the training job.
max_ingestion_time: The operation fails if the training job doesn’t finish within max_ingestion_time seconds. If you set this parameter to None, the operation does not time out.
check_if_job_exists: If set to True, the operator checks whether a training job already exists for the name in the config.
action_if_job_exists: Behaviour if the job name already exists. Possible options are “increment” (default) and “fail”. This is only relevant if check_if_job_exists is True.
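As a sketch of how these parameters fit together, the helper below collects them into keyword arguments for the operator. The helper name `training_operator_kwargs`, the task ID, and every value shown are illustrative assumptions, not recommended defaults; only the two `action_if_job_exists` options come from the description above.

```python
# The two options documented for action_if_job_exists.
ALLOWED_ACTIONS = {"increment", "fail"}


def training_operator_kwargs(config, action_if_job_exists="increment"):
    """Collect keyword arguments for SageMakerTrainingOperatorAsync (illustrative values)."""
    if action_if_job_exists not in ALLOWED_ACTIONS:
        raise ValueError(
            f"action_if_job_exists must be one of {sorted(ALLOWED_ACTIONS)}, "
            f"got {action_if_job_exists!r}"
        )
    return {
        "task_id": "train_model",            # hypothetical task name
        "config": config,                    # passed to create_training_job
        "aws_conn_id": "aws_default",        # Airflow connection holding AWS credentials
        "print_log": True,                   # print the CloudWatch log during training
        "check_interval": 60,                # seconds between status checks
        "max_ingestion_time": 6 * 60 * 60,   # fail after six hours; None disables the timeout
        "check_if_job_exists": True,         # look for an existing job with the same name
        "action_if_job_exists": action_if_job_exists,
    }


# Remaining create_training_job fields are omitted here for brevity.
example_config = {"TrainingJobName": "example-training-job"}
kwargs = training_operator_kwargs(example_config, action_if_job_exists="fail")
print(kwargs["action_if_job_exists"])
```

The resulting dictionary can then be unpacked into the operator inside a DAG, as in `SageMakerTrainingOperatorAsync(**kwargs)`.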

Documentation

See also

For more information on how to use this operator, take a look at the guide for SageMakerTrainingOperator (Sphinx reference: howto/operator:SageMakerTrainingOperator).