Transfers data from Amazon Glacier to Google Cloud Storage

Last Updated: Sep. 13, 2022

Access Instructions

Install the Amazon provider package into your Airflow environment.

Import the module into your DAG file and instantiate it with your desired params.


aws_conn_id: The reference to the AWS connection details.
gcp_conn_id: The reference to the GCP connection details.
vault_name (required): The Glacier vault on which the job is executed.
bucket_name (required): The Google Cloud Storage bucket to which the data will be transferred.
object_name (required): The name of the object to check in the Google Cloud Storage bucket.
gzip (required): Option to compress the local file or file data for upload.
chunk_size: Size, in bytes, of each chunk that will be downloaded from the Glacier vault.
delegate_to: The account to impersonate using domain-wide delegation of authority, if any. For this to work, the service account making the request must have domain-wide delegation enabled.
google_impersonation_chain: Optional Google service account to impersonate using short-term credentials, or a chained list of accounts required to get the access_token of the last account in the list, which will be impersonated in the request. If set as a string, the account must grant the originating account the Service Account Token Creator IAM role. If set as a sequence, each identity in the list must grant the Service Account Token Creator IAM role to the directly preceding identity, with the first account in the list granting this role to the originating account (templated).
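As a minimal sketch of the instantiation step, the parameters above can be collected into a dictionary and passed to the operator. The import path, the connection IDs, the DAG ID, the task ID, and all vault, bucket, and object names below are assumptions chosen for illustration; check them against the provider version you have installed. The Airflow imports are deferred into a function so that the parameter check can run without Airflow present.

```python
from datetime import datetime

# Keyword arguments for GlacierToGCSOperator. All values are placeholders.
transfer_kwargs = {
    "task_id": "glacier_to_gcs",            # hypothetical task name
    "aws_conn_id": "aws_default",           # assumed connection ID
    "gcp_conn_id": "google_cloud_default",  # assumed connection ID
    "vault_name": "my-vault",               # hypothetical Glacier vault
    "bucket_name": "my-bucket",             # hypothetical GCS bucket
    "object_name": "archive/data.bin",      # hypothetical object name
    "gzip": False,
    "chunk_size": 1024,                     # bytes per downloaded chunk
}

# Parameters marked as required in the list above.
REQUIRED = ("vault_name", "bucket_name", "object_name", "gzip")


def missing_required(kwargs):
    """Return the required parameter names that are absent from kwargs."""
    return [name for name in REQUIRED if name not in kwargs]


def build_dag():
    """Create a DAG containing one transfer task (needs Airflow installed)."""
    # Assumed module path for the operator in the Amazon provider package.
    from airflow import DAG
    from airflow.providers.amazon.aws.transfers.glacier_to_gcs import (
        GlacierToGCSOperator,
    )

    with DAG(
        dag_id="example_glacier_to_gcs",  # hypothetical DAG ID
        start_date=datetime(2022, 1, 1),
        catchup=False,
    ) as dag:
        GlacierToGCSOperator(**transfer_kwargs)
    return dag
```

In a DAG file, assigning `dag = build_dag()` at module level would let the scheduler discover the DAG.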



Be warned that GlacierToGCSOperator may depend on available memory. Transferring large files may not work well.
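To illustrate why a chunk size matters for memory use, here is a generic chunked copy between two binary streams. This is a sketch of the general technique only, not the operator's actual implementation; the function name `copy_in_chunks` and the in-memory streams are hypothetical stand-ins.

```python
import io


def copy_in_chunks(source, destination, chunk_size=1024):
    """Copy a binary stream chunk by chunk and return the bytes copied.

    Only one chunk of at most chunk_size bytes is held in this function
    at a time.
    """
    total = 0
    while True:
        chunk = source.read(chunk_size)
        if not chunk:  # an empty read means the source is exhausted
            break
        destination.write(chunk)
        total += len(chunk)
    return total


# In-memory streams stand in for the download source and the local file.
src = io.BytesIO(b"x" * 2500)
dst = io.BytesIO()
copied = copy_in_chunks(src, dst, chunk_size=1024)
```

A larger chunk size means fewer reads but more memory held per read; a smaller one trades the reverse.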

See also

For more information on how to use this operator, take a look at the guide: Amazon S3 Glacier To GCS transfer operator
