BigQueryColumnCheckOperator

Google

BigQueryColumnCheckOperator subclasses the SQLColumnCheckOperator in order to provide a job id for OpenLineage to parse. See base class docstring for usage.


Last Updated: Apr. 14, 2023

Access Instructions

Install the Google provider package (apache-airflow-providers-google) into your Airflow environment.

Import the module into your DAG file and instantiate it with your desired params.
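The two steps above can be sketched as follows. This is a minimal illustration, not the provider's reference example: the task ID, table name, column names, and the helper `build_check_task` are hypothetical, and the check names (`null_check`, `unique_check`, `min`) and conditions (`equal_to`, `geq_to`) are taken from the base class, SQLColumnCheckOperator. The import is deferred into the helper so the configuration can be inspected without Airflow installed.

```python
# Checks keyed by column name; each check carries the condition it must satisfy.
# Column names here are hypothetical.
COLUMN_MAPPING = {
    "order_id": {
        "null_check": {"equal_to": 0},    # no NULL values allowed
        "unique_check": {"equal_to": 0},  # no duplicate values allowed
    },
    "amount": {
        "min": {"geq_to": 0},             # smallest value must be >= 0
    },
}

# Only `table` and `column_mapping` are required by the operator itself;
# `task_id` is required by every Airflow operator.
CHECK_KWARGS = {
    "task_id": "check_orders_columns",    # hypothetical task ID
    "table": "my_dataset.orders",         # hypothetical table
    "column_mapping": COLUMN_MAPPING,
    "use_legacy_sql": False,              # run the checks with standard SQL
}


def build_check_task():
    """Instantiate the operator; call this inside your `with DAG(...)` block."""
    # Imported lazily so this module loads even where the provider is absent.
    from airflow.providers.google.cloud.operators.bigquery import (
        BigQueryColumnCheckOperator,
    )

    return BigQueryColumnCheckOperator(**CHECK_KWARGS)
```

In a DAG file, calling `build_check_task()` inside the DAG context registers the task like any other operator.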

Parameters

table (required): The table name.
column_mapping (required): A dictionary relating columns to their checks.
partition_clause: A SQL string added to a WHERE clause to partition data.
gcp_conn_id (optional): The connection ID used to connect to Google Cloud.
use_legacy_sql: Whether to use legacy SQL (true) or standard SQL (false).
location: The geographic location of the job. See details at: https://cloud.google.com/bigquery/docs/locations#specifying_your_location
impersonation_chain (optional): Service account to impersonate using short-term credentials, or a chained list of accounts required to get the access_token of the last account in the list, which will be impersonated in the request. If set as a string, the account must grant the originating account the Service Account Token Creator IAM role. If set as a sequence, each identity in the list must grant the Service Account Token Creator IAM role to the directly preceding identity, with the first account in the list granting this role to the originating account (templated).
labels: A dictionary containing labels for the table, passed to BigQuery.
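As a rough sketch of how the optional parameters listed above fit together, the snippet below collects them in one dictionary and passes them alongside the required arguments. Every value is a placeholder: the service account emails, label, partition filter, and the helper `build_partitioned_check` are hypothetical and only illustrate the expected shapes.

```python
# Optional settings; all values are hypothetical placeholders.
OPTIONAL_KWARGS = {
    # Added to the WHERE clause so the checks only scan matching rows.
    "partition_clause": "order_date >= '2023-01-01'",
    "gcp_conn_id": "google_cloud_default",
    "use_legacy_sql": False,
    "location": "EU",
    # Sequence form: each account must grant Service Account Token Creator
    # to the one before it; the last account is the one impersonated.
    "impersonation_chain": [
        "intermediate-sa@my-project.iam.gserviceaccount.com",
        "target-sa@my-project.iam.gserviceaccount.com",
    ],
    "labels": {"team": "data-quality"},
}


def build_partitioned_check(task_id, table, column_mapping):
    """Create the operator with the optional settings applied."""
    # Imported lazily so this module loads even where the provider is absent.
    from airflow.providers.google.cloud.operators.bigquery import (
        BigQueryColumnCheckOperator,
    )

    return BigQueryColumnCheckOperator(
        task_id=task_id,
        table=table,
        column_mapping=column_mapping,
        **OPTIONAL_KWARGS,
    )
```

Passing a single string for `impersonation_chain` instead of a list selects the simpler case, in which that one account must grant the role directly to the originating account.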

Documentation


See also

For more information on how to use this operator, take a look at the guide: Check columns with predefined tests
