BigQueryColumnCheckOperator
GoogleBigQueryColumnCheckOperator subclasses the SQLColumnCheckOperator in order to provide a job id for OpenLineage to parse. See base class docstring for usage.
Access Instructions
Install the Google provider package into your Airflow environment.
Import the module into your DAG file and instantiate it with your desired params.
Parameters
tableRequiredthe table name
column_mappingRequireda dictionary relating columns to their checks
partition_clausea string SQL statement added to a WHERE clause to partition data
gcp_conn_id(Optional) The connection ID used to connect to Google Cloud.
use_legacy_sqlWhether to use legacy SQL (true) or standard SQL (false).
locationThe geographic location of the job. See details at: https://cloud.google.com/bigquery/docs/locations#specifying_your_location
impersonation_chainOptional service account to impersonate using short-term credentials, or chained list of accounts required to get the access_token of the last account in the list, which will be impersonated in the request. If set as a string, the account must grant the originating account the Service Account Token Creator IAM role. If set as a sequence, the identities from the list must grant Service Account Token Creator IAM role to the directly preceding identity, with first account from the list granting this role to the originating account (templated).
labelsa dictionary containing labels for the table, passed to BigQuery
Documentation
BigQueryColumnCheckOperator subclasses the SQLColumnCheckOperator in order to provide a job id for OpenLineage to parse. See base class docstring for usage.
See also
For more information on how to use this operator, take a look at the guide: Check columns with predefined tests