BigQueryTableCheckOperator

Google

BigQueryTableCheckOperator subclasses the SQLTableCheckOperator in order to provide a job id for OpenLineage to parse. See base class for usage.

View on GitHub

Last Updated: Mar. 16, 2023

Access Instructions

Install the Google provider package into your Airflow environment.

Import the module into your DAG file and instantiate it with your desired params.

Parameters

tableRequiredthe table name
checksRequireda dictionary of check names and boolean SQL statements
partition_clausea string SQL statement added to a WHERE clause to partition data
gcp_conn_id(Optional) The connection ID used to connect to Google Cloud.
use_legacy_sqlWhether to use legacy SQL (true) or standard SQL (false).
locationThe geographic location of the job. See details at: https://cloud.google.com/bigquery/docs/locations#specifying_your_location
impersonation_chainOptional service account to impersonate using short-term credentials, or chained list of accounts required to get the access_token of the last account in the list, which will be impersonated in the request. If set as a string, the account must grant the originating account the Service Account Token Creator IAM role. If set as a sequence, the identities from the list must grant Service Account Token Creator IAM role to the directly preceding identity, with first account from the list granting this role to the originating account (templated).
labelsa dictionary containing labels for the table, passed to BigQuery

Documentation

BigQueryTableCheckOperator subclasses the SQLTableCheckOperator in order to provide a job id for OpenLineage to parse. See base class for usage.

See also

For more information on how to use this operator, take a look at the guide: Check table level data quality

Was this page helpful?