WebHDFSHook

Apache HDFS

Interact with HDFS. This class is a wrapper around the hdfscli library.

View on GitHub

Last Updated: Oct. 26, 2022

Access Instructions

Install the Apache HDFS provider package into your Airflow environment.

Import the module into your DAG file and instantiate it with your desired params.

Parameters

webhdfs_conn_idThe connection id for the webhdfs client to connect to.
proxy_userThe user used to authenticate.

Documentation

Interact with HDFS. This class is a wrapper around the hdfscli library.

Was this page helpful?