Python SDK for Databricks Lakehouse development
Top 66.4% on sourcepulse
This Python SDK provides a comprehensive interface for interacting with the Databricks Lakehouse platform, targeting developers and data engineers who need to automate Databricks workflows and manage resources programmatically. It aims to simplify Databricks operations by abstracting the underlying REST APIs.
How It Works
The SDK offers a robust internal HTTP client with intelligent retry mechanisms for handling failures. It exposes a WorkspaceClient
and AccountClient
for managing workspace resources and account-level configurations, respectively. Authentication is flexible, supporting Databricks native tokens, Azure AD, and GCP credentials, with a prioritized lookup order for configuration. Long-running operations are managed via a Waiter
interface, and paginated API responses are abstracted into Python iterators.
Quick Start & Requirements
pip install databricks-sdk
Highlighted Details
Waiter
interface for managing long-running operations like cluster creation and job execution.dbutils
functionality (e.g., dbutils.fs
, dbutils.secrets
) implemented natively in Python.Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
2 days ago
1 week