dbt-databricks icon indicating copy to clipboard operation
dbt-databricks copied to clipboard

Support for new serverless compute

Open gaoshihang opened this issue 1 year ago • 3 comments
trafficstars

Describe the feature

Databricks just released the new Serverless Compute at May 1st, I think it will help us a lot on the pipeline, to avoid slow spin-up time of All-purpose/job cluster. So, can we add support for this type of Compute?

Describe alternatives you've considered

Additional context

https://docs.databricks.com/en/workflows/jobs/run-serverless-jobs.html

Who will this benefit?

I think all kinds of job can get benefit from this, because we don't need to control the cluster by ourselves.

Are you interested in contributing this feature?

Yes, but don't know how to do.

gaoshihang avatar May 02 '24 18:05 gaoshihang

In progress; it will only work for python models, since much like job clusters, I don't think these nodes have a thrift server.

benc-db avatar Jun 17 '24 17:06 benc-db

Hi @benc-db ,

I would like to use Serverless as compute for the DBT-CLI - as this would save around five minutes to spin up the compute for DBT-CLI for each DBT-Job.

Currently trying to do that results in the following error-message: Databricks adapter: Connection(session-id=Unknown) - Exception while trying to create connection: Error during request to server Error properties: attempt=1/30, bounded-retry-delay=None, elapsed-seconds=843.3235409259796/900.0, error-message=, http-code=None, method=OpenSession, no-retry-reason=non-retryable error, original-exception=Retry request would exceed Retry policy max retry duration of 900.0 seconds, query-id=None, session-id=None Will your Pull request also solve this issue?

Thanks! :-)

dbph avatar Jul 01 '24 12:07 dbph

@dbph I would file a ticket with your company's Databricks contact, because that scenario is already supposed to be supported, and we'd need investigation from the Jobs team to know why its not in your case.

benc-db avatar Jul 01 '24 15:07 benc-db