dbt-helper
dbt-helper copied to clipboard
A helper tool for dbt development and data warehouse management.
dbt-helper (beta)
NOTE: dbt-helper currently only works with dbt version >= 0.17.1. This is still in extremely-limited beta release. We'd love your help, testing it though! Until we're out of beta, please don't build this into anything production-touching as it will surely get broken sooner rather than later
dbt-helper is a command line tool that helps with developing dbt projects and managing data warehouses.
This repository is not formally associated with dbt and is not maintained by fishtown-analytics (the maintainers of dbt). If you have problems with one of these (dbt-helper) tools, please file an issue on this repository and do not bother the dbt maintainers about it.
Installation
pip install dbt-helper
NOTE: dbt-helper may not work with dbt when dbt is installed via homebrew. We are currently investigating options for fixes, but if you experience issues that might be the culprit.
Usage
dbt-helper (currently) has seven sub-commands:
compare: Compare the relations in your warehouse with those that dbt is managing. This is useful for identifying "stale" relations that are no longer being updated by dbt (like if, for example, you converted the model from materialized to ephemeral).- Note:
dbt-helper comparewill compare all schemas that are impacted by models in themodels/directory. There is (currently) no way to specify a single schema to compare.
- Note:
bootstrap: Create starter "schema.yml" files for your project. This function helpfully generates boilerplate dbt-model files for you so you don't have to go through the copy/paste when you're developing a new model.- Note: this command will not over-write existing
schema.ymlfiles. It will default to printing templates to the console, but you can create new files by using the--write-filesflag.
- Note: this command will not over-write existing
show-upstream: Inspect the dbt graph and show the relations that are "upstream" from (i.e., the "parents" of) the selected relation. Print to the terminal.show-downstream: The same asshow-upstreambut in the other direction -- show dependents 'downstream' from (i.e., the "children" of) the selected relationfind: Find the compiled.sqlfile for a model by providing the model name only. You can also find the source or run.sqlfiles for a model by using the appropriate flag. Useful when working in large dbt projects and you want to find files quickly wihout having to navigate a file tree.open: Open the compiled.sqlfile for a model by providing the model name only. Works the same as find, but directly opens the file in your text editor.retry-failed: Rerun models that errored or were skipped on your previous dbt run.
As one might hope, you can view the command line options directly from the tool by using the help functionality:
dbt-helper --help
Usage Details:
compare
$ dbt-helper compare
Comparing local models to the database catalog. Checking schemas:
- dev_downstream
- dev_example
Warning: The following relations do not match any models found in this project:
TABLE "dev_example"."b"
VIEW "dev_downstream"."d"
bootstrap
$ dbt-helper bootstrap --schemas dev_example
Bootstrapping the following schemas:
- dev_example
--------------------
Design for relation: dev_example.my_first_dbt_model
--------------------
version: 2
models:
name: my_first_dbt_model
- columns:
- name: id
description: 'TODO: Replace me'
--------------------
Design for relation: dev_example.b
--------------------
version: 2
models:
name: b
- columns:
- name: city
- name: count
description: 'TODO: Replace me'
show-upstream
$ dbt-helper show-upstream d
--------------------------------------------------------------------------------
downstream.d
--------------------------------------------------------------------------------
example.b
--------------------------------------------------------------------------------
show-downstream
see show-upstream
find
# Find the compiled version of model
$ dbt-helper find my_model
# Same as above
$ dbt-helper find my_model --compiled
# Find the run version of the model
$ dbt-helper find my_model --run
# Find the source version of the model
$ dbt-helper find my_model --source
Understanding the flags:
- The
--compiledflag will find the relevant.sqlfile in thetarget/compileddirectory. This file contains the compiledSELECTquery. - The
--runflag will find the relevant.sqlfile in thetarget/rundirectory. Therunversion of the model contains the compiled query wrapped in the DDL (i.e.CREATE) statements to materialize it in the warehouse. - The
--sourceflag will find the relevant jinja-flavored.sqlfile in themodels/directory.
Without a flag, dbt-helper will find the compiled model.
open
# Open the compiled version of model
$ dbt-helper open my_model
# Same as above
$ dbt-helper open my_model --compiled
# Open the run version of the model
$ dbt-helper open my_model --run
# Open the source version of the model
$ dbt-helper open my_model --source
# Print model to STDOUT rather than opening in a text editor
$ dbt-helper open my_model --print
These flags work exactly the same was find above, so check there for more detail on these.
If dbt-helper is opening your file in the wrong text editor, update your
$EDITOR environment variable to be the command you normally use to open a file
in your text editor, e.g. vim, emacs, atom, code, or subl. If you
would prefer to not update your $EDITOR env var, you can instead create a
$DBT_HELPER_EDITOR environment variable for your command.
dbt-helper will try to use one of the following commands to open the file (in order of precedence):
$DBT_HELPER_EDITOR. If this is not set, then it will use:$EDITOR: A standard environment variable used across a number of applications (e.g. git prompts launch in the editor specified here). If this is not set, then use:"Open": Generic command that will open a file in the default application for the associated file type.
retry-failed
$ dbt-helper retry-failed
dbt run --models my_failed_model my_skipped_model
Running with dbt=0.13.0
Found 8 models, 20 tests, 0 archives, 0 analyses, 113 macros, 0 operations, 3 seed files, 0 sources
17:40:11 | Concurrency: 1 threads (target='dev')
17:40:11 |
17:40:11 | 1 of 2 START view model dev_example.my_failed_model.................. [RUN]
17:40:12 | 1 of 2 OK created view model dev_example.my_failed_model............. [CREATE VIEW in 0.65s]
17:40:12 | 2 of 2 START table model dev_example.my_skipped_model................ [RUN]
17:40:29 | 2 of 2 OK created table model dev_example.my_skipped_model........... [SELECT in 16.86s]
17:41:31 |
17:41:31 | Finished running 1 view models, 1 table models in 80.96s.
Contributing
Install locally for development:
pip install -e .
Conventions
We follow a few conventions in this repository.
Git stuff
Please name your branch names with a prefix denoting the type of change you're making. For example:
feature/my-descriptive-feature-name
fix/describe-the-fix
docs/whats-changing
Please also follow generally accepted git best practices by using descriptive commit messages and squashing your commits (where possible) before submitting your PR (or after making any required changes).
Python formatting
We use black to format python code in this project. Please use it! (And consider using it for all of your other python projects while you're at it.)
Running the tests
Run the tests using your local environment:
tox -e dev -- --nocapture
Run the tests locally using docker:
docker-compose run test tox -- --nocapture
If you want to specify a particular test to run, you can pass in a path to an integration test to either of the above commands, e.g.:
tox -e dev -- --nocapture test/integration/001_compare_test/
Giving Thanks
- Thanks to Claire Carroll for the
openandretry-failedfunctions - Thanks to Drew Banin for invaluable discussion and code-review on early
dbt-helperfeatures. - Thanks to John Lynch for early beta-testing and feedback.
- Thanks to Leon Tchikindas for the blog post (and code) inspiring this command-line graph builder.