python-ci
python-ci copied to clipboard
Sample CI setup for Python projects
Standard Python project CI setup
There are many ways to set up continuous integration for your Python project. This is my personal flavour of doing things. Feel free to pick-and-choose the parts that you like.
This README includes some justification and references for the choices made in this setup.
Table of contents
- pre-commit
- pytest
- Makefile
- Poetry
- GitHub Actions
pre-commit
pre-commit is an awesome framework which many Python projects use. It allows you to select 'hooks' for various formatters and linters you want to use.
Run pre-commit install after setting up your local environment to enable pre-commit to run all hooks whenever you do a git commit. The commit will be cancelled if not all hooks run successfully. To commit anyway, run with --no-verify.
The following hooks have been selected for this CI setup:
- pre-commit-hooks: Some generic hooks not specific to Python.
- ruff: An extremely fast Python linter and formatter. Includes lints and formatting popularized by various other tools like
black,flake8andpyupgrade, all in one tool. Replaces all linting and autoformatting tools except formypy. Install the VSCode or PyCharm extension for the best developer experience. Adjust settings in thepyproject.tomlas desired. - mypy: mypy is a static type checker for Python. One of the best things you can do for your code base is add type hints and be consistent with them. In this repo, mypy is configured with all strictness options enabled. Note that for mypy to work correctly as a pre-commit hook, you must define your main dependencies as
additional_dependenciesin the pre-commit hook. If you have many dependencies, it may be better to remove the mypy pre-commit hook and run mypy alongside your tests.
pytest
pytest is without question the best Python testing framework out there. Tests written in this framework are much more readable than when using Python's built-in unittest framework.
pytest is extensible. I advise using pytest-mock for your mocking needs. pytest-spark is useful when you're working with pyspark.
Test coverage is calculated using the coverage package.
Makefile
The Makefile is used in this repo as a collection of small useful scripts. Most notably:
make fmtruns autoformatting and lintingmake testruns testsmake coverageruns tests and generates a coverage report
Simply run make to get an overview of available commands.
Poetry
Poetry is an amazing, modern tool for developing Python packages. See my Poetry guide for pointers on using Poetry effectively.
Note that the dependency specification for this repository contains two dependency groups:
test: Includes all testing dependencies.lint: Includes all linting dependencies. This can be useful to help your IDE do autoformatting or show in-line linting errors.
Having these development dependencies in separate groups makes it easy to install only the required dependencies in the CI workflows.
GitHub Actions
GitHub Actions is GitHub's CI/CD offering. It allows you to enforce your linting checks and tests for new features, making sure your repo remains in good shape.
I included two separate workflows, one for linting and one for testing. Both workflows utilize caching to speed up subsequent runs, and define concurrency to save some more compute.
For open source repos, I recommend use the official pre-commit CI instead of the linting workflow in this repository. It has some nice bonuses, like keeping your pre-commit hooks up-to-date automatically.
Dependabot
The repo also includes a Dependabot configuration. This can help keep your Python dependencies and GitHub Actions up-to-date.
Because Dependabot can get a bit spammy with its pull requests, it's configured to skip patch versions and only open pull requests once a week.