MoJ Analytical Services
splink
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
airflow-pdf2embeddings
NLP tool for scraping text from a corpus of PDF files, embedding the sentences in the text and finding semantically similar sentences to a given search query.
coffee-and-coding-public
MoJ coffee and coding sessions that can be made publicly available
etl-pipeline-example
An example ETL pipeline that lays out generic data engineering (DE) processes. It is now out of date but still provides useful information.
etl_manager
A Python package to create a database on the platform using our MoJ data warehousing framework
xltabr
An R package for writing formatted cross tabulations (contingency tables) to Excel using openxlsx