Tools
The MoJ Analytical Platform comes with various tools including:
Control panel
The main entry point to the Analytical Platform
RStudio
A development environment for writing R code and R Shiny apps
JupyterLab
A development environment for writing Python code
Airflow
A tool for scheduling and monitoring workflows
Data Discovery
The data engineering team maintain a number of curated databases on the Analytical Platform. The best way to find out about them is to use the Data Discovery tool.
Data Uploader
Under construction
Create a Derived Table
An implementation of dbt for creating persistent derived tables in Athena. Visit the Create a Derived Table repo to see progress, or sign up for testing in the #ask-data-modelling Slack channel.
Python packages
The data engineering team maintain the following Python packages:
R packages
The data engineering team maintain the following R package:
dbtools
A package for accessing Athena databases from the Analytical Platform.
The Analytical Platform community maintain the following R packages, which avoid the need to use Python in R projects:
Rdbtools
A native R package for accessing Athena databases from the Analytical Platform.
Rs3tools
A native R package for accessing AWS S3 from the Analytical Platform. It is largely compatible with the legacy package s3tools.