data-science-lab icon indicating copy to clipboard operation
data-science-lab copied to clipboard

Instructions for installing and configuring a data science lab.

Data Science Lab

These files describe how to build a data lab with RStudio professional products. This guide assumes you are using EC2 on AWS, but you can use any sandbox. Here are the requirements. For detailed instructions see the RStudio admin guides

  • A modern Linux operating system
  • An internet connection
  • Sudo access

If you are new to Linux, consider reading The Unix Workbench by Sean Kross. This concise manual explains basic commands for working with Unix/Linux.

Instance

Create a Linux Server on AWS. Or you can use a VM on your local desktop.

  1. Setup. Setup the instance.
  2. Libraries. Install system dependencies.

Data Lab

Install and configure RStudio professional products that allow you to access your data, automate tasks, scale Shiny applications, host API's, and much, much more.

  1. R. Build multiple versions of R from source.
  2. Packages. Install essential packages.
  3. RStudio. Install RStudio Server Pro, RStudio Connect, and RStudio Professional Drivers.

Integration

Install other applications on the server. These will be used to integrate R apps, docs, and API's with other products.

  1. Database. Install and configure PostgreSQL.
  2. Web server. Install and configure Apache web server.
  3. Email. Install and configure sendmail.