Idea #18337

open

Easy entry into Arvados ecosystem

Added by Tom Clegg over 3 years ago. Updated 3 months ago.

Status: New
Priority: Normal
Assigned To: -
Target version: -
Start date: 01/01/2025
Due date: 06/30/2025 (Due in about 5 months)
Story points: -
Release:
Release relationship: Auto

Description

This epic is about minimizing the friction involved in running Arvados at single-host scale. There are a few variants of this:

  • Arvados appliance: Arvados pre-installed on dedicated hardware and shipped to users
  • Self-install: users self-installing Arvados on dedicated hardware similar to the appliance
  • Desktop install: users run Arvados using a container or VM on their laptop, with convenient desktop integration
  • Ephemeral instances: users start a fresh Arvados instance, use it to run computations, and export the results, after which the instance is suspended or wiped. The exported results should include all of the provenance information.

The goal for all of these is for someone to be able to get an Arvados instance running in a few clicks.

The follow-up goal is that once users have access to Arvados, they are not looking at an empty instance: they have easy access to biomedical data and workflows that they can start using immediately.

Previously

Setting up a new single-node Arvados cluster should look something like this:

# add-apt-repository ...
# apt-get install arvados-server
# arvados-server init [some options]
[answer some questions]
OK, you can now log in to https://workbench2.example.com/
# arvados-server health
OK
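
A freshly initialized cluster may need a moment before its services report healthy, so an automated installer would probably poll the health check rather than run it once. The following is a minimal sketch of such a polling helper in POSIX shell. The function name wait_for_ok and its arguments are hypothetical, introduced here for illustration only; the helper accepts any command and treats a final output line of "OK" as success, matching the transcript above.

```shell
#!/bin/sh
# Hypothetical helper (not part of Arvados): retry a health-check command
# until its last line of output is "OK" or the attempts run out.
# Usage: wait_for_ok ATTEMPTS DELAY_SECONDS COMMAND [ARGS...]
wait_for_ok() {
    attempts=$1
    delay=$2
    shift 2
    i=1
    while [ "$i" -le "$attempts" ]; do
        # Succeed only if the command exits 0 and its last output line is "OK".
        if out=$("$@" 2>/dev/null) &&
            [ "$(printf '%s\n' "$out" | tail -n 1)" = "OK" ]; then
            return 0
        fi
        # Do not sleep after the final failed attempt.
        if [ "$i" -lt "$attempts" ]; then
            sleep "$delay"
        fi
        i=$((i + 1))
    done
    return 1
}
```

Assuming the health command behaves as in the proposed transcript, it could be polled with "wait_for_ok 10 5 arvados-server health", which tries up to 10 times with 5 seconds between attempts.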

Scaling up by adding new system nodes should be just as easy.

It is okay if some installation environments (or sysadmin preferences) necessitate additional manual intervention. The idea is to make it extremely easy to start a production-quality cluster on a few suitable types of system that most people can get access to, like an on-prem server or publicly reachable cloud instance running Debian stable.

Things you don't currently get by installing arvados-server-easy on a cloud VM:

  • Cloud storage (currently it just stores data on local disk by default, in /var/lib/arvados)
  • A diagnostics tool that uses the system root token from the config file when run on a server node, instead of requiring env vars (#17344)

Related issues 8 (5 open, 3 closed)

Related to Arvados - Idea #16552: "arvados-server init" can get TLS certificates from Let's Encrypt (Resolved, Tom Clegg, 07/14/2022)
Related to Arvados - Idea #18338: "arvados-server init" can use a local root CA to sign certificates (New)
Related to Arvados - Idea #18341: "arvados-server init" can set up a single-node production cluster (New)
Related to Arvados - Idea #17344: [boot] Make arvados-server-easy package suitable for demo use case (Resolved, Tom Clegg, 07/15/2022)
Related to Arvados - Bug #17345: [boot] handle dependency upgrades better in "arvados-package build" (New)
Related to Arvados Epics - Idea #18685: Synchronize configuration on multi-node cluster (New)
Related to Arvados - Feature #19709: Apply database migrations if needed when starting arvados-server boot (Resolved, Tom Clegg, 12/08/2022)
Related to Arvados - Feature #16385: add prebuilt container images for Arvados releases (In Progress, Peter Amstutz)