Project

General

Profile

Actions

Feature #17609

closed

arvados-client subcommand to run diagnostics on already installed cluster

Added by Peter Amstutz almost 3 years ago. Updated over 2 years ago.

Status:
Resolved
Priority:
Normal
Assigned To:
Category:
Deployment
Target version:
Story points:
-
Release relationship:
Auto

Description

This is the list of tests will do

https://docs.google.com/spreadsheets/d/1--O03eo9-5gQYnP5eBti9a6E6ZYApM_lpnRsYZo9pqM/edit#gid=0https://docs.google.com/spreadsheets/d/1--O03eo9-5gQYnP5eBti9a6E6ZYApM_lpnRsYZo9pqM/edit#gid=0

Then once we have the list will include it to the arvados-client test

  • Run the tests that can be run:
    • If config.yml is available, check that
    • If cypress can be run, run browser-based tests
  • Warn about what can be run / cannot be run
  • put everything into a diagnostics project

Ward's 3 electric rails:

  • uploading through keepproxy
  • running workflows
  • properly configured keep-web
    • uploading via webdav
    • downloading via webdav and s3

Nico's tests:

  • Fetching discovery document / public config
  • Check hostnames, ports, certificates of service ExternalURL are valid
  • Check nginx geo section

Tom's modes:

  • User option to run assuming it is inside (check that things treat you as inside)
  • User option to run assuming it is outside (check that things treat you as outside)

Healthcheck:

  • Use healthcheck endpoints, see if some tests can be part of healthcheck
    • Any check that can be done as a healthcheck, probably should be
  • Needs management token
  • Use healthcheck aggregator
$ arvados-client diagnostics --inside
Checking connectivity to https://api.arvados.example.com ...OK
Checking TLS certificate on https://api.arvados.example.com ...FAIL

Guidelines:

  • run arvados-server check-config as early as possible.
  • verbose mode that communicates as much as possible about what each test is trying to do
  • be very explicit about failures

Subtasks 1 (0 open1 closed)

Task #17731: Review 17609-diagnostics-cmdResolvedTom Clegg06/09/2021Actions

Related issues

Related to Arvados Epics - Idea #16444: Improved error detection/reportingClosed03/01/202204/30/2022Actions
Actions

Also available in: Atom PDF