To Be Groomed

Due in over 9 years (01/01/2031) (Sprint start date 12/31/2029)

A fake sprint to hold stories to be groomed


148 issues   (5 closed143 open)

To Be Groomed

Arvados Contributor Wiki

This wiki has information related to the open source project. It's a work in progress, so come back often for updates and feel free to become a contributor.

Platform Overview

Release Notes

Please see our blog

Key Components

Using Arvados

Contributing and hacking



Issues by










Related issues
# Subject Story Points
Arvados Workbench 2 - Feature #17518 Workbench2 lets users auto-login and access dialogs through direct links
Feature #17490 [workbench2] make filter groups creatable/editable
Bug #17428 [deployment] arvados-api-server fails to re-install/upgrade if the postgresql-client and the server versions differ
Bug #17413 [arv-mount] Exit (instead of undefined behavior) after unhandled exception
Feature #17391 [keepstore] expose the volume storage classes
Feature #17388 [arv-copy] Storage classes revisit
Bug #17385 [arvados-dispatch-cloud][crunch-run] CrunchRunArgumentsList parameter is not honored
Bug #17358 [wb2] mis-placed label?
Bug #17357 [wb2] ssh keys for admin user list all of them (in the user tab)
Feature #17350 [keep-balance] Expected behaviour with different "replicas_desired" and "storage_classes_desired" values
Feature #17349 [Keep API] Revisit "replicas_desired" and "storage_classes_desired".
Bug #17314 [API] [controller] [Workbench] 404 on foreign user UUID in admin interface
Feature #17298 remove the need to run get_anonymous_user_token.rb during installation
Feature #17217 [controller] move blob signature calculation from api to controller
Feature #16843 [a-d-c] admin cli
Feature #16802 [packaging] create an arvados-shell-node package
Feature #16738 [a-d-c] introduce concept of "quota partition"
Bug #16727 [FUSE] [cgofuse] Refresh signatures / reload collection instead of using expired blob signatures
Bug #16566 [cli] can't create a token when using a token created with create_superuser_token.rb
Story #8181 [Keepstore] Support a portable data challenge API
Bug #16426 [arv-put] should calculate (worst case) manifest size up front before upload, and abort when it is too large
Story #15925 Separate arvados-git-sync from API server
Story #15909 Systematic review of documentation gaps
Story #15783 Add Arvados services go docs
Feature #15758 [keep-balance] [keepstore] Report blocks/bytes in trash
Story #15697 [doc] explain lifecycle of Keep blocks, and how it affects storage backend usage/cost
Bug #15692 [CWL] test_with_arvbox integration tests use tests inside image
Bug #15579 Staging a large number of files with "loadListing: no_listing" still takes more than 30 mins
Story #15580 [CWL] Register workflow and run from git repo
Story #15540 Paginate crunchstat-summary HTML output
Story #15436 can run offline
Story #15382 Synthetic database generator tool
Story #15381 can populate/use local package cache directories
Story #15339 ExpressionTool can be explicitly run under a CR
Story #15364 Better CWL onboarding
Feature #15320 [API] add integration test for new user signup
Bug #15183 [Workbench2] No errors reported when file is missing a block
Feature #15204 Federation diagnostic tools/APIs
Feature #15279 [cwl] expressions in keep_cache
Feature #15249 [cwl] a-c-r supports --js-console
Bug #15118 [keepstore] Return 5xx (not 4xx) if block is not found due to transient backend device failure
Feature #15135 [Crunch2] Post-run hooks
Story #15259 Invalidate tokens federation-wide on when invalidated on home cluster
Feature #15125 [keep-balance] [keepstore] Procedure to halt/reverse/investigate a suspected data loss incident
Story #9046 [Crunch2] Can issue container request to pull and import Docker images
Feature #15082 [controller] [all services] Publish entire config including secrets
Story #15070 Update search API to support OR queries across text and vocabulary IDs
Story #15015 replace epydoc
Story #14923 [CWL] Reduce memory footprint by using webdav to get file/dir listings in collections (instead of Python SDK)
Feature #15025 [arvados-dispatch-cloud] GCE driver (Google Compute Engine)
Support #14906 [SDK] Maintain Arvados SDKs in Bioconda
Feature #13557 Copying a container request should link to the original container request
Feature #13643 [CWL] Requirement for storage classes of data
Story #14997 [crunchstat-summary] Support multiple data series per graph (e.g. TX/RX, user/sys CPU, etc)
Bug #14650 [arvados-cwl-runner] Inaccessible collection causes long traceback
Bug #14488 [keepstore] Do not try to trash blocks that were already trashed in previous sweep
Feature #14457 [api] send e-mail notification to user when their arvados account is activated
Bug #14414 [keepstore] ManagementToken is not documented in the --help output
Bug #14405 [API] Avoid long-running transactions when sweeping trashed objects
Feature #14257 [controller] refresh manifest API
Feature #14200 [API] Reduce privilege exposure via API tokens in multi-cluster workflows
Feature #11442 [CWL] Resolve symbolic names to collections
Feature #14168 [Running a workflow] Input presets
Feature #14030 [CWL] Use streamable flag as a hint to use keep output
Feature #14021 [crunch-dispatch-slurm] option to set job priority directly instead of using nice values
Feature #14018 [CWL] arvados-cwl-runner should support tagging container requests using properties
Feature #14016 [API] Container request can provide existing collection UUID that will accept CR output
Story #14004 [Controller] Keep database in tune
Story #14001 [Spike] [Controller] Port "update workflow" API to controller
Bug #13998 [keepstore] Cached block fetch for federation
Story #13997 [Controller] Port "collection update" API from Rails
Story #13995 [Controller] Port "collection update" functional tests from Rails API to controller test suite
Bug #13991 crunch-dispatch-slurm does not warn when slurm MaxJobCount reached
Bug #13982 crunch-dispatch-slurm / Go SDK Dispatcher not able to "let go" of deleted containers
Bug #13981 Make keep-web documentation available on
Bug #13972 Listing collections by PDH and name can be very slow
Bug #13960 Workbench takes new users to inexistant tutorial
Story #13925 Default keep cache scales with machine size
Bug #13916 Discovery document inconsistencies
Story #13794 Build Docker images without requiring Docker on client
Bug #13763 [API] groups#contents response should obey caller-provided order (not sort on type first)
Bug #13956 [keep-web] (webdav) Support COPY and MOVE between collections
Story #13384 [Workbench] Search users by email address and shell account
Story #13760 Provide more information to SLURM to make scheduling decisions on HPC
Feature #13675 [Composer/Workbench] Import of public git repo to a new Arvados repository
Bug #13644 [STORAGE-CLASES] Report the storage classes used by a pipeline
Bug #13636 crunch-run takes a very long time for CWL steps with large numbers of File inputs - could use a new kind of mounts entry to address this
Feature #6865 [Documentation] Higher Level Python SDK Reference Page
Bug #13418 Python SDK must have docstrings for all public interfaces
Bug #12684 [Python SDK] Retry on HTTP 5xx errors
Bug #13607 [arv-mount] avoid excessive memory consumption of idle arv-mounts
Story #13390 Document API in API discovery document
Feature #13389 [Epic] Easy cluster install
Feature #13388 [Epic] Zero-downtime upgrade
Story #12666 --ignore-docker-for-reuse not passed to crunch v2 jobs started via --submit --no-wait
Story #13326 As a data provider, I would like the ability to provide metered access to my data on Arvados, including start/end dates, bandwidth caps, and % of file accessed
Story #13325 As a CWL learner, I would like to be able to run CWL workflows easily on playground
Story #13110 [WebDAV] WebDAV performance enhancements
Story #9653 [keepstore] Limit entire process to configured amount or % of system memory (replace -max-buffers=X)
Bug #13162 [SDKs] Remove arvadosclient and keepclient libraries
Bug #13144 [SDKs] Wean all components off old arvadosclient and keepclient libraries
Bug #13102 containers are not reused unless runtime constraints (including RAM) match exactly
Bug #13094 Fix misleading crunch file upload messages
Feature #13062 [SDK] Reduce collection class memory footprint
Bug #13005 [Crunch2] All stdout gets the same timestamp and other logging problems
Bug #11625 [Workbench] Ensure that running processes are visible on the dashboard
Feature #13126 [keep] Investigate using signed URLs to delegate access to cloud buckets
Bug #12854 [crunchstat-summary] broken option for skip child jobs?
Bug #12791 [API] fix race between arrival of trash time and next sweep
Feature #12692 [keep-web] Accept /t=token/c=collection/path (currently only /c=collection/t=token/path)
Feature #12630 [Crunch2] GPU support
Story #12571 arv-mount by_tag directory support for properties
Story #12527 [Composer] Refactor to separate vendor-specific pieces and all them to be replaced
Bug #12178 Activating an account (setting is_active from false to true) does not email the new user
Story #11770 [Python SDK] Implement support for universal newline mode in Collections API
Story #11724 Pull Docker images without requiring Docker on client
Feature #11183 [Deployment] Use consul to report health of services/components
Feature #10767 [API] [arvados-ws] remove all websockets and puma related code from the api server codebase
Bug #10390 crunchstat-summary should explain why "(no report generated)" when possible
Bug #10298 [Crunch2] [API] Accept collection UUID or PDH as container_image in a container request
Story #9425 [API?/Workbench] Search for a link tag value returns linked objects
Story #8573 crunchstat-summary should calculate averages of all metrics
Feature #9136 [Python SDK] Add a Collection method to efficiently concatenate files
Story #6647 Choose an option to make better service discoveries in deployments
Bug #12198 Copying a project with a docker image doesn't copy the docker metadata
Feature #5214 [SDKs] Use sphinx to render for Python SDK docs
Feature #16462 Expand arvados-controller to expose forecast features