Project

General

Profile

Actions

Bug #4956

closed

[API] cannot create collections with very large manifests

Added by Tim Pierce over 9 years ago. Updated about 9 years ago.

Status:
Resolved
Priority:
Normal
Assigned To:
Category:
API
Target version:
Story points:
1.0

Description

From #4621:

A job generating a manifest consisting of one 28MB line of text could not be saved.

The pipeline log for failed job qr1hi-8i9sb-t6qzh3f0vkybftx says:

2015-01-08_21:59:08 qr1hi-8i9sb-t6qzh3f0vkybftx 15570 57 success in 1223 seconds
2015-01-08_21:59:08 qr1hi-8i9sb-t6qzh3f0vkybftx 15570 57 output ecf907baa43ecdcc5b74d29b2d1d593f+28007813+A42020335df3e977930d81cc7f2070b1ab35e050b@54c172a9
2015-01-08_21:59:08 qr1hi-8i9sb-t6qzh3f0vkybftx 15570  wait for last 0 children to finish
2015-01-08_21:59:08 qr1hi-8i9sb-t6qzh3f0vkybftx 15570  status: 62 done, 0 running, 0 todo
2015-01-08_21:59:08 qr1hi-8i9sb-t6qzh3f0vkybftx 15570  release job allocation
2015-01-08_21:59:08 qr1hi-8i9sb-t6qzh3f0vkybftx 15570  Freeze not implemented
2015-01-08_21:59:08 qr1hi-8i9sb-t6qzh3f0vkybftx 15570  collate
2015-01-08_22:00:29 qr1hi-8i9sb-t6qzh3f0vkybftx 15570  Failed to write output collection
2015-01-08_22:00:29 qr1hi-8i9sb-t6qzh3f0vkybftx 15570  finish

The crunch-dispatch logs include some more context:

2015-01-08_21:59:08.90514 qr1hi-8i9sb-t6qzh3f0vkybftx 15570  Freeze not implemented
2015-01-08_21:59:08.90515 qr1hi-8i9sb-t6qzh3f0vkybftx 15570  collate
2015-01-08_21:59:10.18634 dispatch: update compute14 state to {:state=>"idle", :job=>nil}
2015-01-08_22:00:29.67611 qr1hi-8i9sb-t6qzh3f0vkybftx ! Traceback (most recent call last):
2015-01-08_22:00:29.67620 qr1hi-8i9sb-t6qzh3f0vkybftx !   File "<string>", line 1, in <module>
2015-01-08_22:00:29.67627 qr1hi-8i9sb-t6qzh3f0vkybftx !   File "/usr/local/lib/python2.7/dist-packages/oauth2client/util.py", line 132, in positional_wrapper
2015-01-08_22:00:29.91142 qr1hi-8i9sb-t6qzh3f0vkybftx !     return wrapped(*args, **kwargs)
2015-01-08_22:00:29.91149 qr1hi-8i9sb-t6qzh3f0vkybftx !   File "/usr/local/lib/python2.7/dist-packages/apiclient/http.py", line 723, in execute
2015-01-08_22:00:29.91154 qr1hi-8i9sb-t6qzh3f0vkybftx !     raise HttpError(resp, content, uri=self.uri)
2015-01-08_22:00:29.91158 qr1hi-8i9sb-t6qzh3f0vkybftx ! arvados.errors.ApiError: <HttpError 500 when requesting https://qr1hi.arvadosapi.com/arvados/v1/collections?alt=json returned "Internal Server Error">
2015-01-08_22:00:29.94333 qr1hi-8i9sb-t6qzh3f0vkybftx 15570  Failed to write output collection
2015-01-08_22:00:29.94346 qr1hi-8i9sb-t6qzh3f0vkybftx 15570  finish

However, the API server production logs do not have any record of a failed attempt to create this collection.

See if we can reliably reproduce this and determine whether it represents a flaw in the API server, a simple timeout, or something else.


Subtasks 6 (0 open6 closed)

Task #5299: Determine causeResolvedPeter Amstutz01/09/2015Actions
Task #5102: Try to reproduceResolvedPeter Amstutz01/09/2015Actions
Task #5358: In Python SDK, check each request's size against discovered limitResolvedPeter Amstutz01/09/2015Actions
Task #5357: Advertise configurable request size limit in discovery doc (default 128MiB)ResolvedPeter Amstutz01/09/2015Actions
Task #5366: Review 4956-limit-request-sizeResolvedTom Clegg01/09/2015Actions
Task #5308: Decide on explicit, documented limits on maximum manifest size, maximum API server request size.Resolved01/09/2015Actions
Actions

Also available in: Atom PDF