Bug #4956
Status: closed
[API] cannot create collections with very large manifests
Story points: 1.0
Description
From #4621:
A job generated a manifest consisting of a single 28 MB line of text, and its output collection could not be saved.
The pipeline log for failed job qr1hi-8i9sb-t6qzh3f0vkybftx says:
2015-01-08_21:59:08 qr1hi-8i9sb-t6qzh3f0vkybftx 15570 57 success in 1223 seconds
2015-01-08_21:59:08 qr1hi-8i9sb-t6qzh3f0vkybftx 15570 57 output ecf907baa43ecdcc5b74d29b2d1d593f+28007813+A42020335df3e977930d81cc7f2070b1ab35e050b@54c172a9
2015-01-08_21:59:08 qr1hi-8i9sb-t6qzh3f0vkybftx 15570 wait for last 0 children to finish
2015-01-08_21:59:08 qr1hi-8i9sb-t6qzh3f0vkybftx 15570 status: 62 done, 0 running, 0 todo
2015-01-08_21:59:08 qr1hi-8i9sb-t6qzh3f0vkybftx 15570 release job allocation
2015-01-08_21:59:08 qr1hi-8i9sb-t6qzh3f0vkybftx 15570 Freeze not implemented
2015-01-08_21:59:08 qr1hi-8i9sb-t6qzh3f0vkybftx 15570 collate
2015-01-08_22:00:29 qr1hi-8i9sb-t6qzh3f0vkybftx 15570 Failed to write output collection
2015-01-08_22:00:29 qr1hi-8i9sb-t6qzh3f0vkybftx 15570 finish
The crunch-dispatch logs include some more context:
2015-01-08_21:59:08.90514 qr1hi-8i9sb-t6qzh3f0vkybftx 15570 Freeze not implemented
2015-01-08_21:59:08.90515 qr1hi-8i9sb-t6qzh3f0vkybftx 15570 collate
2015-01-08_21:59:10.18634 dispatch: update compute14 state to {:state=>"idle", :job=>nil}
2015-01-08_22:00:29.67611 qr1hi-8i9sb-t6qzh3f0vkybftx ! Traceback (most recent call last):
2015-01-08_22:00:29.67620 qr1hi-8i9sb-t6qzh3f0vkybftx ! File "<string>", line 1, in <module>
2015-01-08_22:00:29.67627 qr1hi-8i9sb-t6qzh3f0vkybftx ! File "/usr/local/lib/python2.7/dist-packages/oauth2client/util.py", line 132, in positional_wrapper
2015-01-08_22:00:29.91142 qr1hi-8i9sb-t6qzh3f0vkybftx ! return wrapped(*args, **kwargs)
2015-01-08_22:00:29.91149 qr1hi-8i9sb-t6qzh3f0vkybftx ! File "/usr/local/lib/python2.7/dist-packages/apiclient/http.py", line 723, in execute
2015-01-08_22:00:29.91154 qr1hi-8i9sb-t6qzh3f0vkybftx ! raise HttpError(resp, content, uri=self.uri)
2015-01-08_22:00:29.91158 qr1hi-8i9sb-t6qzh3f0vkybftx ! arvados.errors.ApiError: <HttpError 500 when requesting https://qr1hi.arvadosapi.com/arvados/v1/collections?alt=json returned "Internal Server Error">
2015-01-08_22:00:29.94333 qr1hi-8i9sb-t6qzh3f0vkybftx 15570 Failed to write output collection
2015-01-08_22:00:29.94346 qr1hi-8i9sb-t6qzh3f0vkybftx 15570 finish
However, the API server production logs do not have any record of a failed attempt to create this collection.
See if we can reliably reproduce this and determine whether it represents a flaw in the API server, a simple timeout, or something else.
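One possible starting point for a reproduction, offered as a sketch rather than a confirmed recipe: build a synthetic single-line manifest of roughly the same size and time the create call. The helper names `synthetic_manifest` and `try_create`, the filename `big.dat`, and the fabricated block locators are all hypothetical. A cluster that enforces permission signatures on locators may reject the fabricated ones for unrelated reasons, in which case real signed locators would be needed. In the logs above, about 80 seconds pass between "collate" and the 500 response, so the elapsed time is worth recording.

```python
import hashlib
import time

BLOCK_SIZE = 64 * 1024 * 1024  # assumed size claimed for each fake block


def synthetic_manifest(target_bytes, filename="big.dat"):
    """Return a one-line manifest whose text is at least target_bytes long.

    The locators are fabricated (md5 of a counter), so they do not refer
    to real Keep blocks; they only pad the manifest to the wanted size.
    """
    tokens = ["."]          # stream name
    length = 1              # length of the joined text so far
    n = 0
    while n == 0 or length < target_bytes:
        digest = hashlib.md5(str(n).encode()).hexdigest()
        locator = "%s+%d" % (digest, BLOCK_SIZE)
        tokens.append(locator)
        length += len(locator) + 1   # +1 for the separating space
        n += 1
    # One file spanning all of the fake blocks.
    tokens.append("0:%d:%s" % (n * BLOCK_SIZE, filename))
    return " ".join(tokens) + "\n"


def try_create(manifest_text):
    """Attempt to create a collection and report the outcome and duration.

    Assumes the arvados Python SDK is installed and that ARVADOS_API_HOST
    and ARVADOS_API_TOKEN are set in the environment.
    """
    import arvados  # imported here so the generator works without the SDK

    api = arvados.api("v1")
    start = time.time()
    try:
        result = api.collections().create(
            body={"manifest_text": manifest_text}).execute()
        return ("ok", result.get("uuid"), time.time() - start)
    except Exception as error:  # record whatever the client raises
        return ("error", repr(error), time.time() - start)
```

Calling `try_create(synthetic_manifest(28007813))` would approximate the manifest size reported in the job output; stepping the size up from a few megabytes might show whether failures begin at a size threshold or after a fixed elapsed time.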