Project

General

Profile

Feature #12746

Updated by Ward Vandewege over 6 years ago

It turns out that the missing net:eth0 stats are due to an architectural change between crunch1 and crunch2.  

 In crunch1 these stats were useful for monitoring the I/O bandwidth during the upload phase when arv-put was run in the container to do the upload. 
 In crunch2, the uploads are done outside the container by the new crunch2 crunch-run component which doesn't log any stats during the upload 

 Crunch2 needs to provide equivalent stats to Crunch1 during the upload phase. The network bandwidth stats are the most important, but memory and CPU stats would be useful as well. 

 It would also be useful to have these stats during the Docker image download phase at the beginning of the job. 

 Another useful piece of information which is missing is the node type and cost which was logged in Crunch1. This should be added to node-info.txt (or any other convenient location in the logs). 

Back