Bug #4334

Updated by Ward Vandewege over 9 years ago

In SLURM, "state*" means "the node was last known to be in state, but I haven't heard from it in a while." Currently, crunch-dispatch ignores the star. However, a node in the state "idle*" has usually crashed recently and is probably not usable. crunch-dispatch should not schedule work on nodes in this state.

The quick and easy implementation for this story is probably to change 'idle*' to 'down' in our database, instead of lopping off the * and treating it the same as 'idle'. There is no need to worry about other * states: we only schedule onto idle nodes, so it's ok if 'down*' gets translated to 'down'. That's the safest path.
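
The mapping proposed above could look roughly like the following. This is only an illustrative sketch in Python, not crunch-dispatch's actual code; the function name `normalize_slurm_state` is hypothetical, and it assumes the state arrives as a plain string such as 'idle', 'idle*' or 'down*'.

```python
def normalize_slurm_state(raw_state: str) -> str:
    """Map a SLURM node state string to the value stored in the database.

    Hypothetical helper: a trailing '*' means the node has not responded
    for a while.
    """
    state = raw_state.strip()
    if state.endswith("*"):
        base = state[:-1]
        # Assumption from the ticket: we only schedule onto 'idle' nodes,
        # so 'idle*' is the only starred state that needs special handling.
        # It is recorded as 'down'; every other starred state just loses
        # the star ('down*' becomes 'down', 'alloc*' becomes 'alloc').
        return "down" if base == "idle" else base
    return state
```

With this mapping, a node reported as 'idle*' is never stored as 'idle', so the scheduler never picks it, while responsive 'idle' nodes are unaffected.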
