[Swift-devel] Re: [alcf-support #60887] Can Cobalt command-line bug on Eureka be fixed?

Andrew Cherry acherry at alcf.anl.gov
Tue Jan 11 17:19:25 CST 2011


Mike-

My initial reaction is that a fix would probably not be doable in the  
next few days, since it would almost certainly require scheduling  
downtime to bring Cobalt down, apply the fix, test, and restart.  But  
I'll ping the Cobalt folks to find out how feasible this would be.  My  
recollections from my previous investigation is that it would require  
changes to the cluster_system component as well as the launcher, so a  
shutdown wouldn't be avoidable.

-Andrew

On Jan 11, 2011, at 4:48 PM, Michael Wilde wrote:

> Hi ALCF Team,
>
> The following known issue in Cobalt is currently preventing us from  
> running Swift on Eureka:
>
>  http://trac.mcs.anl.gov/projects/cobalt/ticket/462
>
> With some additional development effort we can work around this, but  
> it would be much cleaner and better if this were fixed in Cobalt,  
> instead, as suggested in ticket 462 above.
>
> Is there any chance that can be done in the next few days?
> If not, please let me know, and we will implement the work-around  
> instead.
>
> This is holding up work on the DOE ParVis project (Rob Jacob, PI)  
> and we've had to move some work we want to run on Eureka to other  
> platforms in the meantime.
>
> Thanks very much,
>
> Mike
>
> 462 is:
>
> Ticket #462 (new defect)
> Opened 7 months ago
> Cobalt on clusters ignores job script arguments
>
> Reported by:	acherry	
> Priority:	 major	
> Component:	 clients
> 	
> Description
>
> It appears that cobalt-launcher.py does not support running a job  
> script or executable with command arguments, even though qsub will  
> accept the arguments, and the man page and help for qsub indicates  
> that arguments are accepted.
>
> I'm filing this as a bug rather than a feature request, since the  
> behavior isn't consistent with the documentation. But I'd rather the  
> fix for this to be adding support for args, rather than changing the  
> docs to say they aren't accepted. :-)

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/swift-devel/attachments/20110111/1ea82da6/attachment.html>


More information about the Swift-devel mailing list