[Swift-user] coaster workflow hangs then dies

Mihael Hategan hategan at mcs.anl.gov
Wed Mar 10 18:11:30 CST 2010


Never mind that. You didn't.

I think what happens is that the service is wrongly treated as idle
while file transfers are in progress. It eventually reaches its maximum
allowed idle time and shuts down, even though work is still pending.
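
To make the suspected failure mode concrete, here is a minimal sketch of
an idle timer that either ignores or counts in-flight file transfers.
It is an illustration only, not the coaster service code: the class
IdleTracker, the simulate() helper, the 120-second idle limit and the
300-second transfer are all hypothetical.

```python
class IdleTracker:
    """Hypothetical idle tracker; NOT the actual coaster service code."""

    def __init__(self, max_idle, count_transfers):
        self.max_idle = max_idle                # assumed idle limit, in seconds
        self.count_transfers = count_transfers  # False reproduces the suspected bug
        self.active_jobs = 0
        self.active_transfers = 0
        self.idle_since = 0.0

    def is_active(self):
        # Running jobs always count as activity.
        if self.active_jobs > 0:
            return True
        # Transfers count only when the tracker is told to look at them.
        return self.count_transfers and self.active_transfers > 0

    def tick(self, now):
        """Return True if the service would shut itself down at time `now`."""
        if self.is_active():
            self.idle_since = now
            return False
        return now - self.idle_since >= self.max_idle


def simulate(count_transfers, max_idle=120.0, transfer_seconds=300):
    """Run one long stage-in with no jobs running; return the shutdown time or None."""
    tracker = IdleTracker(max_idle, count_transfers)
    tracker.active_transfers = 1          # a single file transfer is in flight
    for now in range(0, transfer_seconds + 1, 10):
        if tracker.tick(float(now)):
            return now
    return None
```

With count_transfers=False the simulated service gives up partway
through the transfer; with count_transfers=True it stays up for the
whole transfer. Any real fix would of course have to hook into
wherever the service actually tracks its activity.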

I'll try to put in a fix for this.

Thanks for uncovering this.

Mihael

On Wed, 2010-03-10 at 18:07 -0600, Mihael Hategan wrote:
> Did you at some point during the run press CTRL+C or otherwise interrupt
> the swift process?
> 
> On Wed, 2010-03-10 at 13:24 -0600, Neil Best wrote:
> > Please take a look at
> > login.ci:~nbest/bigdata/files/mcd12q1-20100310-1227-6gy82bq0.stdout
> > and associated files.  There are some exceptions in the middle of the
> > .stdout, and then it appears to fail at the end:
> > 
> > Progress:  Selecting site:3519  Stage in:70  Stage out:26  Finished successfully:189
> > Progress:  Selecting site:3519  Stage in:69  Submitting:1  Stage out:26  Finished successfully:189
> > Progress:  Selecting site:3519  Stage in:67  Active:2  Checking status:1  Stage out:26  Finished successfully:189
> > Cleaning up...
> > Shutdown failed after 5 minutes. Forcefully shutting down
> > Progress:Progress:Shutting down service at https://192.5.86.6:51333
> >    Selecting site:3519  Selecting site:3519  Stage in:60  Stage in:61   
> > Submitting:3Got channel MetaChannel: 1626091176 -> null  Submitting:2
> >    Checking status:5  Stage out:28  Checking status:5  Stage out:28   
> > Finished successfully:189
> >    Finished successfully:189
> > Progress:  Selecting site:3519  Stage in:61  Submitting:2  Active:1  Checking status:4  Stage out:28  Finished successfully:189
> > Progress:  Selecting site:3519  Stage in:61  Submitting:2  Active:2  Checking status:4  Stage out:27  Finished successfully:189
> > Progress:  Selecting site:3519  Stage in:61  Submitting:3  Active:2  Checking status:3  Stage out:27  Finished successfully:189
> > Progress:  Selecting site:3519  Stage in:65  Submitting:1  Checking status:3  Stage out:27  Finished successfully:189
> > Progress:  Selecting site:3519  Stage in:66  Checking status:3  Stage out:27  Finished successfully:189
> > Progress:  Selecting site:3519  Stage in:67  Checking status:2  Stage out:27  Finished successfully:189
> > Progress:  Selecting site:3519  Stage in:50  Submitting:13  Checking status:5  Stage out:28  Finished successfully:189
> > + Done
> > 
> > 
> > Can anyone tell me what might be the cause of this?
> > _______________________________________________
> > Swift-user mailing list
> > Swift-user at ci.uchicago.edu
> > http://mail.ci.uchicago.edu/mailman/listinfo/swift-user
