[Swift-user] consultation about error messages, coaster usage

Michael Wilde wilde at mcs.anl.gov
Mon Apr 6 23:15:36 CDT 2009


OK, will do. I think the fix you applied at 5PM enables us to go back to 
the latest rev. This morning we updated, then reverted back to Tuesday 3/31.

On 4/6/09 10:08 PM, Mihael Hategan wrote:
> You seem to be using a particularly bad version of swift. I suggest
> trying the latest version.
> 
> Mihael
> 
> On Mon, 2009-04-06 at 21:42 -0500, Glen Hocky wrote:
>> Hi Guys,
>> I just ran (and killed) two big runs w/ swift, one on ranger, one on 
>> abe. I stopped them because in each case there were many "Failed but can 
>> retry" jobs, several "Failed to transfer wrapper log" errors and, at the 
>> point where I stopped them, many more CPUs allocated than "Active" 
>> jobs. E.g. on ranger there were 14 running jobs in the queue w/ over an 
>> hour left (so 224 CPUs) but only 76 "Active" jobs.
>>
>> Could someone take a look at the logs and tell me if things are working 
>> properly? It's a little hard to tell from a user end...
>> On a CI home machine, all run-related files for abe are in
>>> /home/hockyg/oops/swift/output/abeoutdir.5/
>> and for ranger
>>
>>> /home/hockyg/oops/swift/output/rangeroutdir.5/
>> In those directories, there will be a file $site.out.5, which has the stdout,
>> and xout.XXXXX, which has a log of all the commands run, including the 
>> swift invocation.
>> The tc.data file used is $site.data and the sites.xml file is $site.xml.
>>
>> Thanks,
>> Glen
>>
>> _______________________________________________
>> Swift-user mailing list
>> Swift-user at ci.uchicago.edu
>> http://mail.ci.uchicago.edu/mailman/listinfo/swift-user


