[Swift-user] Swift crash
William Catino
wcatino at gmail.com
Tue May 6 18:20:31 CDT 2014
Thanks, Mike.
On May 6, 2014 6:05 PM, "Michael Wilde" <wilde at anl.gov> wrote:
> Bill, since you are running on Open Science Grid with no specific Condor
> requirements to select specific sites or node types, it looks like some
> sites that your app tasks are running at do not have a Python environment
> that works with your wrapper script.
>
> This might have something to do with that way you have set up or are using
> virtualenv.
>
> You might want to start by adding requirements tags to specify that your
> tasks should only run on the UC3 cycle-seeder nodes, make sure everything
> works correctly there, and then add tags for one additional pool of OSG
> Connect resources at a time. You can find info on this in the OSG Connect
> Book.
>
> You can also log the site name in your returned .err or .out file, via
> your wrapper, which will help a lot in debugging.
>
> Then you can force tasks to the *bad* sites to debug and fix your python
> environment for those sites.
>
> I can help more on this after Friday. Perhaps others on the Swift team or
> this list can also provide assistance.
>
> - Mike
>
>
> On 5/6/14, 5:53 PM, William Catino wrote:
>
>> I have a script that crashed 3 times, then succeeded, with no change.
>> When it crashed, the .err file in the data directory contained a message
>> about not finding a file whose name included
>>
>> /usr/lib64/libpython2.6.so.1.0
>>
>> The attached log file that corresponds to a crash is:
>> df-20140506-1724-d6au5ct7.log
>> The attached log file that corresponds to a subsequent success is:
>> df-20140506-1727-39v1is9b.log
>>
>> I might have specific files confused, but these files (and others from
>> prior crashes) are located in my home directory on OSG, at
>> /home/wcatino/df.
>> This configuration is identical to that used when working with Mike today,
>> except that the data directory was reduced to 5 files.
>>
>> I also noticed that my PATH is pointing swift to version 0.94 rather than
>> 0.95.
>> Perhaps I should change my PATH to fix this.
>> The instructions I followed had me append the 0.95 path to PATH, which
>> contains a 0.94 path earlier in the PATH string.
>>
>> Thanks.
>>
>
> --
> Michael Wilde
> Mathematics and Computer Science Computation Institute
> Argonne National Laboratory The University of Chicago
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/swift-user/attachments/20140506/e2068b87/attachment.html>
More information about the Swift-user
mailing list