[Swift-user] queue problem?

Justin M Wozniak wozniak at mcs.anl.gov
Thu May 19 10:40:14 CDT 2011


Yeah, I was about to suggest that the file might not be there.  Let me 
know what you find.

On Thu, 19 May 2011, Sheri Mickelson wrote:

> Hi Mike,
>
> I was originally running 0.92.1, but I got the "mapper.existing() 
> returned a path [3] that it cannot subsequently map" error using Justin's 
> trunk version.
>
> I went back to an older version of swift and I think I might have found 
> what was causing the initial error (an error in one of my csh scripts 
> that had the wrong path in it).  I'm still looking into it and let you 
> know how it goes.
>
> Justin, the path to my working directory is 
> /home/climate1/mickelso/amwg-swift/test-swift.
>
> -Sheri
>
> Michael Wilde wrote:
>> Also, SHeri - are you using Swift 0.92.1?  This looks a bit like the 
>> bug in 0.92 that was fixed in 0.92.1
>> 
>> - Mike
>> 
>> ----- Original Message -----
>>> Is this a SwiftScript that ran successfully on the MCS machines but
>>> fails
>>> on Fusion? If so, can you point me to the working directory for this
>>> run?
>>> Justin
>>> 
>>> On Mon, 16 May 2011, Sheri Mickelson wrote:
>>> 
>>>> I'm seeing a different error now:
>>>> mapper.existing() returned a path [3] that it cannot subsequently
>>>> map
>>>> 
>>>> It starts up, but dies shortly after that. I attached the log file.
>>>> 
>>>> -Sheri
>>>> 
>>>> Justin M Wozniak wrote:
>>>>> That's probably a perms thing, I just reapplied the permissions,
>>>>> please try
>>>>> again.
>>>>> 
>>>>> On Mon, 16 May 2011, Sheri Mickelson wrote:
>>>>> 
>>>>>> Hi Justin,
>>>>>> 
>>>>>> I'm getting this error when swift tries to run:
>>>>>> 
>>>>>> Exception in thread "main" java.lang.NoClassDefFoundError:
>>>>>> org/griphyn/vdl/karajan/Loader
>>>>>> Caused by: java.lang.ClassNotFoundException:
>>>>>> org.griphyn.vdl.karajan.Loader
>>>>>>     at java.net.URLClassLoader$1.run(URLClassLoader.java:202)
>>>>>>     at java.security.AccessController.doPrivileged(Native Method)
>>>>>>     at java.net.URLClassLoader.findClass(URLClassLoader.java:190)
>>>>>>     at java.lang.ClassLoader.loadClass(ClassLoader.java:307)
>>>>>>     at
>>>>>>     sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301)
>>>>>>     at java.lang.ClassLoader.loadClass(ClassLoader.java:248)
>>>>>> Could not find the main class: org.griphyn.vdl.karajan.Loader.
>>>>>> Program
>>>>>> will exit.
>>>>>> 
>>>>>> -Sheri
>>>>>> 
>>>>>> Justin M Wozniak wrote:
>>>>>>> Let's go with my trunk-based installation in the location below
>>>>>>> for now.
>>>>>>> I tried testing this again over the weekend but did not get
>>>>>>> through the
>>>>>>> queue. I have already set up the additional logging in this
>>>>>>> installation.
>>>>>>> 
>>>>>>> /homes/wozniak/Public/cog/modules/swift/dist/swift-svn/bin/swift
>>>>>>>
>>>>>>>     Justin
>>>>>>> 
>>>>>>> On Fri, 13 May 2011, Sheri Mickelson wrote:
>>>>>>> 
>>>>>>>> Here's the log file.
>>>>>>>> This is the first time I'm running this version of swift on
>>>>>>>> fusion. I
>>>>>>>> had done my development work with this swift version on an mcs
>>>>>>>> compute
>>>>>>>> machine.
>>>>>>>> 
>>>>>>>> -Sheri
>>>>>>>> 
>>>>>>>> Justin M Wozniak wrote:
>>>>>>>>> Hello
>>>>>>>>>     Can you send the log for this run?
>>>>>>>>>     Is this a new issue that appeared after an update?
>>>>>>>>>     Also, in any future runs regarding this issue, please add
>>>>>>>>> 
>>>>>>>>> log4j.logger.org.globus.cog.abstraction.coaster.service.job.manager.BlockQueueProcessor
>>>>>>>>> = DEBUG
>>>>>>>>> 
>>>>>>>>> (one line) to your etc/log4j.properties file.
>>>>>>>>>
>>>>>>>>>     Thanks
>>>>>>>>>     Justin
>>>>>>>>> 
>>>>>>>>> On Fri, 13 May 2011, Sheri Mickelson wrote:
>>>>>>>>> 
>>>>>>>>>> I'm running into a problem running swift version 0.92.1 on
>>>>>>>>>> fusion with
>>>>>>>>>> coasters.
>>>>>>>>>> This is the error I'm seeing:
>>>>>>>>>> 
>>>>>>>>>> 
>>>>>>>>>> 
>>>>>>>>>>
>>>>>>>>>> 
>>>>>>>>>> -----------------------------------------------------------------------------
>>>>>>>>>> Progress: Selecting site:168 Submitted:23 Active:2
>>>>>>>>>> Progress: Selecting site:168 Submitted:23 Active:1 Checking
>>>>>>>>>> status:1
>>>>>>>>>> Progress: Selecting site:167 Stage in:1 Submitted:22 Active:2
>>>>>>>>>> Finished successfully:1
>>>>>>>>>> queuedsize > 0 but no job dequeued. Queued: {}
>>>>>>>>>> java.lang.Throwable
>>>>>>>>>>     at
>>>>>>>>>> org.globus.cog.abstraction.coaster.service.job.manager.BlockQueueProcessor.requeueNonFitting(BlockQueueProcessor.java:252)
>>>>>>>>>>     at
>>>>>>>>>> org.globus.cog.abstraction.coaster.service.job.manager.BlockQueueProcessor.updatePlan(BlockQueueProcessor.java:520)
>>>>>>>>>>     at
>>>>>>>>>> org.globus.cog.abstraction.coaster.service.job.manager.BlockQueueProcessor.run(BlockQueueProcessor.java:109)
>>>>>>>>>> queuedsize > 0 but no job dequeued. Queued: {}
>>>>>>>>>> java.lang.Throwable
>>>>>>>>>>     at
>>>>>>>>>> org.globus.cog.abstraction.coaster.service.job.manager.BlockQueueProcessor.requeueNonFitting(BlockQueueProcessor.java:252)
>>>>>>>>>>     at
>>>>>>>>>> org.globus.cog.abstraction.coaster.service.job.manager.BlockQueueProcessor.updatePlan(BlockQueueProcessor.java:520)
>>>>>>>>>>     at
>>>>>>>>>> org.globus.cog.abstraction.coaster.service.job.manager.BlockQueueProcessor.run(BlockQueueProcessor.java:109)
>>>>>>>>>> Shutting down worker
>>>>>>>>>> 
>>>>>>>>>> Shutting down worker
>>>>>>>>>> 
>>>>>>>>>> Shutting down worker
>>>>>>>>>> 
>>>>>>>>>> 
>>>>>>>>>>
>>>>>>>>>> 
>>>>>>>>>> -----------------------------------------------------------------------------
>>>>>>>>>> And here's my sites file:
>>>>>>>>>> 
>>>>>>>>>> 
>>>>>>>>>> 
>>>>>>>>>>
>>>>>>>>>> 
>>>>>>>>>> -----------------------------------------------------------------------------
>>>>>>>>>> <config>
>>>>>>>>>> <pool handle="fusion">
>>>>>>>>>>  <execution jobmanager="local:pbs" provider="coaster"
>>>>>>>>>>  url="none"/>
>>>>>>>>>>  <profile namespace="globus" key="maxtime">3600</profile>
>>>>>>>>>>  <profile namespace="globus" key="workersPerNode">1</profile>
>>>>>>>>>>  <profile namespace="globus" key="slots">1</profile>
>>>>>>>>>>  <profile namespace="globus" key="nodeGranularity">4</profile>
>>>>>>>>>>  <profile namespace="globus" key="maxNodes">2</profile>
>>>>>>>>>>  <profile namespace="globus" key="queue">batch</profile>
>>>>>>>>>>  <profile namespace="karajan" key="jobThrottle">0.23</profile>
>>>>>>>>>>  <profile namespace="karajan"
>>>>>>>>>>  key="initialScore">10000</profile>
>>>>>>>>>>  <profile namespace="globus" key="project">parvis</profile>
>>>>>>>>>>  <profile namespace="globus"
>>>>>>>>>>  key="lowOverAllocation">100</profile>
>>>>>>>>>>  <profile namespace="globus"
>>>>>>>>>>  key="highOverAllocation">100</profile>
>>>>>>>>>>  <filesystem provider="local"/>
>>>>>>>>>>  <workdirectory>/fusion/gpfs/home/mickelso/amwg-swift/swift/</workdirectory>
>>>>>>>>>> </pool>
>>>>>>>>>> </config>
>>>>>>>>>> 
>>>>>>>>>> 
>>>>>>>>>>
>>>>>>>>>> 
>>>>>>>>>> -----------------------------------------------------------------------------
>>>>>>>>>> Do you know what might be causing this?
>>>>>>>>>> 
>>>>>>>>>> Thanks, Sheri
>>>>>>>>>> _______________________________________________
>>>>>>>>>> Swift-user mailing list
>>>>>>>>>> Swift-user at ci.uchicago.edu
>>>>>>>>>> http://mail.ci.uchicago.edu/mailman/listinfo/swift-user
>>>>>>>>>> 
>>> --
>>> Justin M Wozniak
>>> _______________________________________________
>>> Swift-user mailing list
>>> Swift-user at ci.uchicago.edu
>>> http://mail.ci.uchicago.edu/mailman/listinfo/swift-user
>> 
>

-- 
Justin M Wozniak



More information about the Swift-user mailing list