[Swift-user] Montage wrapper error

Mihael Hategan hategan at mcs.anl.gov
Tue Aug 3 22:55:01 CDT 2010


<filesystem provider="coaster" url="ssh://tp-login2.ci.uchicago.edu" />

On Tue, 2010-08-03 at 22:49 -0500, Jonathan Monette wrote:
> to use the coaster filesystem i should use local:pbs in the jobmanager?
> 
> On 8/3/10 10:44 PM, Mihael Hategan wrote:
> > Ok. Maybe not. Looks like an SSH issue. Can you try the coaster fs
> > provider instead?
> >
> > On Tue, 2010-08-03 at 22:41 -0500, Jonathan Monette wrote:
> >    
> >> <pool handle="localhost">
> >> <filesystem provider="local" />
> >> <execution provider="local" />
> >> <workdirectory>/home/jonmon/Library/Swift/work/localhost</workdirectory>
> >> <profile namespace="karajan" key="jobThrottle">.05</profile>
> >> </pool>
> >>
> >> <pool handle="teraport">
> >> <execution provider="coaster" url="tp-login2.ci.uchicago.edu"
> >> jobmanager="ssh:pbs" />
> >> <profile namespace="globus" key="maxtime">3000</profile>
> >> <profile namespace="globus" key="workersPerNode">8</profile>
> >> <profile namespace="globus" key="slots">1</profile>
> >> <profile namespace="globus" key="nodeGranularity">1</profile>
> >> <profile namespace="globus" key="maxNodes">10</profile>
> >> <profile namespace="globus" key="queue">short</profile>
> >> <profile namespace="karajan" key="jobThrottle">0.7</profile>
> >> <profile namespace="karajan" key="initialScore">10000</profile>
> >> <filesystem provider="ssh" url="tp-login2.ci.uchicago.edu"/>
> >> <workdirectory>/home/jonmon/Library/swift/work/teraport</workdirectory>
> >> </pool>
> >>
> >> This is my sites file
> >>
> >> On 8/3/10 10:40 PM, Mihael Hategan wrote:
> >>      
> >>> On the remote site there should be something called
> >>> ~/.globus/coasters/coasters.log. It tends to contain useful information.
> >>> As usual, the swift log also tends to contain useful information.
> >>>
> >>> However, Mike has mentioned some problems when using the coaster
> >>> filesystem provider. In the effort to implement provider staging for
> >>> coasters, that may have broke. Is that what you are using? (i.e. post
> >>> sites.xml).
> >>>
> >>> Mihael
> >>>
> >>> On Tue, 2010-08-03 at 21:12 -0500, Jonathan Monette wrote:
> >>>
> >>>        
> >>>> Hello,
> >>>>        Has anyone ever ran into this error:
> >>>>
> >>>>        Failed to transfer wrapper log from
> >>>> m101_montage-20100803-2101-4ihqvdv9/info/s on teraport
> >>>> Execution failed:
> >>>>        Exception in mProjectPP:
> >>>> Arguments: [-X, raw_dir/2mass-atlas-990524n-j0320044.fits,
> >>>> proj_dir/proj_2mass-atlas-990524n-j0320044.fits, template.hdr]
> >>>> Host: teraport
> >>>> Directory: m101_montage-20100803-2101-4ihqvdv9/jobs/s/mProjectPP-sz57orvj
> >>>> stderr.txt:
> >>>>
> >>>> stdout.txt:
> >>>>
> >>>> ----
> >>>>
> >>>> Caused by:
> >>>>
> >>>> org.globus.cog.abstraction.impl.file.IrrecoverableResourceException:
> >>>> Cannot determine the existence of the file
> >>>> Caused by:
> >>>>        The connection has been closed [Unnamed Channel]
> >>>> Cleaning up...
> >>>> Shutting down service at https://128.135.125.117:52276
> >>>> Got channel MetaChannel: 1867624887[1293086287: {}] ->
> >>>> GSSSChannel-11234326669(1)[1293086287: {}]
> >>>> + Done
> >>>>
> >>>> I am testing my wrappers to Montage on a larger scale and I keep getting
> >>>> this error.  There is about 640 images but it only projects about 142
> >>>> images before this error pops up.  If my run will help my run exists at
> >>>> "/home/jonmon/Workspace/Swift/Montage/m101_j_4x4/runs/m101_montage_Aug-03-2010_21-01-09"
> >>>> on the ci machines.  Any help is much appreciated.
> >>>>
> >>>>
> >>>>          
> >>>
> >>>        
> >>      
> >
> >    
> 





More information about the Swift-user mailing list