[Swift-user] errors in file transfer
Mihael Hategan
hategan at mcs.anl.gov
Wed Apr 29 16:23:42 CDT 2009
On Wed, 2009-04-29 at 16:06 -0500, Yue, Chen - BMD wrote:
> Hi Mihael,
>
> I deleted the following line in my sites.xml file for NCSA_Abe and the
> wrapper transfer warnings are gone.
>
> <profile namespace="globus" key="queue">fast</profile>
>
> I can also find jobs queuing on Abe. However, after quite a while, no
> job returned. I guess it is because I didn't set a priority and all
> the jobs are waiting.
When you do qstat, are your jobs in a queued state?
> Is there other way to set priority?
You should be able to specify the queue. The only problem is that you
are specifying a queue that doesn't exist on Abe.
This is what I've found online:
http://www.ncsa.uiuc.edu/UserInfo/Resources/Hardware/Intel64Cluster/Doc/Jobs.html#Queues
You can also log in, and do a qstat -q, which will show the following:
[hategan@honest2 ~]$ qstat -q

server: abem5.ncsa.uiuc.edu

Queue            Memory CPU Time Walltime Node Run Que Lm State
---------------- ------ -------- -------- ---- --- --- -- -----
normal             --      --    48:00:00  600  82 928 --   E R
iacat2             --      --    241:00:0   --   0  20 --   E R
indprio            --      --    48:00:00  600   0   0 --   E R
long               --      --    168:00:0  600  13  15 --   E R
iacat              --      --    241:00:0   --   0   0 --   E R
industrial         --      --    336:00:0  600  14  32 --   E R
lincoln            --      --    241:00:0   --   2   0 --   E R
wide               --      --    48:00:00 1196   6 344 --   E R
mlinglin           --      --    168:00:0  256   2   0 --   E R
debug              --      --    00:30:00   16   0   4 --   E R
fernsler           --      --    168:00:0   32   0   0 --   E R
specreq            --      --    241:00:0  600   2   0 --   E R
                                             ----- -----
                                               121  1343
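
As a sketch, assuming the "normal" queue from the listing above fits your
jobs' walltime and node counts (that choice is an assumption, not something
confirmed for your application), the profile line in sites.xml would name a
queue that actually exists on Abe:

```xml
<!-- Assumption: "normal" is used only as an example; any queue name
     reported by qstat -q on Abe can go here instead of "fast". -->
<profile namespace="globus" key="queue">normal</profile>
```

For short test runs, "debug" (30 minute walltime, 16 nodes in the listing)
may get through the queue sooner.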
> I will try again later.
>
> I then tested the IU BigRed with my application. Swift showed me the
> following error and I don't know if this is because of my setting:
>
> Progress: Selecting site:1019 Initializing site shared directory:4
> Execution failed:
> Could not initialize shared directory on IU_BigRed
> Caused by:
> org.globus.cog.abstraction.impl.file.FileResourceException:
> Error communicating with the GridFTP server
> Caused by:
> Server refused performing the request. Custom message: Server
> refused GSSAPI authentication. (error code 1) [Nested exception
> message: Custom message: Unexpected reply: 530-globus_xio: Server
> side credential failure
> 530-globus_gsi_gssapi: Error with GSI credential
> 530-globus_gsi_gssapi: Error with gss credential handle
> 530-globus_credential: Error with credential: The host
> credential: /etc/grid-security/hostcert.pem
> 530- with subject: /C=US/O=National Center for Supercomputing
> Applications/CN=gridftp4.bigred.teragrid.iu.edu
> 530- has expired 4459 minutes ago.
> 530-
> 530 End.]
BigRed, it would seem, has an expired host certificate. This is a
problem with the site. I would suggest sending an email to
help at teragrid.org with the above message (from "Server refused
performing the request" to "530 End.]").