[Swift-user] errors in file transfer

Mihael Hategan hategan at mcs.anl.gov
Wed Apr 29 16:23:42 CDT 2009


On Wed, 2009-04-29 at 16:06 -0500, Yue, Chen - BMD wrote:
> Hi Mihael,
>  
> I deleted the following line in my sites.xml file for NCSA_Abe and the
> wrapper transfer warnings are gone. 
>  
> <profile namespace="globus" key="queue">fast</profile>
>  
> I can also find jobs queuing on Abe. However, after quite a while, no
> job returned. I guess it is because I didn't set a priority and all
> the jobs are waiting.

When you do qstat, are your jobs in a queued state?

>  Is there another way to set priority?

You should be able to specify the queue. The only problem is that the
queue you specified ("fast") doesn't exist on Abe.

This is what I've found online:
http://www.ncsa.uiuc.edu/UserInfo/Resources/Hardware/Intel64Cluster/Doc/Jobs.html#Queues

You can also log in, and do a qstat -q, which will show the following:
[hategan at honest2 ~]$ qstat -q

server: abem5.ncsa.uiuc.edu

Queue            Memory CPU Time Walltime Node  Run Que Lm  State
---------------- ------ -------- -------- ----  --- --- --  -----
normal             --      --    48:00:00   600  82 928 --   E R
iacat2             --      --    241:00:0   --    0  20 --   E R
indprio            --      --    48:00:00   600   0   0 --   E R
long               --      --    168:00:0   600  13  15 --   E R
iacat              --      --    241:00:0   --    0   0 --   E R
industrial         --      --    336:00:0   600  14  32 --   E R
lincoln            --      --    241:00:0   --    2   0 --   E R
wide               --      --    48:00:00  1196   6 344 --   E R
mlinglin           --      --    168:00:0   256   2   0 --   E R
debug              --      --    00:30:00    16   0   4 --   E R
fernsler           --      --    168:00:0    32   0   0 --   E R
specreq            --      --    241:00:0   600   2   0 --   E R
                                               ----- -----
                                                 121  1343
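
As an illustration only (assuming your allocation is allowed to submit
to the "normal" queue listed above, which I have not checked), the
profile line in sites.xml would then look something like this:

<profile namespace="globus" key="queue">normal</profile>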


>  I will try again later.
>  
> I then tested the IU BigRed with my application. Swift showed me the
> following error and I don't know if this is because of my setting:
>  
> Progress:  Selecting site:1019  Initializing site shared directory:4
> Execution failed:
>         Could not initialize shared directory on IU_BigRed
> Caused by:
>         org.globus.cog.abstraction.impl.file.FileResourceException:
> Error communicating with the GridFTP server
> Caused by:
>         Server refused performing the request. Custom message: Server
> refused GSSAPI authentication. (error code 1) [Nested exception
> message:  Custom message: Unexpected reply: 530-globus_xio: Server
> side credential failure
> 530-globus_gsi_gssapi: Error with GSI credential
> 530-globus_gsi_gssapi: Error with gss credential handle
> 530-globus_credential: Error with credential: The host
> credential: /etc/grid-security/hostcert.pem
> 530-     with subject: /C=US/O=National Center for Supercomputing
> Applications/CN=gridftp4.bigred.teragrid.iu.edu
> 530-     has expired 4459 minutes ago.
> 530-
> 530 End.]

BigRed, it would seem, has an expired host certificate. This is a
problem with the site. I would suggest sending an email to
help at teragrid.org with the above message (from "Server refused
performing the request" to "530 End.]").




