[Swift-user] errors in file transfer

Yue, Chen - BMD yuechen at bsd.uchicago.edu
Wed Apr 29 17:30:07 CDT 2009


Hi Mihael,
 
When I do qstat, it shows the following line for all my jobs in the queue:
 
937872.abem5              null             yuechen                0 Q(null) normal
 
It looks like no job is running. 
 
I did the qstat -q. Should I use the following line instead in sites.xml for shorter Walltime?
 
<profile namespace="globus" key="queue">debug</profile>

I will send email to help at teragrid.org about the Bigred certificate problem.
 
Thanks!
 
Chen, Yue
 
 

________________________________

From: Mihael Hategan [mailto:hategan at mcs.anl.gov]
Sent: Wed 4/29/2009 4:23 PM
To: Yue, Chen - BMD
Cc: swift user
Subject: RE: [Swift-user] errors in file transfer



On Wed, 2009-04-29 at 16:06 -0500, Yue, Chen - BMD wrote:
> Hi Mihael,
> 
> I deleted the following line in my sites.xml file for NCSA_Abe and the
> wrapper transfer warnings are gone.
> 
> <profile namespace="globus" key="queue">fast</profile>
> 
> I can also find jobs queuing on Abe. However, after quite a while, no
> job returned. I guess it is because I didn't set a priority and all
> the jobs are waiting.

When you do qstat, are your jobs in a queued state?

>  Is there other way to set priority?

You should be able to specify the queue. The only problem is that you
are specifying a queue that doesn't exist on Abe.

This is what I've found online:
http://www.ncsa.uiuc.edu/UserInfo/Resources/Hardware/Intel64Cluster/Doc/Jobs.html#Queues

You can also log in, and do a qstat -q, which will show the following:
[hategan at honest2 ~]$ qstat -q

server: abem5.ncsa.uiuc.edu

Queue            Memory CPU Time Walltime Node  Run Que Lm  State
---------------- ------ -------- -------- ----  --- --- --  -----
normal             --      --    48:00:00   600  82 928 --   E R
iacat2             --      --    241:00:0   --    0  20 --   E R
indprio            --      --    48:00:00   600   0   0 --   E R
long               --      --    168:00:0   600  13  15 --   E R
iacat              --      --    241:00:0   --    0   0 --   E R
industrial         --      --    336:00:0   600  14  32 --   E R
lincoln            --      --    241:00:0   --    2   0 --   E R
wide               --      --    48:00:00  1196   6 344 --   E R
mlinglin           --      --    168:00:0   256   2   0 --   E R
debug              --      --    00:30:00    16   0   4 --   E R
fernsler           --      --    168:00:0    32   0   0 --   E R
specreq            --      --    241:00:0   600   2   0 --   E R
                                               ----- -----
                                                 121  1343


>  I will try again later.
> 
> I then tested the IU BigRed with my application. Swift showed me the
> following error and I don't know if this is because of my setting:
> 
> Progress:  Selecting site:1019  Initializing site shared directory:4
> Execution failed:
>         Could not initialize shared directory on IU_BigRed
> Caused by:
>         org.globus.cog.abstraction.impl.file.FileResourceException:
> Error communicating with the GridFTP server
> Caused by:
>         Server refused performing the request. Custom message: Server
> refused GSSAPI authentication. (error code 1) [Nested exception
> message:  Custom message: Unexpected reply: 530-globus_xio: Server
> side credential failure
> 530-globus_gsi_gssapi: Error with GSI credential
> 530-globus_gsi_gssapi: Error with gss credential handle
> 530-globus_credential: Error with credential: The host
> credential: /etc/grid-security/hostcert.pem
> 530-     with subject: /C=US/O=National Center for Supercomputing
> Applications/CN=gridftp4.bigred.teragrid.iu.edu
> 530-     has expired 4459 minutes ago.
> 530-
> 530 End.]

Bigred, it would seem, has an expired host certificate. This is a
problem with the site. I would suggest seding an email to
help at teragrid.org with the above message (from "Server refused
performing the request" to "530 End.]").






This email is intended only for the use of the individual or entity to which it is addressed and may contain information that is privileged and confidential. If the reader of this email message is not the intended recipient, you are hereby notified that any dissemination, distribution, or copying of this communication is prohibited. If you have received this email in error, please notify the sender and destroy/delete all copies of the transmittal. Thank you.

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/swift-user/attachments/20090429/bf1fac8a/attachment.html>


More information about the Swift-user mailing list