[Swift-devel] Another issue regarding uj-pbs-gram2.xml

Zhao Zhang zhaozhang at uchicago.edu
Mon May 4 16:32:23 CDT 2009


So I got the following info
[zzhang at communicado uj]$ globus-job-run osg-ce.grid.uj.ac.za 
/usr/bin/qstat -q

server: gridvm

Queue            Memory CPU Time Walltime Node  Run Que Lm  State
---------------- ------ -------- -------- ----  --- --- --  -----
medium             --   02:00:00    --      --    0   0 50   E R
atlas              --      --    02:00:00   --    0   0 --   E R
gilda              --   02:00:00    --      --    0   0 --   E R
batch              --      --    01:00:00   --    0   0 --   E R
small              --   00:20:00    --      --    0   0 50   E R
default            --      --       --      --    0   0 --   E R
long               --   12:00:00    --      --    0   0 20   E R
verylong           --   72:00:00    --      --    0   0 10   E R
                                               ----- -----
                                                   0     0
Then I used the default queue, now job is running. Thanks!

zhao

Mihael Hategan wrote:
> gram_job_mgr_10273.log:
> ...
> Mon May  4 20:59:33 2009 JM_SCRIPT: qsub returned
> Mon May  4 20:59:33 2009 JM_SCRIPT: qsub stderr qsub: Job exceeds queue
> resource limits MSG=cannot satisfy queue max walltime requirement
> ...
>
> Find out what queues are on that machine (qstat -q), and change
> appropriately. That or set the proper coasterWorkerMaxwalltime.
>
> On Mon, 2009-05-04 at 15:57 -0500, Zhao Zhang wrote:
>   
>> Hey, Here are all of them: 
>> /home/zzhang/swift_coaster/cog/modules/swift/tests/sites/uj    :-)
>>
>> zhao
>>
>> Mihael Hategan wrote:
>>     
>>> On Mon, 2009-05-04 at 15:07 -0500, Zhao Zhang wrote:
>>>   
>>>       
>>>> This one?
>>>>
>>>> /home/zzhang/swift_coaster/cog/modules/swift/tests/sites/gram_job_mgr_18272.log
>>>>     
>>>>         
>>> Nope. I suggest copying them all, and I'll look for the right one.
>>>
>>>   
>>>       
>>>> zhao
>>>>
>>>> Mihael Hategan wrote:
>>>>     
>>>>         
>>>>> On Mon, 2009-05-04 at 15:02 -0500, Zhao Zhang wrote:
>>>>>   
>>>>>       
>>>>>           
>>>>>> I also pulled out the coaster log in the same directory. Then what shall 
>>>>>> we do with this error?
>>>>>>     
>>>>>>         
>>>>>>             
>>>>> Was that the only gram log? What I suspect is that there is another gram
>>>>> log which is more relevant.
>>>>>
>>>>>   
>>>>>       
>>>>>           
>>>>>> zhao
>>>>>>
>>>>>> Mihael Hategan wrote:
>>>>>>     
>>>>>>         
>>>>>>             
>>>>>>> On Mon, 2009-05-04 at 14:46 -0500, Zhao Zhang wrote:
>>>>>>>   
>>>>>>>       
>>>>>>>           
>>>>>>>               
>>>>>>>> Ha, I got it.
>>>>>>>>
>>>>>>>> It is at CI network: 
>>>>>>>> /home/zzhang/swift_coaster/cog/modules/swift/tests/sites/gram_job_mgr_22265.log
>>>>>>>>     
>>>>>>>>         
>>>>>>>>             
>>>>>>>>                 
>>>>>>> There you go :)
>>>>>>>
>>>>>>> I'm not sure this is the right log though. This is the one from the job
>>>>>>> that was used to start the coaster service, and it shows a different
>>>>>>> error (155 - stageout failure).
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>   
>>>>>>>       
>>>>>>>           
>>>>>>>               
>>>>>   
>>>>>       
>>>>>           
>>>   
>>>       
>
>
>   



More information about the Swift-devel mailing list