[Swift-devel] Problem with 0.92 sending jobs to OSG via Condor-G

Mihael Hategan hategan at mcs.anl.gov
Thu Jan 13 11:18:10 CST 2011


On Thu, 2011-01-13 at 09:31 -0600, Michael Wilde wrote:
> 
> ----- Original Message -----
> > Shouldn't we be looking at the diffs in provider-localscheduler?
> 
> I don't *think* so - my tests were using Condor-G directly:
> 
>     <profile namespace="globus" key="jobType">grid</profile>
>     <profile namespace="globus" key="gridResource">gt2 ff-grid3.unl.edu/jobmanager-pbs</profile>
> 
> But in any case, I diff'ed the entire cog and swift trees, and saw
> almost *no* diffs (see later msg). The only one I am suspicious of at
> the moment is the @Override patch.

I highly doubt that the problem is with the @Override(s).
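
For what it's worth, the "Found illegal unescaped double-quote" message in
the original report looks like condor_submit rejecting the bare "" in the
arguments line. Below is a minimal sketch, in Python rather than the
provider's Java, of one way to quote an argument list using Condor's "new"
arguments syntax: the whole string goes in double quotes, arguments
containing whitespace go in single quotes, and literal quotes are doubled.
The function names are made up for illustration and are not the
provider-condor code. Writing an empty argument as '' is an assumption that
should be checked against the installed Condor version.

```python
def condor_quote_arg(arg):
    """Quote one argument for Condor's "new" arguments syntax (hypothetical helper)."""
    # A literal double quote is written as two double quotes.
    escaped = arg.replace('"', '""')
    # Empty arguments, or arguments with whitespace or single quotes,
    # are wrapped in single quotes; literal single quotes are doubled.
    if arg == "" or any(c in arg for c in " \t'"):
        escaped = "'" + escaped.replace("'", "''") + "'"
    return escaped


def condor_arguments(args):
    """Build the value for an 'arguments = ...' line from a list of strings."""
    # The whole value is enclosed in double quotes, which selects the new syntax.
    return '"' + " ".join(condor_quote_arg(a) for a in args) + '"'


if __name__ == "__main__":
    # Argument names taken from the failing _swiftwrap command line.
    print("arguments = " + condor_arguments(["-scratch", "", "-e", "/bin/cat", "-k", ""]))
```

With quoting like this, the empty -scratch, -k and -cdmfile values would
show up in the submit file as '' rather than as a bare "".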

> 
> I need to find when that change was made and whether I somehow compiled *with* the Overrides in place in the older working copy.
> 
> - Mike
> 
> > 
> > -Allan (mobile)
> > 
> > On Jan 13, 2011 11:17 AM, "Michael Wilde" < wilde at mcs.anl.gov > wrote:
> > >
> > >
> > >
> > > ----- Original Message -----
> > > > > > I need to check what local mods I had applied, but I think
> > > > > > it's more likely that some Condor submit file quoting fix
> > > > > > fell off in 0.92 integration.
> > > >
> > > > Yeah. A svn diff > somefile would help.
> > >
> > > Hmmm. So far svn diffs show no changes within provider-condor,
> > > neither between trunk and 0.92 branch nor within my working copies
> > > of those two on engage-submit, which seem to behave differently
> > > regarding Condor quoting.
> > >
> > > > Could the change(s) that were made a long time ago to fix Condor
> > > > quoting be in a different module than provider-condor? If so, what's
> > > > a likely place to look?
> > >
> > > I'll check vdl-int.k next.
> > >
> > > - Mike
> > >
> > > > >
> > > > > So Marc, sorry - this release is not usable for you yet.
> > > > >
> > > > > - Mike
> > > > >
> > > > >
> > > > > ----- Original Message -----
> > > > > > > I'm trying my first tests of 0.92 on engage-submit, sending
> > > > > > > 100 trivial cat jobs to 10 OSG sites.
> > > > > >
> > > > > > > My jobs seem to be all dying with the error "Found illegal
> > > > > > > unescaped double-quote" (see below).
> > > > > >
> > > > > > Has anyone successfully run a Condor-G job on OSG with 0.92?
> > > > > >
> > > > > > > I'll dig deeper and try the same test with the older version
> > > > > > > of trunk that Marc has been using here with better success.
> > > > > > > Will also try a single job run and capture a simpler log and
> > > > > > > the condor-g submit file.
> > > > > >
> > > > > > > Allan, have you tried 0.92 against Condor-G? If not, could you?
> > > > > >
> > > > > > > Sarah, we should add some Condor-G-to-GT2 testing to 0.92
> > > > > > > validation I think.
> > > > > >
> > > > > > - Mike
> > > > > >
> > > > > > --
> > > > > >
> > > > > > > Caused by:
> > > > > > > org.globus.cog.abstraction.impl.common.task.TaskSubmissionException:
> > > > > > > Cannot submit job: Could not submit job (condor_submit reported an exit code of 1).
> > > > > > > Submitting job(s)
> > > > > > > Found illegal unescaped double-quote: "" -e /bin/cat -out outdir/f.0065.out -err stderr.txt -i -d outdir -if data.txt -of outdir/f.0065.out -k "" -cdmfile "" -status file -a data.txt
> > > > > > > The full arguments you specified were:
> > > > > > > /osg/data/engage/tmp/osg.hpc.ufl.edu/catsn-20110113-0025-vv4p4up3/shared/_swiftwrap cat-ajxnee4k -jobdir a -scratch "" -e /bin/cat -out outdir/f.0065.out -err stderr.txt -i -d outdir -if data.txt -of outdir/f.0065.out -k "" -cdmfile "" -status file -a data.txt
> > > > > >
> > > > > >
> > > > > > Script is:
> > > > > >
> > > > > > e$ cat catsn.swift
> > > > > > > type file;
> > > > > > >
> > > > > > > app (file o) cat (file i)
> > > > > > > {
> > > > > > >   cat @i stdout=@o;
> > > > > > > }
> > > > > > >
> > > > > > > file out[]<simple_mapper; location="outdir", prefix="f.", suffix=".out">;
> > > > > > > foreach j in [1:@toint(@arg("n","1"))] {
> > > > > > >   file data<"data.txt">;
> > > > > > >   out[j] = cat(data);
> > > > > > > }
> > > > > >
> > > > > >
> > > > > > --
> > > > > > Michael Wilde
> > > > > > Computation Institute, University of Chicago
> > > > > > Mathematics and Computer Science Division
> > > > > > Argonne National Laboratory
> > > > > >
> > > > > > _______________________________________________
> > > > > > Swift-devel mailing list
> > > > > > Swift-devel at ci.uchicago.edu
> > > > > > http://mail.ci.uchicago.edu/mailman/listinfo/swift-devel
> > > > >
> > >
> > > --
> > > Michael Wilde
> > > Computation Institute, University of Chicago
> > > Mathematics and Computer Science Division
> > > Argonne National Laboratory
> > >
> > > _______________________________________________
> > > Swift-devel mailing list
> > > Swift-devel at ci.uchicago.edu
> > > http://mail.ci.uchicago.edu/mailman/listinfo/swift-devel
> > >
> 




