[Swift-devel] a note on stress testing

Michael Wilde wilde at mcs.anl.gov
Sun Mar 10 11:18:00 CDT 2013


> ... my understanding was that we'd need three types on
> runs based on where swift runs:

> -> Local | Local ( Swift runs locally with jobs also run locally )
> -> Local | Remote ( Swift runs locally but the compute resources
> are remote providers ? )
> -> Remote | Local ( Swift instantiated on a remote system to
> coordinate resources locally )

Right.

> In the case you have pointed out, Swift on Midway submits jobs to
> beagle.

> Is this a remote|remote type which I haven't listed?

Not by itself: its "local|remote".

But you can *initiate* this test remotely, making it "remote|remote".

So while all 4 boxes of this 2x2 matrix make sense, ultimately you could think of all the tests being initiated from a test service like Jenkins, so *all* tests would be "remote" in column one of your list above. That simplifies the way you can organize the test plan and execution mechanism.

> Can we expect to
> see the same errors,
> if we were to run swift on an MCS machine to submit the same jobs to
> beagle?

Yes, I think in this particular case, the main errors were not caused by the environment in the submitting system (midway hosts).  But they very well *could have been*.  For example, if we only test on hosts that have single network interfaces, we'd never see known errors related to not handling the multiple-NIC case correctly.  And in fact in our testing yesterday this was indeed an issue thats still unresolved: whether GLOBUS_HOST or the equivalent sites.xml tag needs to be set to a dotted IP or could accept a DNS FQDN, and under what circumstances. This needs both testing *and* documentation. Similar issues apply to hosts with various firewall/iptables settings.

- Mike



More information about the Swift-devel mailing list