[Swift-commit] r2386 - text/hpdc09submission

noreply at svn.ci.uchicago.edu noreply at svn.ci.uchicago.edu
Thu Jan 1 05:36:26 CST 2009


Author: benc
Date: 2009-01-01 05:36:24 -0600 (Thu, 01 Jan 2009)
New Revision: 2386

Modified:
   text/hpdc09submission/paper.latex
Log:
a few more bits of rambling

Modified: text/hpdc09submission/paper.latex
===================================================================
--- text/hpdc09submission/paper.latex	2008-12-30 15:43:21 UTC (rev 2385)
+++ text/hpdc09submission/paper.latex	2009-01-01 11:36:24 UTC (rev 2386)
@@ -454,6 +454,7 @@
 \hline
 \end{tabular}
 \caption{SwiftScript built-in mappers}
+\label{mappertable}
 \end{table}
 
 \subsection{The execution environment for component programs}
@@ -513,8 +514,21 @@
 value of that variable has not yet been computed, the filename where
 that value will go is already known.
 
+TODO comment (here?) about how this model appears somewhat constrained
+but provides a well defined atomicity that can be used for various
+reliability mechanisms, site portability, on-site efficiency tuning.
+multi-site and reliabilty are already discussed; but the on-site
+efficiency tuning (eg using GPFS and laying out files in a way that is
+sympathetic to that, potentially using Collective IO fs, and using a
+workernode local filesystem) - that discussion could go into the
+'executing efficiently' section, or a different 'executing efficiently'
+section (change titles...)
+
 \section{Execution}
 
+TODO could briefly describe the execution layer here? how much depth is
+interesting?
+
 \subsection{Executing on a remote site}
 
   With the above restrictions, execution of a unix program on a remote
@@ -547,10 +561,20 @@
 the submitting system to the remote system; and with GRAM\cite{GRAM} and a local
 resource manager (LRM) providing an execution mechanism.
 
-Parameterisation of sites occurs in the \emph{site catalog}, an example
-of which is:
+Sites are defined in the \emph{site catalog}, which contains descriptions
+of each site:
 
-TODO site catalog example here
+ \begin{verbatim}
+  <pool handle="tguc">
+    <gridftp
+      url="gsiftp://tg-gridftp.uc.teragrid.org" /> 
+    <execution provider="gt4" jobmanager="PBS" 
+      url="tg-grid.uc.teragrid.org" />
+    <workdirectory>
+      /home/benc/swifttest
+    </workdirectory>
+  </pool>
+ \end{verbatim}
 
 This file may be constructed by hand or mechanically from some
 pre-existing database (such as a grid's existing discovery system).
@@ -628,7 +652,7 @@
   When any of those jobs begins executing, all other replicas will be
 cancelled.
 
-\subsection{Executing efficiently}
+\subsection{Avoiding job submission penalties}
 
 In many applications, the overhead of job submission through commonly
 available mechanisms, such as through GRAM into an LRM, can dominate
@@ -680,6 +704,20 @@
 way to talk about the 'underlying submission system, that underlies
 coasters and clustering' ?
 
+\subsection{Avoiding filesystem inefficiency}
+
+When running a large number of jobs on a site at once, access to the
+shared filesystem on that site can be a bottleneck.
+
+On large systems, the shared file system is commonly provided by
+GPFS\cite{GPFS}. This can scale well but when used na\"ively can
+exhibit pathological behaviour. Early versions of Swift triggered this
+behaviour by targetting too much file system activity at a single
+working directory, so that GPFS lock contention came to dominate execution
+time.
+
+TODO more...
+
 \section{Example applications}
 
 TODO: two or three applications in brief. discuss both the application
@@ -792,6 +830,9 @@
 
 TODO reference the VDC from VDS\cite{VDS}
 
+TODO write about the stuff on provenance db that I did before - that whole
+document of notes...
+
 \subsection{Workflow GUIs as generators of SwiftScript programs - LONI
 Pipeline}
 
@@ -846,8 +887,8 @@
 \subsection{Debugging}
 
  TODO: debugging of distributed system - can have a non-futures section
-on what is available now, as well as mentioning CEDPS\cite{CEDPS} as
-somewhat promising(?) for the future.
+on what is available now - logprocessing module, as well as
+mentioning CEDPS\cite{CEDPS} as somewhat promising(?) for the future.
 
 \section{Implementation status}
 
@@ -866,7 +907,7 @@
 
   Reference Swift as a follow-on project to VDL in VDS; how does XDTM fit
     into this? Is it of any interest other than as part of the
-    project history?
+    project history? And is history of this project interesting? maybe so...
 
 
   Acknowledgement of all developers names?
@@ -926,7 +967,14 @@
 
 \bibitem{MAPREDUCE} - mapreduce
 
+\bibitem{TERAGRID} - teragrid
 
+\bibitem{OSG} - open science grid
+
+\bibitem{ReSS} - ress
+
+\bibitem{GPFS} - GPFS
+
 \end{thebibliography}
 \end{document}
 




More information about the Swift-commit mailing list