From iraicu at cs.iit.edu Thu Dec 15 07:45:48 2011 From: iraicu at cs.iit.edu (Ioan Raicu) Date: Thu, 15 Dec 2011 07:45:48 -0600 Subject: [Swift-user] CFP: ACM Symposium on High-Performance Parallel and Distributed Computing (HPDC'12) Message-ID: <4EE9FA0C.40006@cs.iit.edu> **** CALL FOR PAPERS **** The 21st International ACM Symposium on High-Performance Parallel and Distributed Computing (HPDC'12) Delft University of Technology, Delft, the Netherlands June 18-22, 2012 http://www.hpdc.org/2012 The ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC) is the premier annual conference on the design, the implementation, the evaluation, and the use of parallel and distributed systems for high-end computing. HPDC'12 will take place in Delft, the Netherlands, a historical, picturesque city that is less than one hour away from Amsterdam-Schiphol airport. The conference will be held on June 20-22 (Wednesday to Friday), with affiliated workshops taking place on June 18-19 (Monday and Tuesday). **** SUBMISSION DEADLINES **** Abstracts: 16 January 2012 Papers: 23 January 2012 (No extensions!) **** HPDC'12 GENERAL CHAIR **** Dick Epema, Delft University of Technology, Delft, the Netherlands **** HPDC'12 PROGRAM CO-CHAIRS **** Thilo Kielmann, Vrije Universiteit, Amsterdam, the Netherlands Matei Ripeanu, The University of British Columbia, Vancouver, Canada **** HPDC'12 WORKSHOPS CHAIR **** Alexandru Iosup, Delft University of Technology, Delft, the Netherlands **** SCOPE AND TOPICS **** Submissions are welcomed on all forms of high-performance parallel and distributed computing, including but not limited to clusters, clouds, grids, utility computing, data-intensive computing, and massively multicore systems. Submissions that explore solutions to estimate and reduce the energy footprint of such systems are particularly encouraged. 
All papers will be evaluated for their originality, potential impact, correctness, quality of presentation, appropriate presentation of related work, and relevance to the conference, with a strong preference for rigorous results obtained in operational parallel and distributed systems. The topics of interest of the conference include, but are not limited to, the following, in the context of high-performance parallel and distributed computing: - Systems, networks, and architectures for high-end computing - Massively multicore systems - Virtualization of machines, networks, and storage - Programming languages and environments - I/O, storage systems, and data management - Resource management, energy and cost minimizations - Performance modeling and analysis - Fault tolerance, reliability, and availability - Data-intensive computing - Applications of parallel and distributed computing **** PAPER SUBMISSION GUIDELINES **** Authors are invited to submit technical papers of at most 12 pages in PDF format, including figures and references. Papers should be formatted in the ACM Proceedings Style and submitted via the conference web site. No changes to the margins, spacing, or font sizes as specified by the style file are allowed. Accepted papers will appear in the conference proceedings, and will be incorporated into the ACM Digital Library. A limited number of papers will be accepted as posters. Papers must be self-contained and provide the technical substance required for the program committee to evaluate their contributions. Submitted papers must be original work that has not appeared in and is not under consideration for another conference or a journal. See the ACM Prior Publication Policy for more details. **** IMPORTANT DATES **** Abstracts Due: 16 January 2012 Papers Due: 23 January 2012 (No extensions!) 
Reviews Released to Authors: 8 March 2012 Author Rebuttals Due: 12 March 2012 Author Notifications: 19 March 2012 Final Papers Due: 16 April 2012 Conference Dates: 18-22 June 2012 -- ================================================================= Ioan Raicu, Ph.D. Assistant Professor, Illinois Institute of Technology (IIT) Guest Research Faculty, Argonne National Laboratory (ANL) ================================================================= Data-Intensive Distributed Systems Laboratory, CS/IIT Distributed Systems Laboratory, MCS/ANL ================================================================= Cel: 1-847-722-0876 Office: 1-312-567-5704 Email: iraicu at cs.iit.edu Web: http://www.cs.iit.edu/~iraicu/ Web: http://datasys.cs.iit.edu/ ================================================================= ================================================================= From iraicu at cs.iit.edu Fri Dec 16 16:54:14 2011 From: iraicu at cs.iit.edu (Ioan Raicu) Date: Fri, 16 Dec 2011 16:54:14 -0600 Subject: [Swift-user] CFP: Workshops at ACM HPDC 2012 Message-ID: <4EEBCC16.9040300@cs.iit.edu> **** WORKSHOPS at ACM HPDC 2012 **** http://www.hpdc.org/2012/workshops/ The ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC) is the premier annual conference on the design, the implementation, the evaluation, and the use of parallel and distributed systems for high-end computing. HPDC'12 will take place in Delft, the Netherlands, a historical, picturesque city that is less than one hour away from Amsterdam-Schiphol airport. The conference will be held on June 20-22 (Wednesday to Friday), with affiliated workshops taking place on June 18-19 (Monday and Tuesday). 
**** HPDC'12 WORKSHOPS CHAIR **** Alexandru Iosup, Delft University of Technology, Delft, the Netherlands The workshops include: - Astro-HPC: Workshop on High-Performance Computing for Astronomy http://www.pds.ewi.tudelft.nl/~varbanescu/AstroHPC/index.html - ECMLS2012: 3rd Int'l Emerging Computational Methods for the Life Sciences Workshop http://salsahpc.indiana.edu/ECMLS2012 - SocMP'12: First Workshop on Social Media Processing http://www.pds.ewi.tudelft.nl/~iosup/socmp2012 - ScienceCloud: 3rd Workshop on Scientific Cloud Computing http://ceng.usc.edu/~simmhan/ScienceCloud2012 - DIDC: Fifth Int'l Workshop on Data-Intensive Distributed Computing http://www.cse.buffalo.edu/faculty/tkosar/didc2012 - ISDP: In-Situ Data Processing technologies http://users.nccs.gov/~sklasky/isdp - LSAP2012: 4th Workshop on Large-scale Systems and Applications Performance http://www.lsap2012.org/ - MapReduce'12: The Third Int'l Workshop on MapReduce and its Applications http://graal.ens-lyon.fr/mapreduce - VTDC-2012: 6th Int'l Workshop on Virtualization Technologies in Distributed Computing http://www.grid-appliance.org/wiki/index.php/VTDC12 -- ================================================================= Ioan Raicu, Ph.D. 
Assistant Professor, Illinois Institute of Technology (IIT) Guest Research Faculty, Argonne National Laboratory (ANL) ================================================================= Data-Intensive Distributed Systems Laboratory, CS/IIT Distributed Systems Laboratory, MCS/ANL ================================================================= Cel: 1-847-722-0876 Office: 1-312-567-5704 Email: iraicu at cs.iit.edu Web: http://www.cs.iit.edu/~iraicu/ Web: http://datasys.cs.iit.edu/ ================================================================= ================================================================= From iraicu at cs.iit.edu Tue Dec 20 18:21:16 2011 From: iraicu at cs.iit.edu (Ioan Raicu) Date: Tue, 20 Dec 2011 18:21:16 -0600 Subject: [Swift-user] CFP: 8th IEEE Int. Conf. on eScience 2012, Chicago IL USA Message-ID: <4EF1267C.8070101@cs.iit.edu> CALL FOR PAPERS 8th IEEE International Conference on eScience http://www.ci.uchicago.edu/escience2012/ October 8-12, 2012 Chicago, IL, USA Researchers in all disciplines are increasingly adopting digital tools, techniques and practices, often in communities and projects that span disciplines, laboratories, organizations, and national boundaries. The eScience 2012 conference is designed to bring together leading international and interdisciplinary research communities, developers, and users of eScience applications and enabling IT technologies. The conference serves as a forum to present the results of the latest applications research and product/tool developments and to highlight related activities from around the world. Also, we are now entering the second decade of eScience and the 2012 conference gives an opportunity to take stock of what has been achieved so far and look forward to the challenges and opportunities the next decade will bring. A special emphasis of the 2012 conference is on advances in the application of technology in a particular discipline. 
Accordingly, significant advances in applications science and technology will be considered as important as the development of new technologies themselves. Further, we welcome contributions in educational activities under any of these disciplines. As a result, the conference will be structured around two e-Science tracks: - eScience Algorithms and Applications - eScience application areas, including: - Physical sciences - Biomedical sciences - Social sciences and humanities - Data-oriented approaches and applications - Compute-oriented approaches and applications - Extreme scale approaches and applications - Cyberinfrastructure to support eScience - Novel hardware - Novel uses of production infrastructure - Software and services - Tools The conference proceedings will be published by the IEEE Computer Society Press, USA and will be made available online through the IEEE Digital Library. Selected papers will be invited to submit extended versions to a special issue of the Future Generation Computer Systems (FGCS) journal. SUBMISSION PROCESS Authors are invited to submit papers with unpublished, original work of not more than 8 pages of double column text using single spaced 10 point size on 8.5 x 11 inch pages, as per IEEE 8.5 x 11 manuscript guidelines. (Up to 2 additional pages may be purchased for US$150/page) Templates are available from http://www.ieee.org/conferences_events/conferences/publishing/templates.html. Authors should submit a PDF file that will print on a PostScript printer to https://www.easychair.org/conferences/?conf=escience2012 (Note that paper submitters also must submit an abstract in advance of the paper deadline. This should be done through the same site where papers are submitted.) It is a requirement that at least one author of each accepted paper attend the conference.
IMPORTANT DATES Abstract submission (required): 4 July 2012 Paper submission: 11 July 2012 Paper author notification: 22 August 2012 Camera-ready papers due: 10 September 2012 Conference: 8-12 October 2012 CONFERENCE ORGANIZATION General Chair - Ian Foster, University of Chicago & Argonne National Laboratory, USA Program Co-Chairs - Daniel S. Katz, University of Chicago & Argonne National Laboratory, USA - Heinz Stockinger, SIB Swiss Institute of Bioinformatics, Switzerland Program Vice Co-Chairs - eScience Algorithms and Applications Track - David Abramson, Monash University, Australia - Gabrielle Allen, Louisiana State University, USA - Cyberinfrastructure to support eScience Track - Rosa M. Badia, Barcelona Supercomputing Center / CSIC, Spain - Geoffrey Fox, Indiana University, USA Early Results and Works-in-Progress Posters Chair - Roger Barga, Microsoft, USA Workshops Chair - Ruth Pordes, FNAL, USA Sponsorship Chair - Charlie Catlett, Argonne National Laboratory, USA Conference Manager and Finance Chair - Julie Wulf-Knoerzer, University of Chicago & Argonne National Laboratory, USA Publicity Chairs - Kento Aida, National Institute of Informatics, Japan - Ioan Raicu, Illinois Institute of Technology, USA - David Wallom, Oxford e-Research Centre, UK Local Organizing Committee - Ninfa Mayorga, University of Chicago, USA - Evelyn Rayburn, University of Chicago, USA - Lynn Valentini, Argonne National Laboratory, USA Program Committee - eScience Algorithms and Applications Track - Srinivas Aluru, Iowa State University, USA - Ashiq Anjum, University of Derby, UK - David A. Bader, Georgia Institute of Technology, USA - Jon Blower, University of Reading, UK - Paul Bonnington, Monash University, Australia - Simon Cox, University of Southampton, UK - David De Roure, Oxford e-Research Centre, UK - George Djorgovski, California Institute of Technology, USA - Anshu Dubey, University of Chicago & Argonne National Laboratory, USA - Yuri Estrin, Monash University, Australia -
Dan Fay, Microsoft, USA - Jeremy Frey, University of Southampton, UK - Wolfgang Gentzsch, HPC Consultant, Germany - Lutz Gross, The University of Queensland, Australia - Sverker Holmgren, Uppsala University, Sweden - Bill Howe, University of Washington, USA - Marina Jirotka, University of Oxford, UK - Timoleon Kipouros, University of Cambridge, UK - Kerstin Kleese van Dam, Pacific Northwest National Laboratory, USA - Arun S. Konagurthu, Monash University, Australia - Peter Kunszt, SystemsX.ch, Switzerland - Alexey Lastovetsky, University College Dublin, Ireland - Andrew Lewis, Griffith University, Australia - Sergio Maffioletti, University of Zurich, Switzerland - Amitava Majumdar, San Diego Supercomputer Center, University of California at San Diego, USA - Rui Mao, Shenzhen University, China - Madhav V. Marathe, Virginia Tech, USA - Maryann Martone, University of California at San Diego, USA - Louis Moresi, Monash University, Australia - Riccardo Murri, University of Zurich, Switzerland - Silvia D. Olabarriaga, Academic Medical Center of the University of Amsterdam, Netherlands - Enrique S. Quintana-Ortí, Universidad Jaume I, Spain - Abani Patra, University at Buffalo, USA - Rob Pennington, NSF, USA - Andrew Perry, Monash University, Australia - Beth Plale, Indiana University, USA - Michael Resch, University of Stuttgart, Germany - Adrian Sandu, Virginia Tech, USA - Mark Savill, Cranfield University, UK - Erik Schnetter, Perimeter Institute for Theoretical Physics, Canada - Edward Seidel, Louisiana State University, USA - Suzanne M. Shontz, The Pennsylvania State University, USA - David Skinner, Lawrence Berkeley National Laboratory, USA - Alan Sussman, University of Maryland, USA - Alex Szalay, Johns Hopkins University, USA - Domenico Talia, ICAR-CNR & University of Calabria, Italy - Jian Tao, Louisiana State University, USA - David Wallom, Oxford e-Research Centre, UK - Shaowen Wang, University of Illinois at Urbana-Champaign, USA -
Michael Wilde, Argonne National Laboratory & University of Chicago, USA - Nancy Wilkins-Diehr, San Diego Supercomputer Center, University of California at San Diego, USA - Wu Zhang, Shanghai University, China - Yunquan Zhang, Chinese Academy of Sciences, China - Cyberinfrastructure to support eScience Track - Deb Agarwal, Lawrence Berkeley National Laboratory, USA - Ilkay Altintas, San Diego Supercomputer Center, University of California at San Diego, USA - Henri Bal, Vrije Universiteit, Netherlands - Roger Barga, Microsoft, USA - Martin Berzins, University of Utah, USA - John Brooke, University of Manchester, UK - Thomas Fahringer, University of Innsbruck, Austria - Gilles Fedak, INRIA, France - José A. B. Fortes, University of Florida, USA - Yolanda Gil, ISI/USC, USA - Madhusudhan Govindaraju, SUNY Binghamton, USA - Thomas Hacker, Purdue University, USA - Ken Hawick, Massey University, New Zealand - Marty Humphrey, University of Virginia, USA - Hai Jin, Huazhong University of Science and Technology, China - Thilo Kielmann, Vrije Universiteit, Netherlands - Scott Klasky, Oak Ridge National Laboratory, USA - Isao Kojima, AIST, Japan - Tevfik Kosar, University at Buffalo, USA - Dieter Kranzlmueller, LMU & LRZ Munich, Germany - Erwin Laure, KTH, Sweden - Jysoo Lee, KISTI, Korea - Li Xiaoming, Peking University, China - Bertram Ludäscher, University of California, Davis, USA - Andrew Lumsdaine, Indiana University, USA - Tanu Malik, University of Chicago, USA - Satoshi Matsuoka, Tokyo Institute of Technology, Japan - Reagan Moore, University of North Carolina at Chapel Hill, USA - Shirley Moore, University of Kentucky, USA - Steven Newhouse, EGI, Netherlands - Dhabaleswar K. (DK) Panda, The Ohio State University, USA - Manish Parashar, Rutgers University, USA - Ron Perrott, University of Oxford, UK - Depei Qian, Beihang University, China - Judy Qiu, Indiana University, USA - Ioan Raicu, Illinois Institute of Technology, USA -
Lavanya Ramakrishnan, Lawrence Berkeley National Laboratory, USA - Omer Rana, Cardiff University, UK - Paul Roe, Queensland University of Technology, Australia - Bruno Schulze, LNCC, Brazil - Marc Snir, Argonne National Laboratory & University of Illinois at Urbana-Champaign, USA - Xian-He Sun, Illinois Institute of Technology, USA - Yoshio Tanaka, AIST, Japan - Michela Taufer, University of Delaware, USA - Kerry Taylor, CSIRO, Australia - Douglas Thain, University of Notre Dame, USA - Paul Watson, Newcastle University, UK - Jun Zhao, University of Oxford, UK -- ================================================================= Ioan Raicu, Ph.D. Assistant Professor, Illinois Institute of Technology (IIT) Guest Research Faculty, Argonne National Laboratory (ANL) ================================================================= Data-Intensive Distributed Systems Laboratory, CS/IIT Distributed Systems Laboratory, MCS/ANL ================================================================= Cel: 1-847-722-0876 Office: 1-312-567-5704 Email: iraicu at cs.iit.edu Web: http://www.cs.iit.edu/~iraicu/ Web: http://datasys.cs.iit.edu/ ================================================================= ================================================================= From iraicu at cs.iit.edu Tue Dec 20 18:28:08 2011 From: iraicu at cs.iit.edu (Ioan Raicu) Date: Tue, 20 Dec 2011 18:28:08 -0600 Subject: [Swift-user] Call for Workshops at IEEE eScience 2012 Message-ID: <4EF12818.4000204@cs.iit.edu> CALL FOR WORKSHOPS 8th IEEE International Conference on eScience http://www.ci.uchicago.edu/escience2012/ October 8-12, 2012 Chicago, IL, USA The 8th IEEE eScience conference (e-Science 2012), sponsored by the IEEE Computer Society's Technical Committee for Scalable Computing (TCSC), will be held in Chicago, Illinois, from 8th to 12th October 2012.
The eScience 2012 conference is designed to bring together leading international and interdisciplinary research communities, developers, and users of eScience applications and enabling IT technologies. Multiple e-Science 2012 Workshops will be held on Monday and Tuesday, 8th and 9th October, co-located with the main conference. Workshops are an important part of the conference, providing an opportunity for researchers to present their work in a more focused way than the conference itself allows and to discuss particular topics of interest to the community. We cordially invite you to submit workshop proposals on any eScience related topic to the Workshop Chair. To help those interested know their purpose and scope, workshop proposals should include: - A description of the workshop, its focus, goals, and outcome - A draft call for papers - Names and affiliations of the organizers and tentative composition of the committees - Expected numbers of submissions and accepted papers - Prior history of this workshop, if any. Please include: number of submissions, number of accepted papers, and attendee count. Workshop organizers are responsible for establishing a program committee, collecting and evaluating submissions, notifying authors of acceptance or rejection in due time, ensuring a transparent and fair selection process, organizing selected papers into sessions, and assigning session chairs. Proposals will be selected that show clear focus and objectives in areas of emerging or developing interest guaranteed to generate significant interest in the community. Once accepted, the workshop should establish its own paper submission system. For each paper selected for publication, an author must be registered for eScience 2012. Each paper must be presented in person by at least one of the authors.
It is expected that the proceedings of the eScience 2012 workshops will be published by the IEEE Computer Society Press, USA and will be made available online through the IEEE Digital Library. SUBMISSION PROCESS Workshop proposals should be emailed to escience2012-workshops at fnal.gov IMPORTANT DATES Workshop submissions due: 23rd January 2012 Notification of workshop acceptance: 6th February 2012 While it is up to the workshop organizers to work with the authors of any papers to be published from the workshop presenters, it should be noted that information about these will be needed by 27th August 2012 and final camera ready papers are needed by 17th September 2012. Workshops: 8-9 October 2012 -- ================================================================= Ioan Raicu, Ph.D. Assistant Professor, Illinois Institute of Technology (IIT) Guest Research Faculty, Argonne National Laboratory (ANL) ================================================================= Data-Intensive Distributed Systems Laboratory, CS/IIT Distributed Systems Laboratory, MCS/ANL ================================================================= Cel: 1-847-722-0876 Office: 1-312-567-5704 Email: iraicu at cs.iit.edu Web: http://www.cs.iit.edu/~iraicu/ Web: http://datasys.cs.iit.edu/ ================================================================= ================================================================= From davidk at ci.uchicago.edu Wed Dec 21 08:55:18 2011 From: davidk at ci.uchicago.edu (David Kelly) Date: Wed, 21 Dec 2011 08:55:18 -0600 (CST) Subject: [Swift-user] [Swift-devel] swift on ranger In-Reply-To: Message-ID: <1860279107.41299.1324479318780.JavaMail.root@zimbra-mb2.anl.gov> Sarah, Could you please send the submit files that were generated from this run? That should help narrow it down a bit.
Thanks, David ----- Original Message ----- > From: "Sarah Kenny" > To: "Swift Devel" , "Swift User" > Sent: Wednesday, December 21, 2011 6:57:32 AM > Subject: [Swift-devel] swift on ranger > getting this when submitting to ranger with both the latest and our > previous version of swift (swift-r5259 cog-r3313) > > Final status: time: Wed, 21 Dec 2011 04:49:15 -0800 Finished > successfully:100 > The following warnings have occurred: > 1. > org.globus.cog.abstraction.impl.common.task.TaskSubmissionException: > Cannot submit job: Could not submit job (qsub reported an exit code of > 1). > -------------------------------------------------------------------------- > Welcome to TACC's Ranger System, an NSF XD Resource > ----------------------------------------------------------------------------> > Checking that you specified -V...--> Checking that you specified a > time limit...--> Checking that you specified a queue...--> Setting > project...--> Checking that you specified a parallel environment...--> > Checking that you specified a valid parallel environment name...--> > Checking that the minimum and maximum PE counts are the same...--> > Checking that the number of PEs requested is > valid...------------------> Rejecting job <------------------Your slot > (or core) request is not a multiple of 16.Syntax: -pe > where is a multiple of > 16.----------------------------------------------------- > Unable to run job: JSV rejected job.Exiting. 
> > at > org.globus.cog.abstraction.impl.scheduler.common.AbstractJobSubmissionTaskHandler.submit(AbstractJobSubmissionTaskHandler.java:63) > at > org.globus.cog.abstraction.impl.common.AbstractTaskHandler.submit(AbstractTaskHandler.java:45) > at > org.globus.cog.abstraction.impl.common.task.ExecutionTaskHandler.submit(ExecutionTaskHandler.java:57) > at > org.globus.cog.abstraction.coaster.service.job.manager.LocalQueueProcessor.run(LocalQueueProcessor.java:40) > Caused by: > org.globus.cog.abstraction.impl.scheduler.common.ProcessException: > Could not submit job (qsub reported an exit code of 1). > -------------------------------------------------------------------------- > Welcome to TACC's Ranger System, an NSF XD Resource > ----------------------------------------------------------------------------> > Checking that you specified -V...--> Checking that you specified a > time limit...--> Checking that you specified a queue...--> Setting > project...--> Checking that you specified a parallel environment...--> > Checking that you specified a valid parallel environment name...--> > Checking that the minimum and maximum PE counts are the same...--> > Checking that the number of PEs requested is > valid...------------------> Rejecting job <------------------Your slot > (or core) request is not a multiple of 16.Syntax: -pe > where is a multiple of > 16.----------------------------------------------------- > Unable to run job: JSV rejected job.Exiting. > > at > org.globus.cog.abstraction.impl.scheduler.common.AbstractExecutor.start(AbstractExecutor.java:108) > at > org.globus.cog.abstraction.impl.scheduler.common.AbstractJobSubmissionTaskHandler.submit(AbstractJobSubmissionTaskHandler.java:53) > ... 
3 more > > ################### sites file > > > > > > 86400 > 02:00:00 > 1 > 64 > 4096 > normal > 1.28 > TG-DBS080004N > 16way > 10000 > /work/00043/tg457040/swiftwork > > > > same settings we've been using for a while, i'm not sure why this > seems to be popping up now, but it's rather consistent. all jobs are > finishing successfully, so it's rather confusing...any idea what i > might be missing here? > > thanks > ~sk > > > > > > _______________________________________________ > Swift-devel mailing list > Swift-devel at ci.uchicago.edu > https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-devel From ketancmaheshwari at gmail.com Wed Dec 21 09:58:59 2011 From: ketancmaheshwari at gmail.com (Ketan Maheshwari) Date: Wed, 21 Dec 2011 09:58:59 -0600 Subject: [Swift-user] [Swift-devel] swift on ranger In-Reply-To: References: Message-ID: Sarah, I checked my sites.xml. The only difference between yours and mine is the value of jobspernode, which is 16 in my case. I have also set this value to other multiples of 16, which has worked fine for me. On Wed, Dec 21, 2011 at 6:57 AM, Sarah Kenny wrote: > getting this when submitting to ranger with both the latest and our > previous version of swift (swift-r5259 cog-r3313) > > Final status: time: Wed, 21 Dec 2011 04:49:15 -0800 Finished > successfully:100 > The following warnings have occurred: > 1. org.globus.cog.abstraction.impl.common.task.TaskSubmissionException: > Cannot submit job: Could not submit job (qsub reported an exit code of 1).
> -------------------------------------------------------------------------- > Welcome to TACC's Ranger System, an NSF XD Resource > ----------------------------------------------------------------------------> > Checking that you specified -V...--> Checking that you specified a time > limit...--> Checking that you specified a queue...--> Setting project...--> > Checking that you specified a parallel environment...--> Checking that you > specified a valid parallel environment name...--> Checking that the minimum > and maximum PE counts are the same...--> Checking that the number of PEs > requested is valid...------------------> Rejecting job > <------------------Your slot (or core) request is not a multiple of > 16.Syntax: -pe where is a multiple of > 16.----------------------------------------------------- > Unable to run job: JSV rejected job.Exiting. > > at > org.globus.cog.abstraction.impl.scheduler.common.AbstractJobSubmissionTaskHandler.submit(AbstractJobSubmissionTaskHandler.java:63) > at > org.globus.cog.abstraction.impl.common.AbstractTaskHandler.submit(AbstractTaskHandler.java:45) > at > org.globus.cog.abstraction.impl.common.task.ExecutionTaskHandler.submit(ExecutionTaskHandler.java:57) > at > org.globus.cog.abstraction.coaster.service.job.manager.LocalQueueProcessor.run(LocalQueueProcessor.java:40) > Caused by: > org.globus.cog.abstraction.impl.scheduler.common.ProcessException: Could > not submit job (qsub reported an exit code of 1). 
> -------------------------------------------------------------------------- > Welcome to TACC's Ranger System, an NSF XD Resource > ----------------------------------------------------------------------------> > Checking that you specified -V...--> Checking that you specified a time > limit...--> Checking that you specified a queue...--> Setting project...--> > Checking that you specified a parallel environment...--> Checking that you > specified a valid parallel environment name...--> Checking that the minimum > and maximum PE counts are the same...--> Checking that the number of PEs > requested is valid...------------------> Rejecting job > <------------------Your slot (or core) request is not a multiple of > 16.Syntax: -pe where is a multiple of > 16.----------------------------------------------------- > Unable to run job: JSV rejected job.Exiting. > > at > org.globus.cog.abstraction.impl.scheduler.common.AbstractExecutor.start(AbstractExecutor.java:108) > at > org.globus.cog.abstraction.impl.scheduler.common.AbstractJobSubmissionTaskHandler.submit(AbstractJobSubmissionTaskHandler.java:53) > ... 3 more > > ################### sites file > > > > > > 86400 > 02:00:00 > 1 > 64 > 4096 > normal > 1.28 > TG-DBS080004N > 16way > 10000 > /work/00043/tg457040/swiftwork > > > > same settings we've been using for a while, i'm not sure why this seems to > be popping up now, but it's rather consistent. all jobs are finishing > successfully, so it's rather confusing...any idea what i might be missing > here? > > thanks > ~sk > > > > > > _______________________________________________ > Swift-devel mailing list > Swift-devel at ci.uchicago.edu > https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-devel > > -- Ketan -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From jonmon at mcs.anl.gov Wed Dec 21 10:04:48 2011 From: jonmon at mcs.anl.gov (Jonathan Monette) Date: Wed, 21 Dec 2011 10:04:48 -0600 Subject: [Swift-user] [Swift-devel] swift on ranger In-Reply-To: References: Message-ID: Also, shouldn't node granularity be set to 16 on ranger and not 64? On Dec 21, 2011, at 9:58 AM, Ketan Maheshwari wrote: > Sarah, > > I checked my sites.xml. The only difference between yours and mine is the value of jobspernode, which is 16 in my case. I have also set this value to other multiples of 16, which has worked fine for me. > > > On Wed, Dec 21, 2011 at 6:57 AM, Sarah Kenny wrote: > getting this when submitting to ranger with both the latest and our previous version of swift (swift-r5259 cog-r3313) > > Final status: time: Wed, 21 Dec 2011 04:49:15 -0800 Finished successfully:100 > The following warnings have occurred: > 1. org.globus.cog.abstraction.impl.common.task.TaskSubmissionException: Cannot submit job: Could not submit job (qsub reported an exit code of 1). -------------------------------------------------------------------------- Welcome to TACC's Ranger System, an NSF XD Resource ----------------------------------------------------------------------------> Checking that you specified -V...--> Checking that you specified a time limit...--> Checking that you specified a queue...--> Setting project...--> Checking that you specified a parallel environment...--> Checking that you specified a valid parallel environment name...--> Checking that the minimum and maximum PE counts are the same...--> Checking that the number of PEs requested is valid...------------------> Rejecting job <------------------Your slot (or core) request is not a multiple of 16.Syntax: -pe where is a multiple of 16.----------------------------------------------------- > Unable to run job: JSV rejected job.Exiting.
> > at org.globus.cog.abstraction.impl.scheduler.common.AbstractJobSubmissionTaskHandler.submit(AbstractJobSubmissionTaskHandler.java:63) > at org.globus.cog.abstraction.impl.common.AbstractTaskHandler.submit(AbstractTaskHandler.java:45) > at org.globus.cog.abstraction.impl.common.task.ExecutionTaskHandler.submit(ExecutionTaskHandler.java:57) > at org.globus.cog.abstraction.coaster.service.job.manager.LocalQueueProcessor.run(LocalQueueProcessor.java:40) > Caused by: org.globus.cog.abstraction.impl.scheduler.common.ProcessException: Could not submit job (qsub reported an exit code of 1). -------------------------------------------------------------------------- Welcome to TACC's Ranger System, an NSF XD Resource ----------------------------------------------------------------------------> Checking that you specified -V...--> Checking that you specified a time limit...--> Checking that you specified a queue...--> Setting project...--> Checking that you specified a parallel environment...--> Checking that you specified a valid parallel environment name...--> Checking that the minimum and maximum PE counts are the same...--> Checking that the number of PEs requested is valid...------------------> Rejecting job <------------------Your slot (or core) request is not a multiple of 16.Syntax: -pe where is a multiple of 16.----------------------------------------------------- > Unable to run job: JSV rejected job.Exiting. > > at org.globus.cog.abstraction.impl.scheduler.common.AbstractExecutor.start(AbstractExecutor.java:108) > at org.globus.cog.abstraction.impl.scheduler.common.AbstractJobSubmissionTaskHandler.submit(AbstractJobSubmissionTaskHandler.java:53) > ... 3 more > > ################### sites file > > > > > > 86400 > 02:00:00 > 1 > 64 > 4096 > normal > 1.28 > TG-DBS080004N > 16way > 10000 > /work/00043/tg457040/swiftwork > > > > same settings we've been using for a while, i'm not sure why this seems to be popping up now, but it's rather consistent. 
all jobs are finishing successfully, so it's rather confusing...any idea what i might be missing here? > > thanks > ~sk > > > > > > _______________________________________________ > Swift-devel mailing list > Swift-devel at ci.uchicago.edu > https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-devel > > > > > -- > Ketan > > > _______________________________________________ > Swift-user mailing list > Swift-user at ci.uchicago.edu > https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-user

From wilde at mcs.anl.gov Wed Dec 21 10:20:12 2011
From: wilde at mcs.anl.gov (Michael Wilde)
Date: Wed, 21 Dec 2011 10:20:12 -0600 (CST)
Subject: [Swift-user] [Swift-devel] swift on ranger
In-Reply-To:
Message-ID: <1932609068.87907.1324484412645.JavaMail.root@zimbra.anl.gov>

Node granularity is the increment, in nodes, for the number of nodes requested in each coaster block; each block's node count is a multiple of it. So it can be anything that the user wants, as long as it's valid for the local scheduler.

We recently discussed the need to improve and clarify the user guide documentation on how to specify node request parameters for the coaster provider. I'm going to file this as a ticket now for 0.94.

- Mike

----- Original Message -----
> From: "Jonathan Monette"
> To: "Ketan Maheshwari"
> Cc: "Sarah Kenny" , "Swift Devel" , "Swift User"
> Sent: Wednesday, December 21, 2011 10:04:48 AM
> Subject: Re: [Swift-devel] [Swift-user] swift on ranger
> Also, shouldn't node granularity be set to 16 on ranger an not 64?
> > > > > On Dec 21, 2011, at 9:58 AM, Ketan Maheshwari < > ketancmaheshwari at gmail.com > wrote: > > > > > > Sarah, > > I checked my sites.xml. The only difference between yours and mine > being the value of jobspernode which is 16 in my case. I have had this > value in other multiples of 16 which has worked fine for me.
> > thanks > ~sk > > > > > > _______________________________________________ > Swift-devel mailing list > Swift-devel at ci.uchicago.edu > https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-devel > > > > > -- > Ketan > > > > > > _______________________________________________ > Swift-user mailing list > Swift-user at ci.uchicago.edu > https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-user > _______________________________________________ > Swift-devel mailing list > Swift-devel at ci.uchicago.edu > https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-devel -- Michael Wilde Computation Institute, University of Chicago Mathematics and Computer Science Division Argonne National Laboratory From skenny at uci.edu Wed Dec 21 06:57:32 2011 From: skenny at uci.edu (Sarah Kenny) Date: Wed, 21 Dec 2011 04:57:32 -0800 Subject: [Swift-user] swift on ranger Message-ID: getting this when submitting to ranger with both the latest and our previous version of swift (swift-r5259 cog-r3313) Final status: time: Wed, 21 Dec 2011 04:49:15 -0800 Finished successfully:100 The following warnings have occurred: 1. org.globus.cog.abstraction.impl.common.task.TaskSubmissionException: Cannot submit job: Could not submit job (qsub reported an exit code of 1). 
--------------------------------------------------------------------------
Welcome to TACC's Ranger System, an NSF XD Resource
----------------------------------------------------------------------------> Checking that you specified -V...
--> Checking that you specified a time limit...
--> Checking that you specified a queue...
--> Setting project...
--> Checking that you specified a parallel environment...
--> Checking that you specified a valid parallel environment name...
--> Checking that the minimum and maximum PE counts are the same...
--> Checking that the number of PEs requested is valid...
------------------> Rejecting job <------------------
Your slot (or core) request is not a multiple of 16.
Syntax: -pe where is a multiple of 16.
-----------------------------------------------------
Unable to run job: JSV rejected job.
Exiting.

    at org.globus.cog.abstraction.impl.scheduler.common.AbstractJobSubmissionTaskHandler.submit(AbstractJobSubmissionTaskHandler.java:63)
    at org.globus.cog.abstraction.impl.common.AbstractTaskHandler.submit(AbstractTaskHandler.java:45)
    at org.globus.cog.abstraction.impl.common.task.ExecutionTaskHandler.submit(ExecutionTaskHandler.java:57)
    at org.globus.cog.abstraction.coaster.service.job.manager.LocalQueueProcessor.run(LocalQueueProcessor.java:40)
Caused by: org.globus.cog.abstraction.impl.scheduler.common.ProcessException: Could not submit job (qsub reported an exit code of 1).
--------------------------------------------------------------------------
Welcome to TACC's Ranger System, an NSF XD Resource
----------------------------------------------------------------------------> Checking that you specified -V...
--> Checking that you specified a time limit...
--> Checking that you specified a queue...
--> Setting project...
--> Checking that you specified a parallel environment...
--> Checking that you specified a valid parallel environment name...
--> Checking that the minimum and maximum PE counts are the same...
--> Checking that the number of PEs requested is valid...
------------------> Rejecting job <------------------
Your slot (or core) request is not a multiple of 16.
Syntax: -pe where is a multiple of 16.
-----------------------------------------------------
Unable to run job: JSV rejected job.
Exiting.

    at org.globus.cog.abstraction.impl.scheduler.common.AbstractExecutor.start(AbstractExecutor.java:108)
    at org.globus.cog.abstraction.impl.scheduler.common.AbstractJobSubmissionTaskHandler.submit(AbstractJobSubmissionTaskHandler.java:53)
    ... 3 more

################### sites file

86400
02:00:00
1
64
4096
normal
1.28
TG-DBS080004N
16way
10000
/work/00043/tg457040/swiftwork

same settings we've been using for a while, i'm not sure why this seems to be popping up now, but it's rather consistent. all jobs are finishing successfully, so it's rather confusing...any idea what i might be missing here?

thanks
~sk

From skenny at uci.edu Wed Dec 21 16:50:48 2011
From: skenny at uci.edu (Sarah Kenny)
Date: Wed, 21 Dec 2011 14:50:48 -0800
Subject: [Swift-user] [Swift-devel] swift on ranger
In-Reply-To:
References:
Message-ID:

yr right ketan, if i change jobsPerNode to 16 the warning message goes away. however, there are times i don't want to run 16 jobs per node...e.g. bcs a single job needs all the available memory so even though the node has 16 processors i can't actually use them all.
so perhaps this is just a scheduling issue with ranger/sge in that they don't want you to submit a job that's going to leave processors idle? that seems a bit restrictive though... On Wed, Dec 21, 2011 at 7:58 AM, Ketan Maheshwari < ketancmaheshwari at gmail.com> wrote: > Sarah, > > I checked my sites.xml. The only difference between yours and mine being > the value of jobspernode which is 16 in my case. I have had this value in > other multiples of 16 which has worked fine for me. > > > On Wed, Dec 21, 2011 at 6:57 AM, Sarah Kenny wrote: > >> getting this when submitting to ranger with both the latest and our >> previous version of swift (swift-r5259 cog-r3313) >> >> Final status: time: Wed, 21 Dec 2011 04:49:15 -0800 Finished >> successfully:100 >> The following warnings have occurred: >> 1. org.globus.cog.abstraction.impl.common.task.TaskSubmissionException: >> Cannot submit job: Could not submit job (qsub reported an exit code of 1). >> -------------------------------------------------------------------------- >> Welcome to TACC's Ranger System, an NSF XD Resource >> ----------------------------------------------------------------------------> >> Checking that you specified -V...--> Checking that you specified a time >> limit...--> Checking that you specified a queue...--> Setting project...--> >> Checking that you specified a parallel environment...--> Checking that you >> specified a valid parallel environment name...--> Checking that the minimum >> and maximum PE counts are the same...--> Checking that the number of PEs >> requested is valid...------------------> Rejecting job >> <------------------Your slot (or core) request is not a multiple of >> 16.Syntax: -pe where is a multiple of >> 16.----------------------------------------------------- >> Unable to run job: JSV rejected job.Exiting. 
>> >> at >> org.globus.cog.abstraction.impl.scheduler.common.AbstractJobSubmissionTaskHandler.submit(AbstractJobSubmissionTaskHandler.java:63) >> at >> org.globus.cog.abstraction.impl.common.AbstractTaskHandler.submit(AbstractTaskHandler.java:45) >> at >> org.globus.cog.abstraction.impl.common.task.ExecutionTaskHandler.submit(ExecutionTaskHandler.java:57) >> at >> org.globus.cog.abstraction.coaster.service.job.manager.LocalQueueProcessor.run(LocalQueueProcessor.java:40) >> Caused by: >> org.globus.cog.abstraction.impl.scheduler.common.ProcessException: Could >> not submit job (qsub reported an exit code of 1). >> -------------------------------------------------------------------------- >> Welcome to TACC's Ranger System, an NSF XD Resource >> ----------------------------------------------------------------------------> >> Checking that you specified -V...--> Checking that you specified a time >> limit...--> Checking that you specified a queue...--> Setting project...--> >> Checking that you specified a parallel environment...--> Checking that you >> specified a valid parallel environment name...--> Checking that the minimum >> and maximum PE counts are the same...--> Checking that the number of PEs >> requested is valid...------------------> Rejecting job >> <------------------Your slot (or core) request is not a multiple of >> 16.Syntax: -pe where is a multiple of >> 16.----------------------------------------------------- >> Unable to run job: JSV rejected job.Exiting. >> >> at >> org.globus.cog.abstraction.impl.scheduler.common.AbstractExecutor.start(AbstractExecutor.java:108) >> at >> org.globus.cog.abstraction.impl.scheduler.common.AbstractJobSubmissionTaskHandler.submit(AbstractJobSubmissionTaskHandler.java:53) >> ... 
3 more >> >> ################### sites file >> >> >> >> >> >> 86400 >> 02:00:00 >> 1 >> 64 >> 4096 >> normal >> 1.28 >> TG-DBS080004N >> 16way >> 10000 >> /work/00043/tg457040/swiftwork >> >> >> >> same settings we've been using for a while, i'm not sure why this seems >> to be popping up now, but it's rather consistent. all jobs are finishing >> successfully, so it's rather confusing...any idea what i might be missing >> here? >> >> thanks >> ~sk >> >> >> >> >> >> _______________________________________________ >> Swift-devel mailing list >> Swift-devel at ci.uchicago.edu >> https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-devel >> >> > > > -- > Ketan > > > > _______________________________________________ > Swift-user mailing list > Swift-user at ci.uchicago.edu > https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-user > -- Sarah Kenny Programmer ~ Brain Circuits Laboratory ~ Rm 2224 Bio Sci III University of California Irvine, Dept. of Neurology ~ 773-818-8300 -------------- next part -------------- An HTML attachment was scrubbed... URL: From davidk at ci.uchicago.edu Wed Dec 21 17:41:00 2011 From: davidk at ci.uchicago.edu (David Kelly) Date: Wed, 21 Dec 2011 17:41:00 -0600 (CST) Subject: [Swift-user] [Swift-devel] swift on ranger In-Reply-To: Message-ID: <773619107.42639.1324510860877.JavaMail.root@zimbra-mb2.anl.gov> I am able to reproduce this problem.. hoping to have a fix soon. David ----- Original Message ----- > From: "Sarah Kenny" > To: "Ketan Maheshwari" > Cc: "Swift Devel" , "Swift User" > Sent: Wednesday, December 21, 2011 4:50:48 PM > Subject: Re: [Swift-devel] [Swift-user] swift on ranger > yr right ketan, if i change it to: key="jobsPerNode">16 the warning message goes away. however, > there are times i don't want to run 16 jobs per node...e.g. bcs a > single job needs all the available memory so even though the node has > 16 processors i can't actually use them all. 
so perhaps this is just a > scheduling issue with ranger/sge in that they don't want you to submit > a job that's going to leave processors idle? that seems a bit > restrictive though... > > > On Wed, Dec 21, 2011 at 7:58 AM, Ketan Maheshwari < > ketancmaheshwari at gmail.com > wrote: > > > Sarah, > > I checked my sites.xml. The only difference between yours and mine > being the value of jobspernode which is 16 in my case. I have had this > value in other multiples of 16 which has worked fine for me. > > > > > > > On Wed, Dec 21, 2011 at 6:57 AM, Sarah Kenny < skenny at uci.edu > wrote: > > > > > > getting this when submitting to ranger with both the latest and our > previous version of swift (swift-r5259 cog-r3313) > > Final status: time: Wed, 21 Dec 2011 04:49:15 -0800 Finished > successfully:100 > The following warnings have occurred: > 1. > org.globus.cog.abstraction.impl.common.task.TaskSubmissionException: > Cannot submit job: Could not submit job (qsub reported an exit code of > 1). > -------------------------------------------------------------------------- > Welcome to TACC's Ranger System, an NSF XD Resource > ----------------------------------------------------------------------------> > Checking that you specified -V...--> Checking that you specified a > time limit...--> Checking that you specified a queue...--> Setting > project...--> Checking that you specified a parallel environment...--> > Checking that you specified a valid parallel environment name...--> > Checking that the minimum and maximum PE counts are the same...--> > Checking that the number of PEs requested is > valid...------------------> Rejecting job <------------------Your slot > (or core) request is not a multiple of 16.Syntax: -pe > where is a multiple of > 16.----------------------------------------------------- > Unable to run job: JSV rejected job.Exiting. 
> > at > org.globus.cog.abstraction.impl.scheduler.common.AbstractJobSubmissionTaskHandler.submit(AbstractJobSubmissionTaskHandler.java:63) > at > org.globus.cog.abstraction.impl.common.AbstractTaskHandler.submit(AbstractTaskHandler.java:45) > at > org.globus.cog.abstraction.impl.common.task.ExecutionTaskHandler.submit(ExecutionTaskHandler.java:57) > at > org.globus.cog.abstraction.coaster.service.job.manager.LocalQueueProcessor.run(LocalQueueProcessor.java:40) > Caused by: > org.globus.cog.abstraction.impl.scheduler.common.ProcessException: > Could not submit job (qsub reported an exit code of 1). > -------------------------------------------------------------------------- > Welcome to TACC's Ranger System, an NSF XD Resource > ----------------------------------------------------------------------------> > Checking that you specified -V...--> Checking that you specified a > time limit...--> Checking that you specified a queue...--> Setting > project...--> Checking that you specified a parallel environment...--> > Checking that you specified a valid parallel environment name...--> > Checking that the minimum and maximum PE counts are the same...--> > Checking that the number of PEs requested is > valid...------------------> Rejecting job <------------------Your slot > (or core) request is not a multiple of 16.Syntax: -pe > where is a multiple of > 16.----------------------------------------------------- > Unable to run job: JSV rejected job.Exiting. > > at > org.globus.cog.abstraction.impl.scheduler.common.AbstractExecutor.start(AbstractExecutor.java:108) > at > org.globus.cog.abstraction.impl.scheduler.common.AbstractJobSubmissionTaskHandler.submit(AbstractJobSubmissionTaskHandler.java:53) > ... 
3 more > > ################### sites file > > > > > > 86400 > 02:00:00 > 1 > 64 > 4096 > normal > 1.28 > TG-DBS080004N > 16way > 10000 > /work/00043/tg457040/swiftwork > > > > same settings we've been using for a while, i'm not sure why this > seems to be popping up now, but it's rather consistent. all jobs are > finishing successfully, so it's rather confusing...any idea what i > might be missing here? > > thanks > ~sk > > > > > > > _______________________________________________ > Swift-devel mailing list > Swift-devel at ci.uchicago.edu > https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-devel > > > > > -- > Ketan > > > > _______________________________________________ > Swift-user mailing list > Swift-user at ci.uchicago.edu > https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-user > > > > -- > Sarah Kenny > Programmer ~ Brain Circuits Laboratory ~ Rm 2224 Bio Sci III > University of California Irvine, Dept. of Neurology ~ 773-818-8300 > > > _______________________________________________ > Swift-devel mailing list > Swift-devel at ci.uchicago.edu > https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-devel From hategan at mcs.anl.gov Wed Dec 21 17:53:53 2011 From: hategan at mcs.anl.gov (Mihael Hategan) Date: Wed, 21 Dec 2011 15:53:53 -0800 Subject: [Swift-user] [Swift-devel] swift on ranger In-Reply-To: References: Message-ID: <1324511633.20131.2.camel@blabla> On Wed, 2011-12-21 at 14:50 -0800, Sarah Kenny wrote: > yr right ketan, if i change it to: key="jobsPerNode">16 the warning message goes away. however, > there are times i don't want to run 16 jobs per node...e.g. bcs a > single job needs all the available memory so even though the node has > 16 processors i can't actually use them all. so perhaps this is just a > scheduling issue with ranger/sge in that they don't want you to submit > a job that's going to leave processors idle? that seems a bit > restrictive though... Hmmm. 
I think that one should be able to request 16 cores and run only two jobs at a time on them. Have you tried pe=2way? From davidk at ci.uchicago.edu Thu Dec 22 01:54:25 2011 From: davidk at ci.uchicago.edu (David Kelly) Date: Thu, 22 Dec 2011 01:54:25 -0600 (CST) Subject: [Swift-user] [Swift-devel] swift on ranger In-Reply-To: Message-ID: <2130988117.43047.1324540465638.JavaMail.root@zimbra-mb2.anl.gov> Sarah, Can you please give this another try? I believe it should work now with your original sites.xml. David ----- Original Message ----- > From: "Sarah Kenny" > To: "Ketan Maheshwari" > Cc: "Swift Devel" , "Swift User" > Sent: Wednesday, December 21, 2011 4:50:48 PM > Subject: Re: [Swift-devel] [Swift-user] swift on ranger > yr right ketan, if i change it to: key="jobsPerNode">16 the warning message goes away. however, > there are times i don't want to run 16 jobs per node...e.g. bcs a > single job needs all the available memory so even though the node has > 16 processors i can't actually use them all. so perhaps this is just a > scheduling issue with ranger/sge in that they don't want you to submit > a job that's going to leave processors idle? that seems a bit > restrictive though... > > > On Wed, Dec 21, 2011 at 7:58 AM, Ketan Maheshwari < > ketancmaheshwari at gmail.com > wrote: > > > Sarah, > > I checked my sites.xml. The only difference between yours and mine > being the value of jobspernode which is 16 in my case. I have had this > value in other multiples of 16 which has worked fine for me. > > > > > > > On Wed, Dec 21, 2011 at 6:57 AM, Sarah Kenny < skenny at uci.edu > wrote: > > > > > > getting this when submitting to ranger with both the latest and our > previous version of swift (swift-r5259 cog-r3313) > > Final status: time: Wed, 21 Dec 2011 04:49:15 -0800 Finished > successfully:100 > The following warnings have occurred: > 1. 
> org.globus.cog.abstraction.impl.common.task.TaskSubmissionException: > Cannot submit job: Could not submit job (qsub reported an exit code of > 1). > -------------------------------------------------------------------------- > Welcome to TACC's Ranger System, an NSF XD Resource > ----------------------------------------------------------------------------> > Checking that you specified -V...--> Checking that you specified a > time limit...--> Checking that you specified a queue...--> Setting > project...--> Checking that you specified a parallel environment...--> > Checking that you specified a valid parallel environment name...--> > Checking that the minimum and maximum PE counts are the same...--> > Checking that the number of PEs requested is > valid...------------------> Rejecting job <------------------Your slot > (or core) request is not a multiple of 16.Syntax: -pe > where is a multiple of > 16.----------------------------------------------------- > Unable to run job: JSV rejected job.Exiting. > > at > org.globus.cog.abstraction.impl.scheduler.common.AbstractJobSubmissionTaskHandler.submit(AbstractJobSubmissionTaskHandler.java:63) > at > org.globus.cog.abstraction.impl.common.AbstractTaskHandler.submit(AbstractTaskHandler.java:45) > at > org.globus.cog.abstraction.impl.common.task.ExecutionTaskHandler.submit(ExecutionTaskHandler.java:57) > at > org.globus.cog.abstraction.coaster.service.job.manager.LocalQueueProcessor.run(LocalQueueProcessor.java:40) > Caused by: > org.globus.cog.abstraction.impl.scheduler.common.ProcessException: > Could not submit job (qsub reported an exit code of 1). 
> -------------------------------------------------------------------------- > Welcome to TACC's Ranger System, an NSF XD Resource > ----------------------------------------------------------------------------> > Checking that you specified -V...--> Checking that you specified a > time limit...--> Checking that you specified a queue...--> Setting > project...--> Checking that you specified a parallel environment...--> > Checking that you specified a valid parallel environment name...--> > Checking that the minimum and maximum PE counts are the same...--> > Checking that the number of PEs requested is > valid...------------------> Rejecting job <------------------Your slot > (or core) request is not a multiple of 16.Syntax: -pe > where is a multiple of > 16.----------------------------------------------------- > Unable to run job: JSV rejected job.Exiting. > > at > org.globus.cog.abstraction.impl.scheduler.common.AbstractExecutor.start(AbstractExecutor.java:108) > at > org.globus.cog.abstraction.impl.scheduler.common.AbstractJobSubmissionTaskHandler.submit(AbstractJobSubmissionTaskHandler.java:53) > ... 3 more > > ################### sites file > > > > > > 86400 > 02:00:00 > 1 > 64 > 4096 > normal > 1.28 > TG-DBS080004N > 16way > 10000 > /work/00043/tg457040/swiftwork > > > > same settings we've been using for a while, i'm not sure why this > seems to be popping up now, but it's rather consistent. all jobs are > finishing successfully, so it's rather confusing...any idea what i > might be missing here? 
> > thanks > ~sk > > > > > > > _______________________________________________ > Swift-devel mailing list > Swift-devel at ci.uchicago.edu > https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-devel > > > > > -- > Ketan > > > > _______________________________________________ > Swift-user mailing list > Swift-user at ci.uchicago.edu > https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-user > > > > -- > Sarah Kenny > Programmer ~ Brain Circuits Laboratory ~ Rm 2224 Bio Sci III > University of California Irvine, Dept. of Neurology ~ 773-818-8300 > > > _______________________________________________ > Swift-devel mailing list > Swift-devel at ci.uchicago.edu > https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-devel

From wilde at mcs.anl.gov Thu Dec 22 05:28:32 2011
From: wilde at mcs.anl.gov (Michael Wilde)
Date: Thu, 22 Dec 2011 05:28:32 -0600 (CST)
Subject: [Swift-user] [Swift-devel] swift on ranger
In-Reply-To: <1324511633.20131.2.camel@blabla>
Message-ID: <1417017812.90783.1324553312587.JavaMail.root@zimbra.anl.gov>

The Ranger user guide:

http://www.tacc.utexas.edu/user-services/user-guides/ranger-user-guide

has a section (below) on using fewer than 16 cores per node. (I'm not sure if "15" below is a typo; I think 16 was intended.)

- Mike

Using fewer than 16 cores per node

When you want to use less than 16 MPI tasks per node, the choice of tasks per node is limited to the set of numbers {1, 2, 4, 8, 12, and 15}. When the number of tasks you need is equal to "Number of Tasks per Node x Number of Nodes", then use the following command:

    #$ -pe way

where is a number in the set {1, 2, 4, 8, 12, 15}.

If the Total number of Tasks that you need is less than "Number of Tasks per Node x Number of Nodes", then set the MY_NSLOTS environment variable to the Total number of Tasks needed.
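
The slot arithmetic in this guide section can be sketched in a few lines of Python. This is only an illustration, not anything taken from Swift or from TACC: it assumes 16 cores per node as on Ranger, that the slot count passed to -pe is always nodes x 16 (which matches the guide's own "8way 64" example for 4 nodes), and that 16 tasks per node is also acceptable alongside the set quoted from the guide. The function name ranger_pe_request is hypothetical.

```python
CORES_PER_NODE = 16  # Ranger nodes have 16 cores, per this thread

# Set quoted from the guide excerpt; 16 is added here as an assumption
# so that fully packed nodes are accepted as well.
ALLOWED_TASKS_PER_NODE = {1, 2, 4, 8, 12, 15, 16}


def ranger_pe_request(nodes, tasks_per_node, total_tasks=None):
    """Return (pe_directive, my_nslots) for an SGE job script.

    my_nslots is None when every requested task slot is used, and the
    reduced task count when fewer tasks than nodes * tasks_per_node run.
    """
    if nodes < 1:
        raise ValueError("nodes must be at least 1")
    if tasks_per_node not in ALLOWED_TASKS_PER_NODE:
        raise ValueError(f"unsupported tasks per node: {tasks_per_node}")

    slots = nodes * CORES_PER_NODE        # always a multiple of 16
    capacity = nodes * tasks_per_node     # tasks that can actually start
    if total_tasks is None:
        total_tasks = capacity
    if not 1 <= total_tasks <= capacity:
        raise ValueError(f"total_tasks must be between 1 and {capacity}")

    directive = f"#$ -pe {tasks_per_node}way {slots}"
    my_nslots = total_tasks if total_tasks < capacity else None
    return directive, my_nslots


if __name__ == "__main__":
    # The guide's example: 8 tasks per node, 4 nodes, 31 tasks launched.
    print(ranger_pe_request(4, 8, total_tasks=31))  # ('#$ -pe 8way 64', 31)
```

Under these assumptions, one job per node on a single node would come out as "#$ -pe 1way 16": the slot request stays a multiple of 16 even though only one task runs.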
In a job script, use the following -pe option and environment variable statement:

    #$ -pe way
    export MY_NSLOTS=    # For Bourne shells
or
    setenv MY_NSLOTS     # For C shells

where is a number in the set {1, 2, 4, 8, 12, 15}.

For example, using a Bourne shell:

    #$ -pe 8way 64         # Use 8 Tasks per Node, 4 Nodes requested
    export MY_NSLOTS=31    # 31 tasks are launched

----- Original Message -----
> From: "Mihael Hategan"
> To: "Sarah Kenny"
> Cc: "Swift Devel" , "Swift User"
> Sent: Wednesday, December 21, 2011 5:53:53 PM
> Subject: Re: [Swift-user] [Swift-devel] swift on ranger
> On Wed, 2011-12-21 at 14:50 -0800, Sarah Kenny wrote:
> > yr right ketan, if i change it to: key="jobsPerNode">16 the warning
> > message goes away. however, there are times i don't want to run 16
> > jobs per node...e.g. bcs a single job needs all the available memory
> > so even though the node has 16 processors i can't actually use them
> > all. so perhaps this is just a scheduling issue with ranger/sge in
> > that they don't want you to submit a job that's going to leave
> > processors idle? that seems a bit restrictive though...
>
> Hmmm. I think that one should be able to request 16 cores and run only
> two jobs at a time on them. Have you tried pe=2way?
>
> _______________________________________________
> Swift-user mailing list
> Swift-user at ci.uchicago.edu
> https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-user

--
Michael Wilde
Computation Institute, University of Chicago
Mathematics and Computer Science Division
Argonne National Laboratory

From yclin at cse.nsysu.edu.tw Thu Dec 22 02:32:50 2011
From: yclin at cse.nsysu.edu.tw (Dr. Ying Chih Lin)
Date: Thu, 22 Dec 2011 16:32:50 +0800
Subject: [Swift-user] HELP for using Swift~
Message-ID: <20111222083149.M49574@cse.nsysu.edu.tw>

Dear Sir or Madam,

I am a beginner with Swift and am writing to request information about running Swift on many sites. Here are two problems with the document siteguide.pdf:
1).
What are PADS and Fusion in siteguide.pdf? Would it be possible to give brief introductions or websites for these terms in the guide documents?
2). Section 2.1.4 in siteguide.pdf says to copy catsn.swift and data.txt to the current directory, but there is no data.txt in the swift-0.93RC5 package.

Thank you very much for your kindness.

Yours sincerely,
Ian

From wilde at mcs.anl.gov Thu Dec 22 05:55:15 2011
From: wilde at mcs.anl.gov (Michael Wilde)
Date: Thu, 22 Dec 2011 05:55:15 -0600 (CST)
Subject: [Swift-user] HELP for using Swift~
In-Reply-To: <20111222083149.M49574@cse.nsysu.edu.tw>
Message-ID: <245875943.90802.1324554915789.JavaMail.root@zimbra.anl.gov>

Dear Ian,

----- Original Message -----
> From: "Dr. Ying Chih Lin"
> To: swift-user at ci.uchicago.edu
> Sent: Thursday, December 22, 2011 2:32:50 AM
> Subject: [Swift-user] HELP for using Swift~
> Dear Sir or Madam,
>
> I am a beginner with Swift and am writing to request information about
> running Swift on many sites. Here are two problems with the document
> siteguide.pdf:
> 1). What are PADS and Fusion in siteguide.pdf?

These are two specific clusters in our local environment here at UChicago and Argonne. We should clarify that in the guide and explain how to adapt that information to your own local clusters.

> Would it be possible to give brief introductions or websites for
> these terms in the guide documents?

Yes, we should do that.

> 2). Section 2.1.4 in siteguide.pdf says to copy catsn.swift and
> data.txt to the current directory, but there is no data.txt in the
> swift-0.93RC5 package.

Thank you for this feedback. We will post a clarification and inform the list.

Also, please subscribe to this email list so that your posts go through without moderation.

Thanks,

- Mike

> Thank you very much for your kindness.
> > > Yours sincerely, > Ian > _______________________________________________ > Swift-user mailing list > Swift-user at ci.uchicago.edu > https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-user -- Michael Wilde Computation Institute, University of Chicago Mathematics and Computer Science Division Argonne National Laboratory From skenny at uci.edu Thu Dec 22 16:44:37 2011 From: skenny at uci.edu (Sarah Kenny) Date: Thu, 22 Dec 2011 14:44:37 -0800 Subject: [Swift-user] [Swift-devel] swift on ranger In-Reply-To: <2130988117.43047.1324540465638.JavaMail.root@zimbra-mb2.anl.gov> References: <2130988117.43047.1324540465638.JavaMail.root@zimbra-mb2.anl.gov> Message-ID: yeah, latest build works...i don't get a warning when specifying 1 job per node with 16way pe. On Wed, Dec 21, 2011 at 11:54 PM, David Kelly wrote: > Sarah, > > Can you please give this another try? I believe it should work now with > your original sites.xml. > > David > > ----- Original Message ----- > > From: "Sarah Kenny" > > To: "Ketan Maheshwari" > > Cc: "Swift Devel" , "Swift User" < > swift-user at ci.uchicago.edu> > > Sent: Wednesday, December 21, 2011 4:50:48 PM > > Subject: Re: [Swift-devel] [Swift-user] swift on ranger > > yr right ketan, if i change it to: > key="jobsPerNode">16 the warning message goes away. however, > > there are times i don't want to run 16 jobs per node...e.g. bcs a > > single job needs all the available memory so even though the node has > > 16 processors i can't actually use them all. so perhaps this is just a > > scheduling issue with ranger/sge in that they don't want you to submit > > a job that's going to leave processors idle? that seems a bit > > restrictive though... > > > > > > On Wed, Dec 21, 2011 at 7:58 AM, Ketan Maheshwari < > > ketancmaheshwari at gmail.com > wrote: > > > > > > Sarah, > > > > I checked my sites.xml. The only difference between yours and mine > > being the value of jobspernode which is 16 in my case. 
I have had this > > value in other multiples of 16 which has worked fine for me. > > > > > > > > > > > > > > On Wed, Dec 21, 2011 at 6:57 AM, Sarah Kenny < skenny at uci.edu > wrote: > > > > > > > > > > > > getting this when submitting to ranger with both the latest and our > > previous version of swift (swift-r5259 cog-r3313) > > > > Final status: time: Wed, 21 Dec 2011 04:49:15 -0800 Finished > > successfully:100 > > The following warnings have occurred: > > 1. > > org.globus.cog.abstraction.impl.common.task.TaskSubmissionException: > > Cannot submit job: Could not submit job (qsub reported an exit code of > > 1). > > > -------------------------------------------------------------------------- > > Welcome to TACC's Ranger System, an NSF XD Resource > > > ----------------------------------------------------------------------------> > > Checking that you specified -V...--> Checking that you specified a > > time limit...--> Checking that you specified a queue...--> Setting > > project...--> Checking that you specified a parallel environment...--> > > Checking that you specified a valid parallel environment name...--> > > Checking that the minimum and maximum PE counts are the same...--> > > Checking that the number of PEs requested is > > valid...------------------> Rejecting job <------------------Your slot > > (or core) request is not a multiple of 16.Syntax: -pe > > where is a multiple of > > 16.----------------------------------------------------- > > Unable to run job: JSV rejected job.Exiting. 
> > > > at > > > org.globus.cog.abstraction.impl.scheduler.common.AbstractJobSubmissionTaskHandler.submit(AbstractJobSubmissionTaskHandler.java:63) > > at > > > org.globus.cog.abstraction.impl.common.AbstractTaskHandler.submit(AbstractTaskHandler.java:45) > > at > > > org.globus.cog.abstraction.impl.common.task.ExecutionTaskHandler.submit(ExecutionTaskHandler.java:57) > > at > > > org.globus.cog.abstraction.coaster.service.job.manager.LocalQueueProcessor.run(LocalQueueProcessor.java:40) > > Caused by: > > org.globus.cog.abstraction.impl.scheduler.common.ProcessException: > > Could not submit job (qsub reported an exit code of 1). > > > -------------------------------------------------------------------------- > > Welcome to TACC's Ranger System, an NSF XD Resource > > > ----------------------------------------------------------------------------> > > Checking that you specified -V...--> Checking that you specified a > > time limit...--> Checking that you specified a queue...--> Setting > > project...--> Checking that you specified a parallel environment...--> > > Checking that you specified a valid parallel environment name...--> > > Checking that the minimum and maximum PE counts are the same...--> > > Checking that the number of PEs requested is > > valid...------------------> Rejecting job <------------------Your slot > > (or core) request is not a multiple of 16.Syntax: -pe > > where is a multiple of > > 16.----------------------------------------------------- > > Unable to run job: JSV rejected job.Exiting. > > > > at > > > org.globus.cog.abstraction.impl.scheduler.common.AbstractExecutor.start(AbstractExecutor.java:108) > > at > > > org.globus.cog.abstraction.impl.scheduler.common.AbstractJobSubmissionTaskHandler.submit(AbstractJobSubmissionTaskHandler.java:53) > > ... 
3 more > > > > ################### sites file > > > > > > > > > > > > 86400 > > 02:00:00 > > 1 > > 64 > > 4096 > > normal > > 1.28 > > TG-DBS080004N > > 16way > > 10000 > > /work/00043/tg457040/swiftwork > > > > > > > > same settings we've been using for a while, i'm not sure why this > > seems to be popping up now, but it's rather consistent. all jobs are > > finishing successfully, so it's rather confusing...any idea what i > > might be missing here? > > > > thanks > > ~sk > > > > > > > > > > > > > > _______________________________________________ > > Swift-devel mailing list > > Swift-devel at ci.uchicago.edu > > https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-devel > > > > > > > > > > -- > > Ketan > > > > > > > > _______________________________________________ > > Swift-user mailing list > > Swift-user at ci.uchicago.edu > > https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-user > > > > > > > > -- > > Sarah Kenny > > Programmer ~ Brain Circuits Laboratory ~ Rm 2224 Bio Sci III > > University of California Irvine, Dept. of Neurology ~ 773-818-8300 > > > > > > _______________________________________________ > > Swift-devel mailing list > > Swift-devel at ci.uchicago.edu > > https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-devel > _______________________________________________ > Swift-user mailing list > Swift-user at ci.uchicago.edu > https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-user > -- Sarah Kenny Programmer ~ Brain Circuits Laboratory ~ Rm 2224 Bio Sci III University of California Irvine, Dept. of Neurology ~ 773-818-8300 From iraicu at cs.iit.edu Thu Dec 22 17:03:24 2011 From: iraicu at cs.iit.edu (Ioan Raicu) Date: Thu, 22 Dec 2011 17:03:24 -0600 Subject: [Swift-user] CFP: The 9th Int. Conf. 
on Autonomic Computing (ICAC) 2012 Message-ID: <4EF3B73C.3020204@cs.iit.edu> CALL FOR PAPERS and WORKSHOP PROPOSALS The 9th International Conference on Autonomic Computing (ICAC 2012) September 17-21, 2012. San Jose, CA, USA http://icac2012.cs.fiu.edu/ ----------------------------------------------------------------- IMPORTANT DATES Paper and Poster Submission: March 9, 2012, 11:59pm PST Notification: May 18, 2012 Camera-ready Due: June 8, 2012 Workshop Proposal Submission: February 10, 2012 ----------------------------------------------------------------- OVERVIEW ICAC is the leading conference on autonomic computing techniques, foundations, and applications. Autonomic computing refers to methods and means for automated management of performance, fault, security, and configuration with little involvement of users or administrators. Systems introducing new autonomic features are becoming increasingly prevalent, motivating research that spans a variety of areas, from computer systems, networking, software engineering, and data management to machine learning, control theory, and bio-inspired computing. ICAC brings together researchers and practitioners across these disciplines to address multiple facets of adaptation and self-management in computing systems and applications from different perspectives. Autonomic computing solutions are sought for clouds, grids, data centers, enterprise software, internet services, data services, smart phones, embedded systems, and sensor networks. In these environments, resources and applications must be managed to maximize performance and minimize cost, while maintaining predictable and reliable behavior in the face of varying workloads, failures, and malicious threats. Papers are solicited from all areas of autonomic computing, including (but not limited to): * End-to-end techniques for management of resources, workloads, performance, faults, power/cooling, security, and others. 
* Self-managing components, such as server, storage, network protocols, or specific application elements, and embedded and mobile end systems such as smart phones. * Decision and analysis techniques and their use, such as machine learning, control theory, predictive methods, probability and stochastic processes, queuing theory methodologies, emergent behavior, rule-based systems, and bio-inspired techniques. * Monitoring systems for autonomic computing. * Hypervisor, operating systems, hardware, or application support for autonomic computing. * Novel human interfaces for monitoring and controlling autonomic systems. * Management topics, such as specification and modeling of service-level agreements, behavior enforcement and tie-in with IT governance. * Toolkits, frameworks, principles and architectures, from software engineering practices and experimental methodologies to agent-based techniques and virtualization. * Fundamental science and theory of self-managing systems: understanding, controlling or exploiting system behaviors to enforce autonomic properties. * Applications of autonomic computing and experiences with prototyped or deployed systems solving real-world problems in science, engineering, business and society. Papers will be judged on originality, significance, interest, correctness, clarity and relevance to the broader community. Papers should report on experiences, measurements, user studies, or other evaluations, as appropriate. Evaluation of a prototype or large-scale deployment of systems and applications is expected. PAPER AND POSTER SUBMISSIONS Full papers (a maximum of 10 pages in the two-column ACM proceedings format) and posters (2 pages) are invited on a wide variety of topics relating to autonomic computing. Submitted papers must be original work, and may not be under consideration for another conference or journal. Complete formatting and submission instructions can be found on the conference web site. 
Accepted papers and posters will appear in proceedings distributed at the conference and available electronically. Relevant top ICAC'12 papers will be invited for "fast-track" submissions to the ACM Transactions on Autonomous and Adaptive Systems (TAAS). WORKSHOPS, DEMONSTRATIONS AND EXHIBITION ICAC'12 welcomes proposals for co-located workshops on topics of interest to the autonomic computing community. Workshop proposals should be submitted to the Workshop Chair, Fred Douglis (f.douglis at computer.org) by February 10, 2012. Workshops are expected to publish proceedings, and should cover areas that complement the main program. ICAC'12 will also feature a demonstration and exhibition session consisting of prototypes and technology artifacts such as demonstrating autonomic software or autonomic computing principles. Entries will be judged by a separate committee led by the demo/exhibit chair. INDUSTRY SESSION One of ICAC's important roles is to bring together researchers and practitioners from academia and industry. In its industry session, ICAC helps fulfill this role by presenting an industry viewpoint on technologies, products, and market needs. The industry session also addresses current challenges, and opportunities for academic and corporate research collaborations. We encourage industry leaders, including entrepreneurs, product developers, architects, managers, marketers and end users, to submit their papers and posters reflecting such industry perspectives as part of the regular submission process. 
------------------------------------------------------------------ ORGANIZERS GENERAL CHAIR Dejan Milojicic, HP Labs PROGRAM CHAIRS Dongyan Xu, Purdue University Vanish Talwar, HP Labs INDUSTRY CHAIR Xiaoyun Zhu, VMware WORKSHOPS CHAIR Fred Douglis, EMC POSTERS/DEMO/EXHIBITS CHAIR Eno Thereska, Microsoft Research FINANCE CHAIR Michael Kozuch, Intel LOCAL ARRANGEMENT CHAIR Jessica Blaine PUBLICITY CHAIRS Daniel Batista, University of São Paulo Vartan Padaryan, ISP/Russian Academy of Sci. Ioan Raicu, Illinois Inst. of Technology Jianfeng Zhan, ICT/Chinese Academy of Sci. Ming Zhao, Florida Intl. University PROGRAM COMMITTEE Tarek Abdelzaher, UIUC Umesh Bellur, IIT, Bombay Ken Birman, Cornell University Rajkumar Buyya, Univ. of Melbourne Rocky Chang, Hong Kong Polytechnic University Yuan Chen, HP Labs Alva Couch, Tufts University Peter Dinda, Northwestern University Fred Douglis, EMC Renato Figueiredo, University of Florida Mohamed Hefeeda, Qatar Computing Research Institute Joe Hellerstein, Google Geoff Jiang, NEC Labs Jeff Kephart, IBM Research Emre Kiciman, Microsoft Research Fabio Kon, University of São Paulo Michael Kozuch, Intel Dejan Milojicic, HP Labs Klara Nahrstedt, UIUC Priya Narasimhan, CMU Manish Parashar, Rutgers University Ioan Raicu, Illinois Inst. of Technology Omer Rana, Cardiff University Masoud Sadjadi, Florida Intl. University Rick Schlichting, AT&T Labs Hartmut Schmeck, KIT Karsten Schwan, Georgia Tech Onn Shehory, IBM Research Eno Thereska, Microsoft Research Xiaoyun Zhu, VMware -- ================================================================= Ioan Raicu, Ph.D. 
Assistant Professor, Illinois Institute of Technology (IIT) Guest Research Faculty, Argonne National Laboratory (ANL) ================================================================= Data-Intensive Distributed Systems Laboratory, CS/IIT Distributed Systems Laboratory, MCS/ANL ================================================================= Cel: 1-847-722-0876 Office: 1-312-567-5704 Email: iraicu at cs.iit.edu Web: http://www.cs.iit.edu/~iraicu/ Web: http://datasys.cs.iit.edu/ ================================================================= ================================================================= From iraicu at cs.iit.edu Thu Dec 22 17:39:59 2011 From: iraicu at cs.iit.edu (Ioan Raicu) Date: Thu, 22 Dec 2011 17:39:59 -0600 Subject: [Swift-user] CFP: 21st International Heterogeneity in Computing Workshop at IEEE IPDPS 2012 Message-ID: <4EF3BFCF.4000300@cs.iit.edu> *HCW 2012 Call for Papers* *The Twenty First International Heterogeneity in Computing Workshop* In conjunction with IPDPS 2012, May 21, 2012, Shanghai, China Sponsored by the IEEE Computer Society, through the Technical Committee on Parallel Processing (TCPP), and by the U.S. Office of Naval Research (ONR). * * * Today, most computing systems have elements of heterogeneity. Heterogeneity springs from the richness of environments where diversity and resource abundance prevail. Recognizing, capturing, and efficiently exploiting this diversity in an integrated and coherent manner are key goals of heterogeneous computing. Heterogeneous computing systems are those with a range of diverse computing resources that can be on a chip, within a computer, or on a local or geographically distributed network. The development of heterogeneous multi-core chips and the pervasive use of networks by all segments of society mean that the number and types of heterogeneous computing resources are growing rapidly. This growth creates the need and opportunity for new research to effectively utilize these resources in innovative and novel ways. 
For example, cluster computing, grid computing, peer-to-peer computing, and cloud computing all involve elements of heterogeneity. The effective implementation of efficient applications in these environments, however, requires that a host of issues be addressed that simply do not occur in homogeneous systems. Whereas many researchers and practitioners that use computers have a peripheral awareness of heterogeneity in their respective fields, few critically approach their fields from the heterogeneous perspective. This is not particularly surprising, because each field has its own unique challenges and imperatives that propel investigations in search of solutions to pressing problems. Addressing computing problems from the heterogeneous perspective offers at least three advantages: (i) the design and development of more advanced high-performance computing platforms, (ii) insight into new solution approaches, and (iii) exposure to new research opportunities and relationships among distinct research areas. HCW encourages the examination of both hardware and software systems from the perspective of heterogeneity. With the increasing number of components in heterogeneous parallel and distributed systems, failure is becoming a critical factor that impacts application performance. High-performance computing systems, especially those heading towards exascale, are also becoming increasingly heterogeneous and hierarchical, expecting to routinely employ heterogeneous multicores, various accelerators (such as GPUs), deeper memory hierarchies, and heterogeneous and hierarchical communication networks. This year, HCW is specifically encouraging (but not limited to) submissions that explore paradigms, algorithms, and techniques for high performance heterogeneous computing. 
*TOPICS* ------------------------------------------------------------------------ Areas of research interest include, but are not limited to, heterogeneity aspects of: * Parallel algorithms for heterogeneous and hierarchical systems, including manycores and hardware accelerators (FPGAs, GPUs, etc.) * Parallel algorithms for efficient problem solving on heterogeneous platforms * Performance models and their use in the design of parallel and distributed algorithms for heterogeneous platforms * Programming paradigms and tools * Fault tolerance * Resource allocation and scheduling * Computer architectures * Performance evaluation and management * High performance computing * Cluster, Grid and Cloud computing * Peer-to-peer computing * Ubiquitous computing * Application case studies * Task and communication scheduling * Task coordination and workflow *IMPORTANT DATES* ------------------------------------------------------------------------ Paper submission: January 7, 2012 Author Notification: February 7, 2012 Camera-ready: February 21, 2012 *PAPER SUBMISSIONS* ------------------------------------------------------------------------ Prospective authors are encouraged to visit the HCW 2012 website (hcw.wsu.edu) for instructions on how to submit their papers. *PUBLICATION* ------------------------------------------------------------------------ The HCW 2012 proceedings will be published through the IEEE Computer Society Press as part of the IPDPS CD-ROM. The authors of accepted papers will also be invited to submit extended versions of their work to a special issue of the Journal of Parallel and Distributed Computing (JPDC) on Heterogeneity in Parallel and Distributed Computing that is planned for publication in 2012. *WORKSHOP ORGANIZATION* ------------------------------------------------------------------------ *General Chair*: Behrooz Shirazi, Washington State University, U.S.A. 
*Program Chair*: Alexey Lastovetsky, University College Dublin, Ireland *Steering Committee*: H. J. Siegel, Colorado State University, U.S.A., Chair John Antonio, University of Oklahoma, U.S.A. Francine Berman, Rensselaer Polytechnic Institute, U.S.A. Jack Dongarra, University of Tennessee, U.S.A. Jerry Potter, Colorado State University, U.S.A. Viktor K. Prasanna, University of Southern California, U.S.A. Yves Robert, Ecole Normale Superieure de Lyon, France Arnold Rosenberg, Colorado State University, Northeastern University, U.S.A. Vaidy Sunderam, Emory University, U.S.A. *Program Committee*: Shoukat Ali, IBM, Ireland Francisco Almeida, University of La Laguna, Spain Rosa M. Badia, Barcelona Supercomputing Center and CSIC, Spain Ioana Banicescu, Mississippi State University, U.S.A. Olivier Beaumont, INRIA, France Shuvra S. Bhattacharyya, University of Maryland, U.S.A. George Bosilca, University of Tennessee, U.S.A. Eddy Caron, ENS-Lyon, France Domingo Gimenez, University of Murcia, Spain Alexey Kalinov, Cadence Design Systems, Russia Tahar Kechadi, University College Dublin, Ireland Jong-Kook Kim, Korea University, South Korea Thomas Ludwig, University of Hamburg, Germany Tony Maciejewski, Colorado State University, U.S.A. John P. Morrison, University College Cork, Ireland Dana Petcu, Western University of Timisoara, Romania Antonio Plaza, University of Extremadura, Spain Xiao Qin, Auburn University, U.S.A. Enrique Quintana, University of Jaume I of Castellon, Spain Ioan Raicu, Northwestern University, U.S.A. Alistair Rendell, Australian National University, Australia Gudula Runger, TU Chemnitz, Germany Vladimir Rychkov, University College Dublin, Ireland Uwe Schwiegelshohn, University of Dortmund, Germany Stephen L. Scott, Tennessee Tech University & Oak Ridge National Laboratory, U.S.A. James T. Smith, DigitalGlobe, U.S.A. Leonel Sousa, Technical University of Lisbon, Portugal Stanimire Tomov, University of Tennessee, U.S.A. 
Denis Trystram, IMAG, France Carlos Varela, Rensselaer Polytechnic Institute, U.S.A. Qin Zheng, A*Star, Singapore * -- ================================================================= Ioan Raicu, Ph.D. Assistant Professor, Illinois Institute of Technology (IIT) Guest Research Faculty, Argonne National Laboratory (ANL) ================================================================= Data-Intensive Distributed Systems Laboratory, CS/IIT Distributed Systems Laboratory, MCS/ANL ================================================================= Cel: 1-847-722-0876 Office: 1-312-567-5704 Email: iraicu at cs.iit.edu Web: http://www.cs.iit.edu/~iraicu/ Web: http://datasys.cs.iit.edu/ ================================================================= ================================================================= From davidk at ci.uchicago.edu Fri Dec 23 23:13:18 2011 From: davidk at ci.uchicago.edu (David Kelly) Date: Fri, 23 Dec 2011 23:13:18 -0600 (CST) Subject: [Swift-user] HELP for using Swift~ In-Reply-To: <20111222083149.M49574@cse.nsysu.edu.tw> Message-ID: <900034283.45778.1324703598498.JavaMail.root@zimbra-mb2.anl.gov> Ian, Thank you for the email. I have modified the siteguide to reflect the issues you brought to our attention. The data.txt file has been added to the source repository and will be available in the 0.93 release. Please let us know if you see any other ways in which the documentation can be improved. Thank you! Regards, David ----- Original Message ----- > From: "Dr. 
Ying Chih Lin" > To: swift-user at ci.uchicago.edu > Sent: Thursday, December 22, 2011 2:32:50 AM > Subject: [Swift-user] HELP for using Swift~ > Dear Sir or Madam, > > I am a beginner on Swift, and writing to request information about > running Swift on > many sites. Here are two problems on the document siteguide.pdf: > 1). What is the PADS and Fusion in the siteguide.pdf? Could it be > possible to give > simple introductions or websites for these terminologies in guide > documents? > 2). Section 2.1.4 in the siteguide.pdf says that copying catsn.swift > and data.txt to > the current directory, but there is no data.txt in the swift-0.93RC5 > package. > > Thank you very much for your kindness. > > > Yours sincerely, > Ian > _______________________________________________ > Swift-user mailing list > Swift-user at ci.uchicago.edu > https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-user