[Swift-devel] ssh-cl how to tell coaster bootstrap to run with limited java heap space
Mihael Hategan
hategan at mcs.anl.gov
Thu Sep 19 15:48:30 CDT 2013
Are the login nodes 64 bit by any chance?
On Thu, 2013-09-19 at 15:43 -0500, Ketan Maheshwari wrote:
> SDSC Gordon admins have limited java heap space to 256 on login nodes.
>
> This is enabled via the following environment variable:
>
> JAVA_TOOL_OPTIONS=-Xmx256m
>
> It seems coaster bootstrap does not like this:
>
> mdw$ swift -sites.file sites.gordon.xml -tc.file apps -config cf
> workflow.swift
> Swift trunk swift-r7089 cog-r3775
> RunID: 20130919-2038-jef0ns83
> Progress: time: Thu, 19 Sep 2013 20:38:42 +0000
> Progress: time: Thu, 19 Sep 2013 20:38:43 +0000 Submitting:2
>
> Execution failed:
> Exception in matrixgen:
> Arguments: [2544, 3300, mA.dat]
> Host: gordon
> Directory: workflow-20130919-2038-jef0ns83/jobs/a/matrixgen-a1rnkhfl
> exception @ swift-int-staging.k, line: 162
> Caused by: null
> Caused by:
> org.globus.cog.abstraction.impl.common.task.TaskSubmissionException: Could
> not submit job
> Caused by:
> org.globus.cog.abstraction.impl.common.task.TaskSubmissionException: Could
> not start coaster service
> Caused by:
> org.globus.cog.abstraction.impl.common.task.TaskSubmissionException: Task
> ended before registration was received.
>
> Picked up JAVA_TOOL_OPTIONS: -Xmx256m
> /bin/bash: line 54: 33675 Aborted /usr/java/latest/bin/java
> -Djava=/usr/java/latest/bin/java -DGLOBUS_TCP_PORT_RANGE=50000,51000
> -DX509_USER_PROXY=/home/ketan/.globus/sshproxy-316831905-1379663604
> -DX509_CERT_DIR=/home/ketan/.globus/sshCAcert-316831905-1379663604.pem
> -DGLOBUS_HOSTNAME=gordon.sdsc.xsede.org -Duser.home=/home/ketan -jar
> /tmp/bootstrap.RWIqFu http://swift.rcc.uchicago.edu:50001
> https://128.135.112.73:50000 11836079986
>
>
> Do I understand right that this is indeed the java heap space issue? or is
> it something else that I could work around with? Thanks for any ideas.
>
> SDSC Gordon admins have limited java heap space to 256 on login nodes.
>
>
> This is enabled via the following environment variable:
>
> JAVA_TOOL_OPTIONS=-Xmx256m
>
>
> It seems coaster bootstrap does not like this:
>
> mdw$ swift -sites.file sites.gordon.xml -tc.file apps -config cf
> workflow.swift
> Swift trunk swift-r7089 cog-r3775
> RunID: 20130919-2038-jef0ns83
> Progress: time: Thu, 19 Sep 2013 20:38:42 +0000
> Progress: time: Thu, 19 Sep 2013 20:38:43 +0000 Submitting:2
>
> Execution failed:
> Exception in matrixgen:
> Arguments: [2544, 3300, mA.dat]
> Host: gordon
> Directory:
> workflow-20130919-2038-jef0ns83/jobs/a/matrixgen-a1rnkhfl
> exception @ swift-int-staging.k, line: 162
> Caused by: null
> Caused by:
> org.globus.cog.abstraction.impl.common.task.TaskSubmissionException:
> Could not submit job
> Caused by:
> org.globus.cog.abstraction.impl.common.task.TaskSubmissionException:
> Could not start coaster service
> Caused by:
> org.globus.cog.abstraction.impl.common.task.TaskSubmissionException:
> Task ended before registration was received.
>
> Picked up JAVA_TOOL_OPTIONS: -Xmx256m
> /bin/bash: line 54: 33675
> Aborted /usr/java/latest/bin/java
> -Djava=/usr/java/latest/bin/java -DGLOBUS_TCP_PORT_RANGE=50000,51000
> -DX509_USER_PROXY=/home/ketan/.globus/sshproxy-316831905-1379663604
> -DX509_CERT_DIR=/home/ketan/.globus/sshCAcert-316831905-1379663604.pem
> -DGLOBUS_HOSTNAME=gordon.sdsc.xsede.org -Duser.home=/home/ketan
> -jar /tmp/bootstrap.RWIqFu http://swift.rcc.uchicago.edu:50001
> https://128.135.112.73:50000 11836079986
>
>
>
>
> Do I understand right that this is indeed the java heap space issue?
> or is it something else that I could work around with? Thanks for any
> ideas.
>
>
> --
> Ketan
>
>
> _______________________________________________
> Swift-devel mailing list
> Swift-devel at ci.uchicago.edu
> https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-devel
More information about the Swift-devel
mailing list