<html><head><style type='text/css'>p { margin: 0; }</style></head><body><div style='font-family: Times New Roman; font-size: 12pt; color: #000000'><span><br>Cookie is back up.<br><span name="x"></span><br>Daniel Lowell<br>(630)252-0092<br><br><span name="x"></span><br></span><br><hr id="zwchr"><b>From: </b>"Daniel Lowell" &lt;redratio1@gmail.com&gt;<br><b>To: </b>"Kenneth J. Raffenetti , Boyana Norris , Sri Hari Krishna , Sa-Lin Cheng Bernstein , Van Bui" &lt;raffenet@mcs.anl.gov&gt;<br><b>Sent: </b>Thursday, November 10, 2011 11:50:59 AM<br><b>Subject: </b>[Cookie-users] GPU zombie and reboot on cookie<br><br><span class="Apple-style-span" style="border-collapse: separate; font-family: Monaco; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: 2; text-indent: 0px; text-transform: none; white-space: normal; widows: 2; word-spacing: 0px; font-size: medium;"><div><div style="font-family: 'Times New Roman'; font-size: 12pt; color: rgb(0, 0, 0);"><span>There is a zombie process on cookie's GPU. You can take a look with nvidia-smi -a<br>Utilization is 99%, meaning no work can be done on the GPU until it is freed, which will not happen until cookie is rebooted. nvidia-smi for SDK 4.0 cannot reset the GPU on the fly; the SDK 4.1 version will be able to. Until then...<br><br>I'm not sure when a good time for a reboot would be. I know Krishna has a pbound job running, so I'd like to get a consensus from cookie users.<br><br>Let me know. Thanks.<br><span></span><br>Daniel Lowell<br>(630)252-0092<br><br><span></span><br></span><br></div></div></span><br>_______________________________________________<br>Cookie-users mailing list<br>Cookie-users@lists.mcs.anl.gov<br>https://lists.mcs.anl.gov/mailman/listinfo/cookie-users<br></div></body></html>