[Darshan-users] Hang on post-process
Jeffrey Layton
laytonjb at gmail.com
Tue Jul 13 12:41:47 CDT 2021
Good afternoon,
Apologies for posting yet another problem :) I'm trying to use Darshan on
a TensorFlow/Keras script. It's a simple model operating on the CIFAR-10
data set (fairly small). Darshan produces the output files, but when I try
to post-process one using darshan-job-summary.pl, it hangs and I end up
having to kill the process (I waited about an hour, just to be sure).
I run the script using the following:
export DARSHAN_EXCLUDE_DIRS=/proc,/etc,/dev,/sys
env LD_PRELOAD=/home/laytonjb/bin/darshan-3.3.1/lib/libdarshan.so python3 cifar10-4-checkpoint.py
(I can provide the script if needed). It produces four files:
$ ls -s
total 72
 4 laytonjb_ptxas_id6210-6210_7-13-47480-2131301613401632697_1.darshan
 4 laytonjb_ptxas_id6211-6211_7-13-47480-2131301613401632697_1.darshan
60 laytonjb_python3_id6041-6041_7-13-47475-2131301613401632697_1.darshan
 4 laytonjb_uname_id6056-6056_7-13-47475-2131301613401632697_1.darshan
I chose to post-process the "python3" output but this is where it hangs.
I'm attaching the darshan output file if that is of any help.
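For anyone reproducing this, here is a minimal sketch of putting a time limit on the post-processing step instead of waiting and killing it by hand. The helper name run_limited and the 300-second limit in the usage note are made up for illustration, and the sketch assumes the timeout command from GNU coreutils is available.

```shell
# Hypothetical helper (not part of Darshan): run a command under a time limit.
# Usage: run_limited SECONDS COMMAND [ARGS...]
run_limited() {
    limit="$1"
    shift
    status=0
    # GNU timeout exits with status 124 when the limit is reached.
    timeout "$limit" "$@" || status=$?
    if [ "$status" -eq 124 ]; then
        echo "timed out after ${limit}s: $*" >&2
    fi
    return "$status"
}
```

With that in place, something like "run_limited 300 darshan-job-summary.pl laytonjb_python3_id6041-6041_7-13-47475-2131301613401632697_1.darshan" would give up after five minutes. Running darshan-parser (the darshan-util tool that dumps a log as text) on the same file under the same limit may help tell whether the log itself can be read.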
Thanks for any help.
Jeff
-------------- next part --------------
A non-text attachment was scrubbed...
Name: laytonjb_python3_id6041-6041_7-13-47475-2131301613401632697_1.darshan
Type: application/octet-stream
Size: 60979 bytes
Desc: not available
URL: <http://lists.mcs.anl.gov/pipermail/darshan-users/attachments/20210713/72c5a4a8/attachment-0001.obj>