[Darshan-users] Hang on post-process

Jeffrey Layton laytonjb at gmail.com
Tue Jul 13 12:41:47 CDT 2021


Good afternoon,

Apologies for posting yet another problem :)  I'm trying to use Darshan on
a Tensorflow/Keras script. It's a simple model operating on the CIFAR-10
data set (fairly small). Darshan produces the output files but when I try
to post-process one using darshan-job-summary.pl, it hangs and I end up
having to kill the process (I waited about an hour - just to be sure).

I run the script using the following:

export DARSHAN_EXCLUDE_DIRS=/proc,/etc,/dev,/sys
env LD_PRELOAD=/home/laytonjb/bin/darshan-3.3.1/lib/libdarshan.so python3
cifar10-4-checkpoint.py

(I can provide the script if needed). It produces four files:

$ ls -s
total 72
 4 laytonjb_ptxas_id6210-6210_7-13-47480-2131301613401632697_1.darshan  60
laytonjb_python3_id6041-6041_7-13-47475-2131301613401632697_1.darshan
 4 laytonjb_ptxas_id6211-6211_7-13-47480-2131301613401632697_1.darshan   4
laytonjb_uname_id6056-6056_7-13-47475-2131301613401632697_1.darshan


I chose to post-process the "python3" output but this is where it hangs.
I'm attaching the darshan output file if that is of any help.

Thanks for any help.

Jeff
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/darshan-users/attachments/20210713/72c5a4a8/attachment-0001.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: laytonjb_python3_id6041-6041_7-13-47475-2131301613401632697_1.darshan
Type: application/octet-stream
Size: 60979 bytes
Desc: not available
URL: <http://lists.mcs.anl.gov/pipermail/darshan-users/attachments/20210713/72c5a4a8/attachment-0001.obj>


More information about the Darshan-users mailing list