<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
</head>
<body text="#000000" bgcolor="#FFFFFF">
<div class="moz-cite-prefix">Hi Ed,<br>
<br>
I was thrown off by the extra column from nl at first, but you
aren't supposed to get two records with identical ranks (-1 in
this case, meaning the stats are shared by all ranks) and file
name hashes (14116...). As you observed in your test program,
they should have merged into one record.<br>
<br>
For completeness, it is definitely legal to see the same file show
up twice for *different* ranks. For example, if MPI ranks 0 and 5
both opened atomselect.tcl , but no one else did, then you would
see two entries. One for rank 0 and one for rank 5. Darshan only
collapses shared files into an aggregate record when everyone
opened the same file. If a subset of ranks opened it then they
remain separate records.<br>
<br>
In summary it looks like that log is broken for some reason, but I
would suggest logging that it happened if you are loading a lot of
logs into a data base at once to make sure it's not a widespread
problem.<br>
<br>
thanks,<br>
-Phil<br>
<br>
On 03/27/2018 12:58 PM, Ed Karrels wrote:<br>
</div>
<blockquote type="cite"
cite="mid:CA+xfkFtc8EqNxCLwOQTSV+dkTu-BzSpFjjaRk9OGsnTH31cycQ@mail.gmail.com">
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<div dir="ltr">
<div>
<div>Is it normal for a file to be listed more than once in
darshan output for the same process rank?<br>
<br>
</div>
I've got a logfile (generated by runtime
version 2.3.0) for which darshan-parser (version 2.3.1)
outputs stats for a few of the accessed files twice. <br>
<font size="1"><span style="font-family:monospace,monospace"><br>
$ darshan-parser
stonej_vmd_BLUEWATERS_id1340701_1-19-39124-18216319424980466532_1.darshan.gz
| nl | egrep '41554.CP_F_(OPEN|CLOSE)_TIMESTAMP'<br>
388 -1 14116071308113141554
CP_F_OPEN_TIMESTAMP 0.314585
.../atomselect.tcl /mnt/a lustre<br>
391 -1 14116071308113141554
CP_F_CLOSE_TIMESTAMP 0.363727
.../atomselect.tcl /mnt/a lustre<br>
30034 -1 14116071308113141554
CP_F_OPEN_TIMESTAMP 0.398742
.../atomselect.tcl /mnt/a lustre<br>
30037 -1 14116071308113141554
CP_F_CLOSE_TIMESTAMP 0.399672
.../atomselect.tcl /mnt/a lustre<br>
</span></font><br>
</div>
<div>
<div>It looks like the file was opened and closed twice, but
when I tried this in a test program with darshan runtime and
util version 2.3.1, the accesses were combined into one set
of records in Darshan. I'm wondering if this is normal and I
should deal with it when loading darshan data into a
database, or if this file is flawed and I can ignore it.<br>
<br>
</div>
<div>thanks,<br>
</div>
<div>Ed<br>
<br>
</div>
</div>
</div>
<br>
<fieldset class="mimeAttachmentHeader"></fieldset>
<br>
<pre wrap="">_______________________________________________
Darshan-users mailing list
<a class="moz-txt-link-abbreviated" href="mailto:Darshan-users@lists.mcs.anl.gov">Darshan-users@lists.mcs.anl.gov</a>
<a class="moz-txt-link-freetext" href="https://lists.mcs.anl.gov/mailman/listinfo/darshan-users">https://lists.mcs.anl.gov/mailman/listinfo/darshan-users</a>
</pre>
</blockquote>
<p><br>
</p>
</body>
</html>