<div dir="ltr"><div>Hello,</div><div>Here is the log I generated, so total_STDIO_F_META_TIME is 67 seconds, runtime is (end-start+1)=3 seconds (as computed in <a href="http://darshan-job-summary.pl">darshan-job-summary.pl</a>), and nprocs is 12</div><div>So that would mean I have 67 / 36 > 1 Metadata time percentage, which seems weird.</div><div>Is the log file wrong, or maybe the application I want to analyze I/O has a strange behavior ?<br></div><div><br></div><div>Thank you for your help.</div><div>Cordially, Florian</div><div><br></div><div><br></div><div class="gmail_quote"><div dir="ltr" class="gmail_attr">Le mer. 4 août 2021 à 16:26, Harms, Kevin <<a href="mailto:harms@alcf.anl.gov">harms@alcf.anl.gov</a>> a écrit :<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">Florian,<br>
<br>
that sounds like an issue. If you can provide the log, we can take a closer look at the counters.<br>
<br>
kevin<br>
<br>
________________________________________<br>
From: Florian Lecomte <<a href="mailto:flo.lecomte17@gmail.com" target="_blank">flo.lecomte17@gmail.com</a>><br>
Sent: Wednesday, August 4, 2021 3:14 AM<br>
To: Harms, Kevin<br>
Cc: <a href="mailto:darshan-users@lists.mcs.anl.gov" target="_blank">darshan-users@lists.mcs.anl.gov</a><br>
Subject: Re: [Darshan-users] Does Darshan uses CPU time ?<br>
<br>
In fact, what I wanted to know is : how does Darshan get the read times, write times and metadata times ? I suppose it uses CPU clocks, but I use several nodes, with 2 processes on each node, and each node has 64 cores, So I wanted to know if the average time per process was already divided by the number of cores too, because cores work in parallel, and so this time couldn't be compared with real runtime.<br>
I can send you the log file later today but basically, I have 36 secs of metadata operations, 3 seconds of runtime and 10 processes.<br>
<br>
Thank you.<br>
Cordially, Florian<br>
<br>
Le mar. 3 août 2021 à 17:20, Harms, Kevin <<a href="mailto:harms@alcf.anl.gov" target="_blank">harms@alcf.anl.gov</a><mailto:<a href="mailto:harms@alcf.anl.gov" target="_blank">harms@alcf.anl.gov</a>>> a écrit :<br>
Florian,<br>
<br>
STDIO_F_META_TIME should be the time spent in metadata accumulated across the processes if it is shared. The rank for the file would be -1 if it is shared.<br>
<br>
So for example:<br>
4 processes run for 10 seconds<br>
each process opens foo.txt for 1 second<br>
1 process stats foo.txt for 1 second<br>
<br>
the results would be foo.txt shows 5 seconds of meta time for foo.txt. So you can do 5 seconds / 4 processes for an average of 1.25s or you could do 5s / (10s * 4p) = 0.125 or 12.5%.<br>
<br>
If the rank value is > -1, the the reported time is just for that one rank and you need to potentially sum them up.<br>
<br>
If you meta time for a single file exceeds the runtime * number of processes, then something must be wrong with the timer collection. Can you send the log file? You can also look at the _START_TIMESTAMP and _END_TIMESTAMP and see if those agree with runtime or not?<br>
<br>
kevin<br>
<br>
<br>
<br>
________________________________________<br>
From: Darshan-users <<a href="mailto:darshan-users-bounces@lists.mcs.anl.gov" target="_blank">darshan-users-bounces@lists.mcs.anl.gov</a><mailto:<a href="mailto:darshan-users-bounces@lists.mcs.anl.gov" target="_blank">darshan-users-bounces@lists.mcs.anl.gov</a>>> on behalf of Florian Lecomte <<a href="mailto:flo.lecomte17@gmail.com" target="_blank">flo.lecomte17@gmail.com</a><mailto:<a href="mailto:flo.lecomte17@gmail.com" target="_blank">flo.lecomte17@gmail.com</a>>><br>
Sent: Tuesday, August 3, 2021 9:37 AM<br>
To: <a href="mailto:darshan-users@lists.mcs.anl.gov" target="_blank">darshan-users@lists.mcs.anl.gov</a><mailto:<a href="mailto:darshan-users@lists.mcs.anl.gov" target="_blank">darshan-users@lists.mcs.anl.gov</a>><br>
Subject: [Darshan-users] Does Darshan uses CPU time ?<br>
<br>
Good evening,<br>
I'd like to know if I have to divide metrics by the number of CPUs of the machine I use if I want to know the percentage of time spent on write operations for example.<br>
Because when I divide for example STDIO_F_META_TIME by [real runtime (time spent in "real world") * number of processes], I often get something bigger than 1, which is not supposed to happen.<br>
So to sum it up : Does Darshan give the average metric value per process per CPU, or only per process, and so it can not be compared with real spent time ?<br>
<br>
Thank you very much.<br>
Cordially, Florian, student in the HPC field.<br>
</blockquote></div></div>