[Darshan-users] Darshan error on Cray system with static compilation

Carns, Philip H. carns at mcs.anl.gov
Sat Jul 25 13:24:35 CDT 2020


Hi George,

We've ever seen that assertion triggered before as far as I know (it's just defensive programming, not something that is supposed to happen).  It indicates that Darshan observed inconsistent results out of a binary search tree; possibly brought on by a memory corruption of some sort?

Unfortunately I'm not sure what to suggest on this one; we might need more information or a reproducer.

The application might have an I/O workload that triggers a buggy code path in Darshan.  It's also plausible that there is a memory corruption outside of Darshan (in the application or another library) that is just impacting that Darshan data structure by chance.

-Phil

________________________________
From: Darshan-users <darshan-users-bounces at lists.mcs.anl.gov> on behalf of Markomanolis, George <markomanolig at ornl.gov>
Sent: Thursday, July 23, 2020 2:36 PM
To: darshan-users at lists.mcs.anl.gov <darshan-users at lists.mcs.anl.gov>
Subject: [Darshan-users] Darshan error on Cray system with static compilation


Hi,



I just send an error that we can’t reproduce, it happens sometimes and it is on a system that I don’t even have access but they informed me about this error:



fms_MOM6_SIS2_compile.x: lib/darshan-common.c:262: darshan_track_common_val_counters: Assertion `found == counter' failed. forrtl: error (76): Abort trap signal Image PC Routine Line Source



This is a Cray system with static compilation. This error kills the application. Do you have any idea or it is difficult with so minimal information?



Regards,

George


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/darshan-users/attachments/20200725/975718cf/attachment.html>


More information about the Darshan-users mailing list