<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
<style type="text/css" style="display:none"><!--P{margin-top:0;margin-bottom:0;} --></style>
</head>
<body dir="ltr" style="font-size:12pt;color:#000000;background-color:#FFFFFF;font-family:Calibri,Arial,Helvetica,sans-serif;">
<p>Hi,</p>
<p><br>
</p>
<p>Do you have any performance results comparing libxsmm with the stand-alone "mxm"?</p>
<p><br>
</p>
<p>In the master branch of Nek5000, the libxsmm function "libxsmm_dgemm" is called only in the file "mxm_wrapper.f":</p>
<p>...<br>
</p>
<p>#ifdef XSMM</p>
<p> call libxsmm_dgemm('N','N',n1,n3,n2,1.0,a,n1,b,n2,0.0,c,n1)<br>
#endif<br>
</p>
<p>...</p>
<p><br>
</p>
<p>I could not find any of the optimization work, such as dispatch and streaming updates in the kernels, that was used in e.g. NekBox.
</p>
<p><br>
</p>
<p>What is the current status of the libxsmm integration, or of other SIMD techniques, within Nek5000?</p>
<p><br>
</p>
<p>Thanks. /Jing <br>
</p>
</body>
</html>