[Nek5000-users] Problem on cluster

nek5000-users at lists.mcs.anl.gov nek5000-users at lists.mcs.anl.gov
Mon Apr 16 18:12:51 CDT 2018


Looks like you are using an outdated MPI implementation. Can you try again with a more recent version?
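Independent of the MPI version, the error file quoted below says that the fcntl(2) read lock requested by MPI-IO is not implemented on the file system holding the case. A minimal sketch for checking this directly, assuming Python 3 is available on the cluster; the function name `probe_read_lock` is made up for illustration and is not part of Nek5000 or MPICH:

```python
import fcntl
import os
import tempfile


def probe_read_lock(directory):
    """Try a blocking shared (read) lock on a scratch file in `directory`.

    Returns None if the lock was granted, or the errno value if it failed.
    """
    # The scratch file is opened read/write and removed when the block exits.
    with tempfile.NamedTemporaryFile(dir=directory) as scratch:
        try:
            # lockf() uses fcntl() record locks. LOCK_SH without LOCK_NB
            # maps to F_SETLKW with F_RDLCK, the request named in the log.
            fcntl.lockf(scratch.fileno(), fcntl.LOCK_SH)
            fcntl.lockf(scratch.fileno(), fcntl.LOCK_UN)
        except OSError as exc:
            return exc.errno
    return None


def describe(directory):
    """Return a one-line, human-readable summary of the probe result."""
    failure = probe_read_lock(directory)
    if failure is None:
        return f"fcntl read lock works in {directory}"
    return f"fcntl read lock failed in {directory}: {os.strerror(failure)}"
```

Calling `print(describe("/path/to/case"))` from a job on the compute nodes (the path is a placeholder) is more telling than running it on a login node, since the two may mount the file system with different options.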

On 16 Apr 2018, at 17:46, nek5000-users at lists.mcs.anl.gov wrote:

Dear Nek users, I'm having a problem when I use the latest version of Nek on an HPC cluster. I can compile, but my simulations abort shortly after they start. The log file of a typical case looks like this:


===========================================================


/----------------------------------------------------------\\
|      _   __ ______ __ __  ______  ____   ____   ____     |
|     / | / // ____// //_/ / ____/ / __ \\/ __ \\/ __ \\   |
|    /  |/ // __/  / ,<   /___ \\ / / / // / / // / / /    |
|   / /|  // /___ / /| | ____/ / / /_/ // /_/ // /_/ /     |
|  /_/ |_//_____//_/ |_|/_____/  \\___/ \\___/ \\___/      |
|                                                          |
|----------------------------------------------------------|
|                                                          |
| NEK5000:  Open Source Spectral Element Solver            |
| COPYRIGHT (c) 2008-2017 UCHICAGO ARGONNE, LLC            |
| Version:  17.0-rc1                                       |
| Web:      http://nek5000.mcs.anl.gov                     |
|                                                          |
\\----------------------------------------------------------/
                                                              

 Number of processors:          80
 REAL    wdsize      :           8
 INTEGER wdsize      :           4
 Timer accuracy      : 0.00E+00
   Reading /home/jrobinson/casos/Placa_6/Placa_6.rea                                                                                           
 Reading /home/jrobinson/casos/Placa_6/Placa_6.re2                                                                                           
 mapping elements to processors
 Reading /home/jrobinson/casos/Placa_6/Placa_6.ma2                                                                                           
 RANK     0 IEG    1754    1755    1756    1757    1758    1759    1760    1774
                   1775    1776    1777    1778    1779    1780    1794    1795
                   1796    1797    1798    1799    1800    1814    1815    1816
                   1817    1818    1819    1820    1834    1835    1836    1837
                   1838    1839    1840    1853    1854    1855    1856    1857
                   1858    1859    1860    1873    1874    1875    1876    1877
                   1878    1879    1880    1893    1894    1895    1896    1897
                   1898    1899    1913    1914    1915    1916    1917    1918
                   1919    1933    1934    1935    1936    1937    1938    1939
                   1953    1954    1955    1956    1957    1958    1974    1975
                   1976    1977    1978    1994    1995    1996    1997    1998
                   1999    2000    2014    2015    2016    2017    2018    2019
                   2020    9783    9784    9785    9786    9787    9788    9789
                   9790    9791    9792    9793    9794    9795    9796    9797
                   9798    9799    9800    9801    9802    9803    9804    9805
                   9806    9807    9808    9809    9810    9811    9812    9813
                   9814    9815    9816    9817    9818    9819    9820    9821
                   9822    9823    9824    9825    9826    9827    9828    9829
                   9830    9849    9855    9856    9861    9862
   element load imbalance:            1         150         151
 done :: mapping   0.32155     sec

    preading mesh 

=============================================================




So the last line is "preading mesh".

This doesn't give much information, but the cluster generates a file with the following errors (reproduced at the end of this message).

When I use an old version of Nek on this same cluster, I have no problem running my cases. The problem is that I need the latest version, because I'm using the exo2nek routine for my meshes generated with Trelis (Cubit).


Any idea what I could do?

Thank you all.

Juan Pablo.






=========================================================


This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd 4A,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 84, length 10800
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd 11,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 32484, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 3
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 3)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd 11,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd F,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 86484, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 8
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 8)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd 14,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 248484, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 23
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 23)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd F,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 108084, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 10
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 10)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd 10,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 259284, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 24
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 24)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd 12,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 118884, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 11
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 11)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd 14,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd 12,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 140484, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 13
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 13)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd 10,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 280884, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 26
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 26)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd F,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 151284, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 14
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 14)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd 14,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 291684, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 27
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 27)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd 12,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 162084, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 15
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 15)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd 10,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 302484, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 28
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 28)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd F,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 172884, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 16
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 16)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd 10,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 324084, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 30
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 30)
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 0
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 0)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd 13,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 334884, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 31
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 31)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd F,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 21684, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 2
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 2)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd 13,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 345684, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 32
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 32)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd F,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 43284, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 4
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 4)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd 13,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 356484, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 33
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 33)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd 11,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 54084, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 5
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 5)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd 12,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 367284, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 34
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 34)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd F,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 64884, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 6
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 6)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd 13,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 378084, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 35
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 35)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd 12,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 75684, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 7
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 7)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd 13,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 388884, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 36
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 36)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd 12,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 97284, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 9
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 9)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd 12,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 421284, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 39
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 39)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd F,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 129684, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 12
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 12)
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 216084, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 20
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 20)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd 13,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 183684, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 17
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 17)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd 13,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 226884, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 21
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 21)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd F,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 194484, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 18
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 18)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd 10,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 237684, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 22
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 22)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd 13,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 205284, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 19
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 19)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd 13,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 313284, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 29
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 29)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd 11,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 10884, length 10800
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd 13,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 399684, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 37
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 37)
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 1
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 1)
This requires fcntl(2) to be implemented. As of 8/25/2011 it is not. Generic MPICH Message: File locking failed in ADIOI_Set_lock(fd 13,cmd F_SETLKW/7,type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.
- If the file system is NFS, you need to use NFS version 3, ensure that the lockd daemon is running on all the machines, and mount the directory with the 'noac' option (no attribute caching).
- If the file system is LUSTRE, ensure that the directory is mounted with the 'flock' option.
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 410484, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 38
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 38)
slurmstepd: error: *** STEP 11073373.0 ON leftraru1 CANCELLED AT 2018-04-16T18:38:42 ***
ADIOI_Set_lock:: Function not implemented
ADIOI_Set_lock:offset 270084, length 10800
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 25
In: PMI_Abort(1, application called MPI_Abort(MPI_COMM_WORLD, 1) - process 25)
srun: Job step aborted: Waiting up to 32 seconds for job step to finish.
srun: error: leftraru1: tasks 0-19: Killed
srun: Terminating job step 11073373.0
srun: error: leftraru4: tasks 60-79: Killed
srun: error: leftraru3: tasks 40-59: Killed
srun: error: leftraru2: tasks 20-39: Killed
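The numeric fields in the messages above look hexadecimal (`fd F`, `fd 4A`, `return value FFFFFFFF`), so `errno 26` is probably 0x26 = 38 rather than decimal 26. A small sketch for decoding such a line under that assumption; the helper name `decode_lock_failure` is made up for illustration:

```python
import errno
import os
import re

# Matches the fd, return value and errno fields of the ADIOI_Set_lock message.
LOCK_FAILURE = re.compile(
    r"ADIOI_Set_lock\(fd (?P<fd>[0-9A-F]+),.*?"
    r"return value (?P<ret>[0-9A-F]+) and errno (?P<errno>[0-9A-F]+)"
)


def decode_lock_failure(line):
    """Parse one lock-failure line, reading the numeric fields as hex.

    Returns a dict with the decoded values, or None if the line does not match.
    """
    match = LOCK_FAILURE.search(line)
    if match is None:
        return None
    code = int(match.group("errno"), 16)
    return {
        "fd": int(match.group("fd"), 16),
        "errno": code,
        # errno numbering is platform-specific; these lookups use the
        # tables of the machine the script runs on.
        "name": errno.errorcode.get(code, "unknown"),
        "message": os.strerror(code),
    }


sample = ("File locking failed in ADIOI_Set_lock(fd 4A,cmd F_SETLKW/7,"
          "type F_RDLCK/0,whence 0) with return value FFFFFFFF and errno 26.")
info = decode_lock_failure(sample)
print(info)
```

On Linux, errno 38 is ENOSYS, whose message is "Function not implemented". That matches the `ADIOI_Set_lock:: Function not implemented` lines in the log.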





