Corrupted mesh and solution files when using MPI and NFS (Network File System) #229
Unanswered
TRPrasanna
asked this question in
Q&A
Replies: 1 comment 4 replies
-
I asked the horses developers at UPM, and we don't have experience running horses on NFS storage.
-
Has anyone experienced corruption of the mesh and solution files when running on more than one compute node of a cluster with network file system (NFS) storage? It seems to happen to me when using approximately 6 or more compute nodes. The corrupted file (the mesh file, the solution file, or both) triggers the error "Array found in file dimensions does not match that of the introduced variable. File dimensions:" in subroutine getSolutionFileArrayDimensions in Solver/src/libs/io/SolutionFile.f90.

The bug seems very similar to one I faced with a different code some years ago. In short, the corrupted files contain zeros where there should be data, which makes the program crash in the aforementioned subroutine, whether it is horses3d.ns attempting to resume the simulation from the file or horses2plt attempting to convert it to a post file. On second thought, resuming from the corrupted file at a different polynomial order may also produce "Error reading restart file: wrong dimension for element xx" in addition to the error above. Is anyone else facing this bug?
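For anyone trying to confirm the same symptom, here is a minimal Python sketch that scans a file for long runs of zero bytes. It is not part of horses3d: the function name `find_zero_runs`, the `min_run` threshold and the `chunk_size` value are all made up for illustration. It also assumes the corruption appears as contiguous zero-filled blocks, which may not hold in every case. Valid data can contain zeros too, so a hit is a hint rather than proof.

```python
import re

# One or more consecutive zero bytes.
ZERO_RUN = re.compile(rb"\x00+")


def find_zero_runs(path, min_run=4096, chunk_size=1 << 20):
    """Return (offset, length) for each run of at least min_run zero bytes.

    min_run is an arbitrary illustrative threshold, not a value taken
    from horses3d or from any file-format specification.
    """
    runs = []
    pending = None  # [start, length] of the most recent zero run seen
    offset = 0      # absolute file offset of the current chunk
    with open(path, "rb") as f:
        while chunk := f.read(chunk_size):
            for m in ZERO_RUN.finditer(chunk):
                start = offset + m.start()
                length = m.end() - m.start()
                if pending and pending[0] + pending[1] == start:
                    # Run continues across a chunk boundary: extend it.
                    pending[1] += length
                else:
                    if pending and pending[1] >= min_run:
                        runs.append((pending[0], pending[1]))
                    pending = [start, length]
            offset += len(chunk)
    if pending and pending[1] >= min_run:
        runs.append((pending[0], pending[1]))
    return runs
```

Running this on a mesh or solution file right after a job finishes, and again on a copy of the same run written to node-local storage, should show whether the zero-filled regions only appear on the NFS mount.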