OpenFOAM9 "error while loading shared libraries"

Asked by: Tim Drake  Asked: 7/6/2023  Updated: 7/6/2023  Views: 66

Q:

Background:

runAirfoil2D.sh

echo "Running simulation with $numberOfSubdomains processes."

executable=/opt/openfoam9/platforms/linux64GccDPInt32Opt/bin/simpleFoam

mpirun --hostfile $machines -np $numberOfSubdomains --display-allocation $executable -parallel

Terminal error message:

mpiuser@master:$FOAM_RUN/tutorials/incompressible/simpleFoam$ ./runAirfoil2D.sh
Running simulation with 8 processes.

======================   ALLOCATED NODES   ======================
master: flags=0x11 slots=4 max_slots=0 slots_inuse=0 state=UP
acl-n1: flags=0x13 slots=4 max_slots=0 slots_inuse=0 state=UNKNOWN
=================================================================
/opt/openfoam9/platforms/linux64GccDPInt32Opt/bin/simpleFoam: error while loading shared libraries: libmomentumTransportModels.so: cannot open shared object file: No such file or directory
/opt/openfoam9/platforms/linux64GccDPInt32Opt/bin/simpleFoam: error while loading shared libraries: libmomentumTransportModels.so: cannot open shared object file: No such file or directory
/opt/openfoam9/platforms/linux64GccDPInt32Opt/bin/simpleFoam: error while loading shared libraries: libmomentumTransportModels.so: cannot open shared object file: No such file or directory
/opt/openfoam9/platforms/linux64GccDPInt32Opt/bin/simpleFoam: error while loading shared libraries: libmomentumTransportModels.so: cannot open shared object file: No such file or directory
--------------------------------------------------------------------------
Primary job  terminated normally, but 1 process returned
a non-zero exit code. Per user-direction, the job has been aborted.
--------------------------------------------------------------------------
--------------------------------------------------------------------------
mpirun detected that one or more processes exited with non-zero status, thus causing
the job to be terminated. The first process to do so was:

Process name: [[434,1],4]
Exit code:    127
--------------------------------------------------------------------------

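I'm putting together a cluster for my professor. There are 20 computers with 4 cores each, but for now I'm only working with the master and the first node.

Whenever I run simpleFoam with 4 subdomains it runs fine, since it only uses the 4 cores of the master machine. But when I try to include the first node (so 8 subdomains), it throws an error saying it cannot find the shared object file, even though the file is there: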

mpiuser@master:$FOAM_RUN/tutorials/incompressible/simpleFoam$ ldd /opt/openfoam9/platforms/linux64GccDPInt32Opt/bin/simpleFoam
linux-vdso.so.1 (0x00007ffe7bd25000)
libmomentumTransportModels.so => /opt/openfoam9/platforms/linux64GccDPInt32Opt/lib/libmomentumTransportModels.so (0x00007f18b8600000)

and so on....

I have a hello world script that works with OpenMPI and NFS, so I know my connections and mounts are correct.

Where am I going wrong?
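Note that the `ldd` check was run on master, where the OpenFOAM environment appears to be loaded, so it does not show what the loader resolves on acl-n1. Below is a minimal sketch of a helper for repeating the check on the node. `missing_libs` is a hypothetical name, not part of OpenFOAM or Open MPI. It only filters `ldd` output for unresolved entries.

```shell
#!/bin/sh
# missing_libs: read `ldd` output on stdin and print the name of every
# library the dynamic loader could not resolve (lines with "=> not found").
# Hypothetical helper for illustration only.
missing_libs() {
    awk '/=> not found/ { print $1 }'
}
```

Assuming passwordless SSH to the node, it could be used as `ssh acl-n1 ldd /opt/openfoam9/platforms/linux64GccDPInt32Opt/bin/simpleFoam | missing_libs`. If that prints libmomentumTransportModels.so, the file may well exist on the node while its directory is missing from the loader path of the non-interactive shell.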

Tags: Ubuntu, cluster-computing, OpenMPI, OpenFOAM

Comments

0 votes fdireito 7/12/2023
Is it visible on the child node?

A: No answers yet
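One way to act on this comment, as a sketch rather than a confirmed fix: check whether the OpenFOAM lib directory from the `ldd` output is on the `LD_LIBRARY_PATH` that a non-interactive shell on the node actually sees. `path_contains` is a hypothetical helper name. The assumption is that the node's remote shell does not load the OpenFOAM environment the way the login shell on master does.

```shell
#!/bin/sh
# path_contains LIST DIR: succeed if DIR is one of the colon-separated
# entries of LIST (for example the value of LD_LIBRARY_PATH).
# Hypothetical helper for illustration only.
path_contains() {
    case ":$1:" in
        *":$2:"*) return 0 ;;
        *)        return 1 ;;
    esac
}
```

Assuming passwordless SSH, a call could look like `path_contains "$(ssh acl-n1 'echo "$LD_LIBRARY_PATH"')" /opt/openfoam9/platforms/linux64GccDPInt32Opt/lib || echo "lib dir not on the node's loader path"`. If the directory turns out to be missing there, Open MPI's `mpirun -x LD_LIBRARY_PATH` option, which exports a variable from the launching shell to the remote ranks, may be worth trying.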