[BUG]: Using cluster manager as "slurm" is not working #1110
Replies: 5 comments 6 replies
-
|
If you are using only one node, you don't need to use the cluster manager, it should work with normal settings. The error is weird but might be from conflicting Julia installations somewhere in your system |
Beta Was this translation helpful? Give feedback.
-
|
Thank you for the response! This is a test case where I am running only on one node. But I intend to use multiple nodes. Regarding the error, it is indeed weird; maybe you might be right regarding the multiple Julia installations. I will try to fix that and see if it will work. |
Beta Was this translation helpful? Give feedback.
-
|
Oh wait. I think I might know what it is and it could be a real bug. I just noticed you are using a template expression. Can you share your template expression definition? I think it could be that the template expression function is not getting defined on the worker nodes. I've never actually tried using template expressions in a distributed setup before. |
Beta Was this translation helpful? Give feedback.
-
|
@anirudh-nemmani you can ask for a specific Julia version like this: import juliapkg
juliapkg.require_julia("~1.11")
# Make sure to import pysr only afterwards
import pysr |
Beta Was this translation helpful? Give feedback.
-
|
@anirudh-nemmani can you please try installing the master branch version of pysr? I fixed some slurm issues (and now actually have testing!!) in #1111. I'm curious if it fixes the issues you saw. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
What happened?
I started running PySR in a cluster with the following configuration.
And when I submit it in Slurm with the following Slurm settings
When I run with this configuration, I am noticing that in each worker, when Julia is initialised and the code is running, it gives an error during compilation.
I attached the entire logs also
Version
1.5.9
Operating System
Linux
Package Manager
Conda
Interface
Script (i.e.,
python3 script.py)Relevant log output
Extra Info
No response
Beta Was this translation helpful? Give feedback.
All reactions