为什么对 python 模块的变量所做的修改不会传播到新的并行进程？-解网

问：

我有一个令人尴尬的平行任务列表，列出了我想要执行的任务。目前，我正在将这些任务的配置作为模块导入。

单行 configuration.py 示例：

result_folder = "aFolder"

到目前为止，我一直在串联而不是并行调用我的函数

def embarassing(x, conf):
    print(x)
    print(conf.result_folder)
    # ... do complicated things and return a value

if __name__ == "main":
    import configuration as conf
    x = 1
    y = embarassing(x, conf)

现在，我更新了代码，以利用并行运行这些任务。

from dask.distributed import Client
# ...
if __name__ == "main":
    import configuration as conf
    client = Client(n_workers=1)
    x = 1
    future = client.submit(embarassing, x, conf)
    y = future.result()

这一切都很好。问题是有时我想运行一组临时案例，直到现在我总是可以添加

import configuration as conf
x = 2
conf.result_folder = "newFold"

代码将打印出来

2
newFold

但在并行代码下，它会打印

2
aFolder

为什么我不能再将此模块作为参数传递？

python-multiprocessing python-module dask-distributed

为什么对 python 模块的变量所做的修改不会传播到新的并行进程？

Why aren't modifications made to a python module's variables propagating to new parallel processes?

评论