Batch-wise processing or image-by-image processing? (DINO V1)

Question:

I have been trying to recreate the DINO V1 training setup for a personal project. To do that, I took most of the code from this repository: https://github.com/facebookresearch/dino

Right now I am almost done, except for one part of the main_dino.py file. There is a function called train_one_epoch, and on line 318 it has:

teacher_output = teacher(images[:2])  # only the 2 global views pass through the teacher

Now, I know how PyTorch tensor indexing/slicing works. So if images is a batch of images with the structure:

(batch_size, n_crops, C, H, W)

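how does images[:2] give you the global crops of all the images in a given batch?
Are they processing images batch-wise here, or is "images" just a list containing multiple crops of a single input image?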
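
One way to check which reading is right is to look at what a DataLoader returns when each dataset item is a list of crops. The sketch below is an illustration under stated assumptions, not code from the repository: ToyMultiCropDataset is a hypothetical stand-in that returns 2 global crops (224x224) followed by 6 local crops (96x96) per image, and the batch size of 4 is arbitrary. With PyTorch's default collate function, a list of crops per sample is turned into a list of batched tensors, one per crop position.

```python
import torch
from torch.utils.data import DataLoader, Dataset


class ToyMultiCropDataset(Dataset):
    """Hypothetical stand-in for an image dataset with a multi-crop transform."""

    def __init__(self, n_images=8, n_local=6):
        self.n_images = n_images
        self.n_local = n_local

    def __len__(self):
        return self.n_images

    def __getitem__(self, idx):
        # Random tensors replace real augmented crops; only the shapes matter here.
        global_crops = [torch.randn(3, 224, 224) for _ in range(2)]
        local_crops = [torch.randn(3, 96, 96) for _ in range(self.n_local)]
        return global_crops + local_crops, 0  # (list of crops, dummy label)


loader = DataLoader(ToyMultiCropDataset(), batch_size=4)
images, _ = next(iter(loader))

# images is a Python list with one entry per crop position,
# and each entry is a batched tensor of shape (batch_size, C, H, W).
print(type(images).__name__, len(images))
for i, crop_batch in enumerate(images):
    print(i, tuple(crop_batch.shape))

# List slicing, not tensor slicing: the first two entries are the global crops
# for every image in the batch.
global_views = images[:2]
```

Under these assumptions, images[:2] is a slice of a list, so it selects the first two crop positions for all images in the batch rather than the first two images.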

Tags: machine-learning, deep-learning, pytorch, training-data, self-supervised-learning

class MultiCropWrapper(nn.Module):
    """
    Perform forward pass separately on each resolution input.
    The inputs corresponding to a single resolution are clubbed and single
    forward is run on the same resolution inputs. Hence we do several
    forward passes = number of different resolutions used. We then
    concatenate all the output features and run the head forward on these
    concatenated features.
    """

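So this MultiCropWrapper class handles the forward pass, and its docstring says that it runs several forward passes, one for each distinct input resolution.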
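
The behavior described in that docstring can be sketched roughly as follows. This is a simplified illustration, not the repository's implementation: MultiCropSketch, the toy backbone, and the toy head are hypothetical names chosen for the example, and the input is assumed to be a list of batched tensors ordered so that crops of the same resolution are adjacent.

```python
import torch
import torch.nn as nn


class MultiCropSketch(nn.Module):
    """Simplified sketch: one backbone pass per run of same-resolution crops."""

    def __init__(self, backbone, head):
        super().__init__()
        self.backbone = backbone
        self.head = head

    def forward(self, x):
        if not isinstance(x, list):
            x = [x]
        # Use the last dimension (width) to tell resolutions apart.
        widths = torch.tensor([crop.shape[-1] for crop in x])
        # Count how many consecutive crops share each resolution.
        _, counts = torch.unique_consecutive(widths, return_counts=True)
        end_indices = torch.cumsum(counts, 0).tolist()

        features, start = [], 0
        for end in end_indices:
            # Stack same-resolution crops along the batch axis, then run one pass.
            features.append(self.backbone(torch.cat(x[start:end])))
            start = end
        # Concatenate all features and run the head once.
        return self.head(torch.cat(features))


# Toy backbone that accepts any spatial size (assumption for illustration only).
backbone = nn.Sequential(nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(3, 16))
head = nn.Linear(16, 8)
model = MultiCropSketch(backbone, head)

batch_size = 4
crops = [torch.randn(batch_size, 3, 224, 224) for _ in range(2)]
crops += [torch.randn(batch_size, 3, 96, 96) for _ in range(6)]

student_out = model(crops)      # all 8 crops, two backbone passes
teacher_out = model(crops[:2])  # only the 2 global crops, one backbone pass
print(tuple(student_out.shape), tuple(teacher_out.shape))
```

In this sketch the output rows are ordered crop by crop, so the first batch_size rows belong to the first global crop of every image in the batch.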

