Numpy 可变切片大小（可能为零）-解网

问：

假设我有一些时间序列数据：

import numpy as np
import matplotlib.pyplot as plt
np.random.seed(42)
x = np.linspace(0, 10, num=100)
time_series = np.sin(x) + np.random.random(100)
plt.plot(x, time_series)

如果我想将时间序列“延迟”一些时间序列，我可以这样做：

delay = 10
x_delayed = x[delay:]
time_series_delayed = time_series[:-delay]

plt.plot(x, time_series, label='original')
plt.plot(x_delayed, time_series_delayed, label='delayed')
plt.legend()

这一切都很好，但我想保持代码干净，同时仍然允许为零。就目前而言，我收到一个错误，因为切片只是计算出始终是空切片，而不是完整的数组。delaymy_arr[:-0]my_arr[:0]

>>> time_series[:-0]
array([], dtype=float64)

这意味着，如果我想对零延迟与原始数组相同的想法进行编码，则每次使用切片时都必须进行特殊情况处理。这很乏味且容易出错：

# Make 3 plots, for negative, zero, and positive delays
for delay in (0, 5, -5):

    if delay > 0:
        x_delayed = x[delay:]
        time_series_delayed = time_series[:-delay]

    elif delay < 0:
        # Negative delay is the complement of positive delay
        x_delayed = x[:delay]
        time_series_delayed = time_series[-delay:]

    else:
        # Zero delay just copies the array
        x_delayed = x[:]
        time_series_delayed = time_series[:]
    # Add the delayed time series to the plot
    plt.plot(
        x_delayed, 
        time_series_delayed, 
        label=f'delay={delay}',
        # change the alpha to make things less cluttered
        alpha=1 if delay == 0 else 0.3
    )
plt.legend()

我看过麻木的切片对象和np._s，但我似乎无法弄清楚。

有没有一种简洁/pythonic 的方法来编码零延迟是原始数组的想法？

python numpy 索引切片

啊，我明白了 - 我想不出一种方法来实现这一目标，而没有基于标志的声明和单独的行为。这里有一些食谱可以同时使用这两个符号 - stackoverflow.com/questions/30399534/... - 但是（a）通常它们在引擎盖下有一个声明，（b）它们不是减少数组的长度，而是将 NaN 抛在后面。（当然，删除这些 NaN 很容易。ifif

0赞 beyarkay 5/15/2023 #2

我采用的解决方案使用的事实相当于：my_arr[2:]my_arr[2:None]

arr[(d if d > 0 else None):(d if d < 0 else None)]

更具可读性：

arr = [0, 1, 2, 3, 4, 5]
delay = 3

start_delay = delay if delay > 0 else None
finish_delay = delay if delay < 0 else None

delayed_arr = arr[start_delay:finish_delay]

用一个很好的方法包装起来，并用一些断言来证明它有效：

def delay_array(array, delay):
    """Delays the values in `array` by the amount `delay`.

    Regular slicing struggles with this since negative slicing (which goes from
    the end of the array) and positive slicing (going from the front of the
    array) meet at zero and don't play nicely.

    We use the fact that Python's slicing syntax treats `None` as though it
    didn't exist, so `arr[2:]` is equivalent to `arr[2:None]`.

    This can be used on numpy arrays, but also works on native python lists.
    """
    start_index = delay if delay > 0 else None
    finish_index = delay if delay < 0 else None
    return array[start_index:finish_index]

arr = [0, 1, 2, 3, 4, 5]
# Zero delay results in the same array
assert delay_array(arr,  0) == [0, 1, 2, 3, 4, 5]

# Delay greater/less than zero removes `delay` elements from the front/back
# of the array
assert delay_array(arr, +3) == [         3, 4, 5]
assert delay_array(arr, -3) == [0, 1, 2,        ]

# A delay longer than the array results in an empty array
assert delay_array(arr, +6) == []
assert delay_array(arr, -6) == []

总而言之：

def delay_array(array, delay):
    start_index = delay if delay > 0 else None
    finish_index = delay if delay < 0 else None
    return array[start_index:finish_index]

np.random.seed(42)
x = np.linspace(0, 10, num=100)
time_series = np.sin(x) + np.random.random(100)

for delay in (0, 5, -5):
    x_delayed = delay_array(x, delay)
    time_series_delayed = delay_array(time_series, -delay)
    plt.plot(
        x_delayed, 
        time_series_delayed, 
        label=f'delay={delay}',
        alpha=1 if delay == 0 else 0.3
    )
plt.legend()

上一个：你能用开始和停止规则切片一个numpy 2D数组，而不是在索引处，而是在大于1的值上吗？

下一个：如何在 numpy 中使用高级索引从数组中复制/切片特定部分？

Numpy 可变切片大小（可能为零）

Numpy variable slice size (possibly zero)

评论

评论