如何从 Pandas 上的嵌套列表列中获取最小值？为什么numpy.min（）在numpy.mean（）工作的情况下不起作用？-解网

问：

我有一小段代码需要修改，我没有找到为什么 np.mean（）在 pandas 列由嵌套列表组成的特定情况下 np.min（）不起作用。也许这里有人可以澄清一下？

这里的代码片段非常有效：

import pandas as pd
import numpy as np


def transformation(custom_df):
    dic = dict(zip(custom_df['customers'], custom_df['values']))
    custom_df['values'] = np.where(custom_df['values'].isna() & (custom_df['valid_neighbors'] >= 1),
                                   custom_df['neighbors'].apply(
                                       lambda row: np.mean([dic[v] for v in row if dic.get(v)])),
                                   custom_df['values'])
    return custom_df


customers = [1, 2, 3, 4, 5, 6]
values = [np.nan, np.nan, 10, np.nan, 11, 12]
neighbors = [[6], [3], [], [3, 5], [6], [5]]
vn = [1, 1, 0, 2, 1, 1]
df2 = pd.DataFrame({'customers': customers, 'values': values, 'neighbors': neighbors, 'valid_neighbors': vn})


   customers  values neighbors  valid_neighbors
0          1     NaN       [6]                1
1          2     NaN       [3]                1
2          3    10.0        []                0
3          4     NaN    [3, 5]                2
4          5    11.0       [6]                1
5          6    12.0       [5]                1

df2 = transformation(df2)

结果：

   customers  values neighbors  valid_neighbors
0          1    12.0       [6]                1
1          2    10.0       [3]                1
2          3    10.0        []                0
3          4    10.5    [3, 5]                2
4          5    11.0       [6]                1
5          6    12.0       [5]                1

但是，如果我在“transformation（）”函数上将 np.mean（）更改为 np.min（），它将返回一个 ValueError，让我想知道为什么当我调用 np.mean（）函数时它不会发生：

ValueError: zero-size array to reduction operation minimum which has no identity

我想知道我没有满足哪些条件，以及我能做些什么来获得预期的结果，这将是：

   customers  values neighbors  valid_neighbors
0          1    12.0       [6]                1
1          2    10.0       [3]                1
2          3    10.0        []                0
3          4    10.0    [3, 5]                2
4          5    11.0       [6]                1
5          6    12.0       [5]                1

pandas numpy numpy-ndarray 嵌套列表

def transformation(custom_df):
    dic = dict(zip(custom_df['customers'], custom_df['values']))
    custom_df['values'] = np.where(custom_df['values'].isna() & (custom_df['valid_neighbors'] >= 1),
                                   custom_df['neighbors'].apply(
                                       lambda row: np.min([dic[v] for v in row if dic.get(v)]) if len(row) else 0),
                                   custom_df['values'])
    return custom_df

上一个：如何从数据框列表中删除行

下一个：如何通过将列的字符串列表与python组合来创建列表列表？

如何从 Pandas 上的嵌套列表列中获取最小值？为什么numpy.min（）在numpy.mean（）工作的情况下不起作用？

How to get the minimum value from a nested-list-column on Pandas? Why numpy.min() doesn't work in the situation that numpy.mean() works?

评论

评论