如何在 pandas 中的特定列索引处插入列？-解网

问：

我可以在 pandas 中的特定列索引处插入列吗？

import pandas as pd
df = pd.DataFrame({'l':['a','b','c','d'], 'v':[1,2,1,2]})
df['n'] = 0

这会把列作为的最后一列，但是有没有办法告诉放在开头？ndfdfn

Python 索引 pandas

0赞 cs95 2/6/2019

在 DataFrame 的开头（最左端）插入一列 - 更多解决方案 + 用于插入任何序列（而不仅仅是常量值）的通用解决方案。

答：

17赞 Nic 9/7/2013 #1

您可以尝试将列提取为列表，根据需要对其进行按摩，然后重新索引您的数据帧：

>>> cols = df.columns.tolist()
>>> cols = [cols[-1]]+cols[:-1] # or whatever change you need
>>> df.reindex(columns=cols)

   n  l  v
0  0  a  1
1  0  b  2
2  0  c  1
3  0  d  2

编辑：这可以在一行中完成;但是，这看起来有点丑陋。也许会有一些更干净的提议......

>>> df.reindex(columns=['n']+df.columns[:-1].tolist())

   n  l  v
0  0  a  1
1  0  b  2
2  0  c  1
3  0  d  2

696赞 Jeff 9/7/2013 #2

请参阅文档：http://pandas.pydata.org/pandas-docs/stable/generated/pandas.DataFrame.insert.html

使用 loc = 0 将在开头插入

df.insert(loc, column, value)

df = pd.DataFrame({'B': [1, 2, 3], 'C': [4, 5, 6]})

df
Out: 
   B  C
0  1  4
1  2  5
2  3  6

idx = 0
new_col = [7, 8, 9]  # can be a list, a Series, an array or a scalar   
df.insert(loc=idx, column='A', value=new_col)

df
Out: 
   A  B  C
0  7  1  4
1  8  2  5
2  9  3  6

33赞 Peter Maguire 1/25/2017

对于将来的用户，新参数是“loc”、“column”和“value”。源

1赞 mLstudent33 10/18/2020

打印后，我数了数并重新计算了值的长度和索引的长度，但一直得到ValueError: Length of values does not match length of index

11赞 Sulphur 5/10/2021

对于将来的用户，如果要借助特定列名而不是索引进行插入，请使用：。不直接支持列名用例，但您可以从列名中获取列索引并传递该索引。df.insert(df.columns.get_loc('col_name'), 'new_col_name', ser_to_insert)insert

0赞 DanielBell99 4/28/2022

替换为valuepd.Series(value_list)

75赞 Hugo Vares 12/23/2019 #3

如果要为所有行使用单个值：

df.insert(0,'name_of_column','')
df['name_of_column'] = value

编辑：

您还可以：

df.insert(0,'name_of_column',value)

0赞 Brian Wylie 2/15/2021

这正是我需要的。谢谢:)df.insert(0,'name_of_column',value)

3赞 rra 6/19/2020 #4

这是一个非常简单的答案（只有一行）。

在将“n”列添加到 df 后，您可以执行此操作，如下所示。

import pandas as pd
df = pd.DataFrame({'l':['a','b','c','d'], 'v':[1,2,1,2]})
df['n'] = 0

df
    l   v   n
0   a   1   0
1   b   2   0
2   c   1   0
3   d   2   0

# here you can add the below code and it should work.
df = df[list('nlv')]
df

    n   l   v
0   0   a   1
1   0   b   2
2   0   c   1
3   0   d   2



However, if you have words in your columns names instead of letters. It should include two brackets around your column names. 

import pandas as pd
df = pd.DataFrame({'Upper':['a','b','c','d'], 'Lower':[1,2,1,2]})
df['Net'] = 0
df['Mid'] = 2
df['Zsore'] = 2

df

    Upper   Lower   Net Mid Zsore
0   a       1       0   2   2
1   b       2       0   2   2
2   c       1       0   2   2
3   d       2       0   2   2

# here you can add below line and it should work 
df = df[list(('Mid','Upper', 'Lower', 'Net','Zsore'))]
df

   Mid  Upper   Lower   Net Zsore
0   2   a       1       0   2
1   2   b       2       0   2
2   2   c       1       0   2
3   2   d       2       0   2

0赞 Amir 1/28/2021

如果我们想将另一列添加到 df 的中，并将几列添加到 df 的末尾，该怎么办？df_otherloc 0df_other

19赞 mhc 2/19/2021 #5

df.insert(loc, column_name, value)

如果没有其他同名列，这将起作用。如果数据帧中已存在具有您提供的名称的列，它将引发 ValueError。

您可以传递带有 value 的可选参数，以创建具有现有列名的新列。allow_duplicatesTrue

下面是一个示例：



    >>> df = pd.DataFrame({'b': [1, 2], 'c': [3,4]})
    >>> df
       b  c
    0  1  3
    1  2  4
    >>> df.insert(0, 'a', -1)
    >>> df
       a  b  c
    0 -1  1  3
    1 -1  2  4
    >>> df.insert(0, 'a', -2)
    Traceback (most recent call last):
      File "", line 1, in 
      File "C:\Python39\lib\site-packages\pandas\core\frame.py", line 3760, in insert
        self._mgr.insert(loc, column, value, allow_duplicates=allow_duplicates)
      File "C:\Python39\lib\site-packages\pandas\core\internals\managers.py", line 1191, in insert
        raise ValueError(f"cannot insert {item}, already exists")
    ValueError: cannot insert a, already exists
    >>> df.insert(0, 'a', -2,  allow_duplicates = True)
    >>> df
       a  a  b  c
    0 -2 -1  1  3
    1 -2 -1  2  4

0赞 nikhil int 3/30/2022

这太棒了，实际上在 Pandas 官方文档中也提出了建议。谢谢你提出这个问题@mhc

4赞 Ka Wa Yip 3/15/2022 #6

一般的 4 行例程

每当您想创建新列并插入到特定位置时，都可以使用以下 4 行例程。loc

df['new_column'] = ... #new column's definition
col = df.columns.tolist()
col.insert(loc, col.pop()) #loc is the column's index you want to insert into
df = df[col]

在您的示例中，它很简单：

df['n'] = 0
col = df.columns.tolist()
col.insert(0, col.pop()) 
df = df[col]

上一个：使用 tidyverse 重新定位行

下一个：将字典条目转换为变量 [duplicate]

如何在 pandas 中的特定列索引处插入列？

how do I insert a column at a specific column index in pandas?

评论

评论

评论

这是一个非常简单的答案（只有一行）。

评论

评论

一般的 4 行例程