python两列数据连接_python – 使用两列之间的映射在pandas数据帧中创建链

这是一个测试数据帧.我想使用EmpID和MgrID之间的关系来进一步映射新列中的MgrID管理器.

Test_df = pd.DataFrame({'EmpID':['1','2','3','4','5','6','7','8','9','10'],

'MgrID':['4','4','4','6','8','8','10','10','10','12']})

Test_df

如果我为初始关系创建一个字典,我将能够创建链的第一个链接,但我担心我需要遍历每个新列以创建一个新列.

ID_Dict = {'1':'4',

'2':'4',

'3':'4',

'4':'6',

'5':'8',

'6':'8',

'7':'10',

'8':'10',

'9':'10',

'10':'12'}

Test_df['MgrID_L2'] = Test_df['MgrID'].map(ID_Dict)

Test_df

最有效的方法是什么?

谢谢!

最佳答案 这是一个简单的while循环方式.注意我将MgrID的名称更改为MgrID_1

Test_df = pd.DataFrame({'EmpID':['1','2','3','4','5','6','7','8','9','10'],

'MgrID_1':['4','4','4','6','8','8','10','10','10','12']})

d = Test_df.set_index('EmpID').MgrID_1.to_dict()

s = 2

while s:

Test_df['MgrID_'+str(s)] = Test_df['MgrID_'+str(s-1)].map(d)

if Test_df['MgrID_'+str(s)].isnull().all():

Test_df = Test_df.drop(columns='MgrID_'+str(s))

s = 0

else:

s+=1

Ouptut:Test_df

EmpID MgrID_1 MgrID_2 MgrID_3 MgrID_4 MgrID_5

0 1 4 6 8 10 12

1 2 4 6 8 10 12

2 3 4 6 8 10 12

3 4 6 8 10 12 NaN

4 5 8 10 12 NaN NaN

5 6 8 10 12 NaN NaN

6 7 10 12 NaN NaN NaN

7 8 10 12 NaN NaN NaN

8 9 10 12 NaN NaN NaN

9 10 12 NaN NaN NaN NaN

你可能感兴趣的:(python两列数据连接)