0

用于df选择两列 col_1col_2然后以数据框格式存储在变量groupbyloca

Python代码

import pandas as pd
import numpy as np

df = pd.DataFrame(dict(Gender=['M', 'F', 'M', 'F',
                           'F', 'M', 'M', np.nan],
                   State=['NY','IL', 'NY', 'CA',
                          'IL', 'CA', 'CA', 'IL'],
                   col_1=[10,20,30,np.nan,40,50,60,70],
                   col_2=[1,6,2,4,6,9,np.nan,3]))

d = df.groupby(['Gender','State']).sum()
d.loc['F']

执行结果

       col_1    col_2
State       
CA     0.0      4.0
IL     60.0     12.0

预期结果 (数据框格式)

  a = 
           col_1    col_2
   
           0.0      4.0
           60.0     12.0
4

1 回答 1

1

您可以在获取 grupby 数据后重新设置索引d

print(d.loc['F'].reset_index(drop=True))

   col_1  col_2
0    0.0    4.0
1   60.0   12.0
于 2021-07-08T09:08:55.500 回答