用于df选择两列 col_1,col_2然后以数据框格式存储在变量groupby中loca
Python代码
import pandas as pd
import numpy as np
df = pd.DataFrame(dict(Gender=['M', 'F', 'M', 'F',
'F', 'M', 'M', np.nan],
State=['NY','IL', 'NY', 'CA',
'IL', 'CA', 'CA', 'IL'],
col_1=[10,20,30,np.nan,40,50,60,70],
col_2=[1,6,2,4,6,9,np.nan,3]))
d = df.groupby(['Gender','State']).sum()
d.loc['F']
执行结果
col_1 col_2
State
CA 0.0 4.0
IL 60.0 12.0
预期结果 (数据框格式)
a =
col_1 col_2
0.0 4.0
60.0 12.0