将 pandas 数据框转换为唯一元组列表的最有效方法是什么?
在下面的代码中,我尝试提取包含所有唯一的元组列表PostalCode
and Age
.
from typing import NamedTuple, Sequence, Tuple
import pandas as pd
data = [["tom", 10, "ab 11"], ["nick", 14, "ab 22"], ["juli", 10, "ab 11"]]
people = pd.DataFrame(data, columns=["Name", "Age", "PostalCode"])
subset = pd.unique(people[["PostalCode", "Age"]].values.ravel())
# This produce an error
output = list(subset.itertuples(index=False))
# Expected output
# [('ab 11', 10), ('ab 22', 14)]
Try Groupby.groups https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.core.groupby.GroupBy.groups.html:
list(people.groupby(['PostalCode', 'Age']).groups)
[out]
[('ab 11', 10), ('ab 22', 14)]
本文内容由网友自发贡献,版权归原作者所有,本站不承担相应法律责任。如您发现有涉嫌抄袭侵权的内容,请联系:hwhale#tublm.com(使用前将#替换为@)