将数据帧分成两部分并使用波形符 ~ 作为变量

2023-12-22

我想在 Python 3 中用 Pandas 做 2 个类似的操作。一个带波浪号，另一个不带波浪号。

1 - df = df[~(df.teste.isin(["Place"]))] 
2 - df = df[(df.teste.isin(["Place"]))]

我尝试将波形符声明为变量，这样我就可以只写一行，然后决定是否要使用波形符。但它不起作用：

tilde = ["~", ""]
df = df[tilde[0](df.teste.isin(["Place"]))]

是否可以做一些可以减少我的代码的事情？因为我写了很多相同的行只是交换波浪号......

Thanks!

为什么我想要波浪号作为变量：

def server_latam(df):
    df.rename(columns={'Computer:OSI':'OSI'}, inplace=True) 
    df = df[~(df.teste.isin(["Place"]))]

    df1 = df.loc[df.model != 'Virtual Platform', 'model'].count()
    print("LATAM")
    print("Physical Servers: ",df1)
    df2 = df.loc[df.model == 'Virtual Platform', 'model'].count()
    print("Virtual Servers: ",df2)
    df3 = df.groupby('platformName').size().reset_index(name='by OS: ')
    print(df3)

def server_latam_without_tilde(df):
    df.rename(columns={'Computer:OSI':'OSI'}, inplace=True) 
    df = df[(df.teste.isin(["Place"]))]

    df1 = df.loc[df.model != 'Virtual Platform', 'model'].count()
    print("LATAM")
    print("Physical Servers: ",df1)
    df2 = df.loc[df.model == 'Virtual Platform', 'model'].count()
    print("Virtual Servers: ",df2)
    df3 = df.groupby('platformName').size().reset_index(name='by OS: ')
    print(df3)

每个函数的第二行都会出现波形符。

对于您有限的用例，您所要求的好处有限。

GroupBy

Your real然而，问题是您必须创建的变量数量。你可以通过以下方式将它们减半GroupBy和一个经过计算的石斑鱼：

df = pd.DataFrame({'teste': ['Place', 'Null', 'Something', 'Place'],
                   'value': [1, 2, 3, 4]})

dfs = dict(tuple(df.groupby(df['teste'] == 'Place')))

{False:        teste  value
        1       Null      2
        2  Something      3,

 True:         teste  value
            0  Place      1
            3  Place      4}

然后通过访问您的数据框dfs[0] and dfs[1], since False == 0 and True == 1. There is最后一个例子的好处是。现在您不再需要不必要地创建新变量。您的数据框是有组织的，因为它们存在于同一个字典中。

函数调度

您的精准要求can可以通过operator模块和恒等函数：

from operator import invert

tilde = [invert, lambda x: x]

mask = df.teste == 'Place'  # don't repeat mask calculations unnecessarily

df1 = df[tilde[0](mask)]
df2 = df[tilde[1](mask)]

顺序拆包

如果您的目的是使用一行，请使用序列解包：

df1, df2 = (df[func(mask)] for func in tilde)

请注意，您可以复制GroupBy结果通过：

dfs = dict(enumerate(df[func(mask)] for func in tilde)

但这是冗长且令人费解的。坚持GroupBy解决方案。

本文内容由网友自发贡献，版权归原作者所有，本站不承担相应法律责任。如您发现有涉嫌抄袭侵权的内容，请联系:hwhale#tublm.com(使用前将#替换为@)

python

pandas

DataFrame

tilde