Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Count not being supported in multiset agg function #358

Open
HYPERTONE opened this issue Aug 25, 2023 · 0 comments
Open

Count not being supported in multiset agg function #358

HYPERTONE opened this issue Aug 25, 2023 · 0 comments

Comments

@HYPERTONE
Copy link

HYPERTONE commented Aug 25, 2023

GroupByOps has AggNames listed as a set:
AggNames = { "count", "cumsum", "cummin", "cummax", "first", "last", "max", "mean", "median", "min", "nanmax", "nanmean", "nanmedian", "nanmin", "nanstd", "nansum", "nanvar", "nth", "std", "sum", "var", }

However, upon performing a groupby with 'count', the following occurs:
dataset.groupby(['Category', 'SubCategory']).agg({'Identifier' : 'count', 'Value' : 'max'})

TypeError: count() takes 1 positional argument but 2 were given

This can similarly be done in pandas via the following:

df.groupby(['Category', 'SubCategory']).agg({'Identifier' : 'count', 'Value' : 'max'})

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant