Pandas: Count Unique Values after Resample

Malcolm Bastien Published at Dev

Malcolm Bastien

I'm just getting started with Pandas and am trying to combine two steps: grouping my data by date, and counting the unique values in each group.

Here's what my data looks like:

                  User, Type
Datetime
2014-04-15 11:00:00, A, New
2014-04-15 12:00:00, B, Returning
2014-04-15 13:00:00, C, New
2014-04-20 14:00:00, D, New
2014-04-20 15:00:00, B, Returning
2014-04-20 16:00:00, B, Returning
2014-04-20 17:00:00, D, Returning

And here's what I would like to get to: Resample the datetime index to the day (which I can do), and also count the unique users for each day. I'm not interested in the 'Type' column yet.

Day, Unique Users
2014-04-15, 3
2014-04-20, 2

I'm trying df.user.resample('D', how='count').unique but it doesn't seem to give me the right answer.

Karl D.

You don't need to do a resample to get the desired output in your question. I think you can get by with just a groupby on date:

print(df.groupby(df.index.date)['User'].nunique())

2014-04-15    3
2014-04-20    2
Name: User, dtype: int64
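
For anyone who wants to try this end to end, here is a minimal sketch that rebuilds the sample data from the question and applies the same groupby. The DataFrame construction and the variable name unique_per_day are my own assumptions for illustration; only the groupby line comes from the answer above.

```python
import pandas as pd

# Rebuild the sample data from the question (values copied from the post).
df = pd.DataFrame(
    {
        "User": ["A", "B", "C", "D", "B", "B", "D"],
        "Type": ["New", "Returning", "New", "New",
                 "Returning", "Returning", "Returning"],
    },
    index=pd.to_datetime(
        [
            "2014-04-15 11:00:00",
            "2014-04-15 12:00:00",
            "2014-04-15 13:00:00",
            "2014-04-20 14:00:00",
            "2014-04-20 15:00:00",
            "2014-04-20 16:00:00",
            "2014-04-20 17:00:00",
        ]
    ),
)
df.index.name = "Datetime"

# Group rows by calendar date and count the distinct users in each group.
# The result is indexed by datetime.date objects, one entry per day with data.
unique_per_day = df.groupby(df.index.date)["User"].nunique()
print(unique_per_day)
```

Note that the result only has rows for days that actually appear in the data, which is why the gap between the two dates is not shown.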

Then, if you want, you can resample to fill in the gaps in the time series after counting the unique users:

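import pandas as pd

cnt = df.groupby(df.index.date)['User'].nunique()
cnt.index = pd.to_datetime(cnt.index)  # Index.to_datetime() was removed from pandas
print(cnt.resample('D').asfreq())      # asfreq() leaves the missing days as NaN

2014-04-15    3.0
2014-04-16    NaN
2014-04-17    NaN
2014-04-18    NaN
2014-04-19    NaN
2014-04-20    2.0
Freq: D, Name: User, dtype: float64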
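
As a possible alternative, recent pandas versions let you call nunique() directly on the resampler, which skips the groupby step. The sketch below is not part of the original answer; it assumes a pandas version where resample('D') returns a Resampler object (the how= argument from the question no longer exists there), and the names daily_unique and daily_rows are hypothetical.

```python
import pandas as pd

# Same sample data as in the question.
df = pd.DataFrame(
    {"User": ["A", "B", "C", "D", "B", "B", "D"]},
    index=pd.to_datetime(
        [
            "2014-04-15 11:00:00",
            "2014-04-15 12:00:00",
            "2014-04-15 13:00:00",
            "2014-04-20 14:00:00",
            "2014-04-20 15:00:00",
            "2014-04-20 16:00:00",
            "2014-04-20 17:00:00",
        ]
    ),
)

# Resample straight to daily bins and count distinct users per bin.
# Days with no rows come out as 0 here rather than NaN.
daily_unique = df["User"].resample("D").nunique()
print(daily_unique)

# For contrast: count() tallies rows, not distinct users,
# so 2014-04-20 gives 4 instead of 2.
daily_rows = df["User"].resample("D").count()
print(daily_rows)
```

The contrast with count() is probably why the attempt in the question looked wrong: counting rows per day and then calling unique on the result gives the distinct row counts, not the number of distinct users.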


Edited at 2021-03-15
