iterate over GroupBy object in dask

Someone

Is it possible to iterate over a dask GroupBy object to get access to the underlying dataframes? I tried:

import dask.dataframe as dd
import pandas as pd
pdf = pd.DataFrame({'A':[1,2,3,4,5], 'B':['1','1','a','a','a']})
ddf = dd.from_pandas(pdf, npartitions = 3)
groups = ddf.groupby('B')
for name, df in groups:
    print(name)

However, this results in an error: KeyError: 'Column not found: 0'

More generally speaking, what kind of interactions does the dask GroupBy object allow, except from the apply method?

Steven G

you could iterate through groups doing this with dask, maybe there is a better way but this works for me.

import dask.dataframe as dd
import pandas as pd
pdf = pd.DataFrame({'A':[1, 2, 3, 4, 5], 'B':['1','1','a','a','a']})
ddf = dd.from_pandas(pdf, npartitions = 3)
groups = ddf.groupby('B')

for group in pdf['B'].unique():
    print groups.get_group(group)

this would return

dd.DataFrame<dataframe-groupby-get_group-e3ebb5d5a6a8001da9bb7653fface4c1, divisions=(0, 2, 4, 4)>
dd.DataFrame<dataframe-groupby-get_group-022502413b236592cf7d54b2dccf10a9, divisions=(0, 2, 4, 4)>

この記事はインターネットから収集されたものであり、転載の際にはソースを示してください。

侵害の場合は、連絡してください[email protected]

編集
0

コメントを追加

0

関連記事

分類Dev

Iterate over each object In array

分類Dev

How to iterate over keys of a generic object in TypeScript?

分類Dev

[Swift[ How to iterate over all the properties of an object

分類Dev

How to iterate over MethodCall object to get the parameters

分類Dev

Iterate over Pandas on MultiIndex levels and groupby to get totals

分類Dev

How to iterate over a collection of structs as an iterator of trait object references?

分類Dev

How to use Promise.all iterate over object keys

分類Dev

Iterate over template int

分類Dev

Unable to iterate over an ElementArrayFinder

分類Dev

Dask:Groupbyと 'First' / 'Last' in agg

分類Dev

Dask.groupby turns multiple partitions into one

分類Dev

How to iterate over and filter an array?

分類Dev

iterate over unique values in PANDAS

分類Dev

how to iterate over tuple items

分類Dev

How to iterate over the lines in a string?

分類Dev

Iterate over a tensor dimension in Tensorflow

分類Dev

How to iterate over dates in a dataframe?

分類Dev

Iterate over a custom set in SAS

分類Dev

Dynamically iterate over static information

分類Dev

How to iterate over block of text

分類Dev

Iterate/enumerate over part of a list?

分類Dev

Jersey howto iterate over parameters?

分類Dev

How to use joins and groupby in laravel and to iterate through?

分類Dev

Iterate over cell versions on BigTable row

分類Dev

Is there a way to iterate over a slice in reverse in Go?

分類Dev

java iterate over map except last iteration

分類Dev

How can we iterate over an HashMap in JSTL?

分類Dev

How to define and iterate over map in Jenkinsfile

分類Dev

iterate over certain columns in data frame

Related 関連記事

ホットタグ

アーカイブ