Create subcolumns in pandas dataframe python

debugcn 投稿 Dev

Khan

I have a dataframe with multiple columns

df = pd.DataFrame({"cylinders":[2,2,1,1],
                  "horsepower":[120,100,89,70],
                  "weight":[5400,6200,7200,1200]})


 cylinders horsepower weight
0  2          120       5400
1  2          100       6200 
2  1           80       7200
3  1           70       1200

i would like to create a new dataframe and make two subcolumns of weight with the median and mean while gouping it by cylinders. example:

                        weight
  cylinders horsepower  median  mean
0  1          100       5299    5000
1  1          120       5100    5200
2  2           70       7200    6500
3  2           80       1200    1000

For my example tables i have used random values. I cant manage to achieve that. I know how to get median and mean its described here in this stackoverflow question. :

df.weight.median()
df.weight.mean()
df.groupby('cylinders') #groupby cylinders

But how to create this subcolumn?

DYZ

The following code fragment adds the two requested columns. It groups the rows by cylinders, calculates the mean and median of weight, and combines the original dataframe and the result:

result = df.join(df.groupby('cylinders')['weight']\
           .agg(['mean', 'median']))\
           .sort_values(['cylinders', 'mean']).ffill()
#   cylinders  horsepower  weight    mean  median
#2          1          80    7200  5800.0  5800.0
#3          1          70    1200  5800.0  5800.0
#1          2         100    6200  4200.0  4200.0
#0          2         120    5400  4200.0  4200.0

You cannot have "subcolumns" for select columns in pandas. If a column has "subcolumns," all other columns must have "subcolumns," too. It is called multiindexing.

この記事はインターネットから収集されたものであり、転載の際にはソースを示してください。

侵害の場合は、連絡してください[email protected]

編集2021-06-9

コメントを追加

サインイン

分類Dev

Related 関連記事

記事

Create subcolumns in pandas dataframe python

Create subcolumns in pandas dataframe python

Python Pandas: Create DataFrame Fast

Python Pandas Create Dataframe using a text file

Python: How to create a step plot with offline plotly for a pandas DataFrame?

Making one dataframe out of two dataframes as separate subcolumns in pyspark

Filling DataFrame Pandas Python

Python : Pandas DataFrame to CSV

Use lookup values to create new pandas dataframe

How to create a pandas dataframe with a column as array

How to create a Pandas DataFrame from a list of OrderedDicts?

HoloViews: create boxplots for every column in a pandas dataframe

Python: pandas dataframe condition for slice of a dataframe

Calculating Percentile in Python Pandas Dataframe

pandas DataFrame to_sql Python

python pandas: nested dictionary to dataframe

python pandas loop append dataframe

Aggregating rows in python pandas dataframe

Expand rows in python pandas dataframe

Complicated aggregation of DataFrame in Python Pandas?

Python pandas dataframe slicing, with if condition

python pandas dataframe from file

python pandas dataframe from file

Improvement in pandas dataframe conversion in Python

how to create all zero dataframe in Python

Create a buffer in a dataframe based on multiple columns - Python

Pandas Dataframe Filter contains（ '。'）、Python 3.6

Python 3.4 : Pandas DataFrame not responding to ordered dictionary

python - pandas - check if date exists in dataframe

How to sort a Dataframe by the ocurrences in a column in Python (pandas)

Saving statmodels Tukey hsd into a Python pandas dataframe