Replacing a pandas DataFrame row overwrites all columns' dtypes

fantabolous Published at Dev

fantabolous

When I replace a row of a df, it causes an existing column of dtype=int to become float. I would like to keep it as int.

I create the df:

import pandas as pd

testdate = pd.Timestamp(2014, 1, 1)
adddata = {'intcol': 0, 'floatcol': 0.0}
df = pd.DataFrame(data=adddata, index=pd.date_range(testdate, periods=1))

As desired, one column is int and the other is float, as confirmed by df.dtypes:

floatcol    float64
intcol        int64
dtype: object

Then I overwrite an existing row (in this case there is only one) using df.loc[testdate] = pd.Series(adddata). I deliberately reuse the same data to show the issue: intcol has become float, as df.dtypes shows:

floatcol    float64
intcol      float64
dtype: object

Note that I can change the cells individually (e.g. df.loc[testdate, 'floatcol'] = 0.0) and the column dtypes are maintained. In reality, though, I have far more than 2 columns to overwrite simultaneously, so doing them one at a time is cumbersome.
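
If per-cell assignment is acceptable, a loop can take the tedium out of it. The following is only a minimal sketch, assuming the replacement values sit in a dict keyed by column name; new_values is a hypothetical name that does not appear in the original post.

```python
import pandas as pd

testdate = pd.Timestamp(2014, 1, 1)
adddata = {'intcol': 0, 'floatcol': 0.0}
df = pd.DataFrame(data=adddata, index=pd.date_range(testdate, periods=1))

# Hypothetical replacement row, keyed by column name.
new_values = {'intcol': 5, 'floatcol': 1.5}

# Assign one cell at a time so each value only touches its own column.
for col, val in new_values.items():
    df.loc[testdate, col] = val

print(df.dtypes)
```

Each assignment targets a single column, so an int value never shares a container with a float value on its way into the frame.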

behzad.nouri

Interestingly, even specifying the data type as object does not help:

>>> df.loc[testdate,:] = pd.Series(adddata, dtype='object')
>>> df.dtypes
floatcol    float64
intcol      float64
dtype: object
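
For context, it can help to look at the Series on its own before it is assigned. This small check, offered as a sketch rather than a full explanation of the behavior above, shows that a Series built from the mixed int/float dict is already float64 by default, while dtype='object' keeps it as object:

```python
import pandas as pd

adddata = {'intcol': 0, 'floatcol': 0.0}

# With no dtype given, pandas picks one common dtype for all values.
default_series = pd.Series(adddata)

# Forcing object dtype avoids that common-dtype conversion in the Series itself.
object_series = pd.Series(adddata, dtype='object')

print(default_series.dtype)
print(object_series.dtype)
```

So in the default case the int has already become a float before the row assignment happens.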

Someone may have a better solution, but I noticed that this works:

>>> df.loc[testdate,:] = pd.Series(list(adddata.values()), adddata.keys(), dtype='object')
>>> df.dtypes
floatcol    float64
intcol        int64
dtype: object

But if the row values are in dict format, this is probably easier:

>>> df.loc[testdate,:] = list(map(adddata.get, df.columns))
>>> df.dtypes
floatcol    float64
intcol        int64
dtype: object

Edited at 2021-02-07