Search

Search

How to remove rows with duplicates in pandas dataframe?

debugcn Published at Dev

25

Joe

Having a dataframe which contains duplicate values in two columns (A and B):

I want to remove duplicates so that only unique values remain:

This command does not provide what I want:

df.drop_duplicates(subset=['A','B'], keep='first')

Any idea how to do this?

jezrael

You can use stack with unstack:

print (df.stack().drop_duplicates().unstack().dropna().astype(int))
   A  B
0  1  2
2  4  5
3  7  6

Solution with boolean indexing:

print (df[~df.stack().duplicated().unstack().any(1)])
   A  B
0  1  2
2  4  5
3  7  6

Collected from the Internet

Please contact [email protected] to delete if infringement.

edited at2021-07-21

0

Comments

0 comments

Login to comment

Related

From Dev

How to remove duplicates in pandas?

From Dev

How to remove duplicates in pandas?

From Dev

How to drop duplicates from a subset of rows in a pandas dataframe?

From Dev

Python/Pandas - remove rows based on conditions below in a dataframe (similar to remove duplicates but not the same)

From Dev

How to Conditionally Remove Duplicates from Pandas DataFrame with a List

From Dev

How to remove duplicates from a dataframe?

From Dev

Combine rows to remove duplicates in CSV Python and Pandas

From Dev

remove specific rows in dataframe with pandas

From Dev

How to find duplicates in pandas dataframe

From Dev

Pandas: How to remove rows from a dataframe based on a list?

From Dev

Pandas dataframe: group by columnn and let duplicates of this columnn span several rows

From Java

Select rows of pandas dataframe based on column values with duplicates

From Dev

Pandas.Dataframe.duplicated() includes missing rows as duplicates

From Java

Remove rows with duplicate indices (Pandas DataFrame and TimeSeries)

From Dev

Pandas: remove rows of dataframe with unique index value

From Dev

Remove cancelling rows from Pandas Dataframe

From Dev

Pandas: remove rows of dataframe with unique index value

From Dev

How to drop duplicates in Pandas DataFrame by checking for a condition?

From Dev

How to keep first two duplicates in a pandas dataframe?

From Dev

Remove duplicates in dataframe pandas based on values of two columns

From Dev

Remove duplicates in pandas. copy() and drop_duplicates() is removing rows that appear only once

From Dev

Remove Quasi Duplicates In Pandas

From Dev

Excel 2016: how to remove rows that do not have duplicates in column?

From Dev

How to split dataframe or reorder dataframe by rows in pandas

From Dev

Pandas: How can I remove duplicate rows from DataFrame and calculate their frequency?

From Dev

Pandas DataFrame, How do I remove all columns and rows that sum to 0

From Dev

How do I create a function that will accept a pandas dataframe and remove rows containing a specific value?

From Dev

Pandas DataFrame, How do I remove all columns and rows that sum to 0

From Dev

How to remove rows where all numerical columns contain zero in Pandas Dataframe with mixed type of columns?

Related Related

Article

HotTag

Archive