Python: logical comparison of columns in a pandas DataFrame

debugcn Published at Dev

dustin

I have a DataFrame in which I want to flag, row by row, whether ser_no and CTRY_NM stay the same or change relative to the previous row. However, I need to account for ser_no changes, so that at those rows comparing False with False does not return True and comparing False with True does not return False.

Consider the following dataframe:

import pandas as pd

df = pd.DataFrame({'ser_no': [1, 1, 1, 2, 2, 2, 2, 3, 3, 3],
                   'CTRY_NM': ['a', 'a', 'b', 'e', 'e', 'a', 'b', 'b', 'b', 'd']})

def check(key):
    # True where the value equals the value in the previous row
    return df[key] == df[key].shift(1)

# True where both columns stayed the same or both changed
match = check('ser_no') == check('CTRY_NM')

This returns:

0     True
1     True
2    False
3     True
4     True
5    False
6    False
7    False
8     True
9    False
dtype: bool

However, at indices 3 and 7 (the fourth and eighth rows) the serial number changes. Since each serial number is a different machine, a logical comparison makes no sense at these locations. When ser_no changes, how can I insert NaN instead of doing a logical comparison?

cncggvg

Is this what you want?

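import numpy as np

def check(data, key):
    # Compare each row with the previous one; cast to float so NaN can be stored
    mask = (data[key].shift(1) == data[key]).astype(float)
    # The first row of each group has no predecessor within that group
    mask.iloc[0] = np.nan
    return mask

df.groupby(by=['ser_no']).apply(lambda x: check(x, 'CTRY_NM'))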

Result:

ser_no   
1       0   NaN
        1     1
        2     0
2       3   NaN
        4     1
        5     0
        6     0
3       7   NaN
        8     1
        9     0
Name: CTRY_NM, dtype: float64
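
If the result needs to line up with the original index rather than the MultiIndex that groupby/apply produces, one possible alternative is to compute both row-to-row comparisons on the whole frame and blank out the rows where ser_no changed. This is only a sketch under the assumption that rows of the same machine are contiguous, as in the example data; the column name match is a hypothetical choice, not something from the question.

```python
import pandas as pd

df = pd.DataFrame({'ser_no': [1, 1, 1, 2, 2, 2, 2, 3, 3, 3],
                   'CTRY_NM': ['a', 'a', 'b', 'e', 'e', 'a', 'b', 'b', 'b', 'd']})

# True where the row belongs to the same machine as the previous row
same_machine = df['ser_no'].eq(df['ser_no'].shift())

# True where the country equals the previous row's country
same_country = df['CTRY_NM'].eq(df['CTRY_NM'].shift())

# Cast to float so NaN can be stored, then keep values only where the
# machine is unchanged; Series.where fills the remaining rows with NaN.
# 'match' is a hypothetical column name chosen for this sketch.
df['match'] = same_country.astype(float).where(same_machine)

print(df)
```

The first row of each ser_no block ends up as NaN, and every other row holds 1.0 or 0.0 depending on whether CTRY_NM matches the previous row, which should agree with the grouped result above.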

Edited at 2021-07-16
