pandas apply only returning first value when using logical indexing

debugcn 投稿 Dev

spacediver

I create two dataframes:

data = [['John'], ['Mary']]
df1 = pd.DataFrame(data, columns = ['Name'])
df1['Height'] = 0

data = [['John', 5], ['Mary', 6]]
df2 = pd.DataFrame(data, columns = ['Name', 'Height'])

df1

Output:

       Name  Height
    0  John  0
    1  Mary  0

df2

Output:
       Name  Height
    0  John  5
    1  Mary  6

Now I try to fill in df1's Height using the values from df2:

df1['Height'] = df1.apply(lambda row: df2[df2.Name == row.Name]['Height'], axis = 1)

df1

Output:
       Name  Height
    0  John  5
    1  Mary  Nan

Why does only the first name (John) have the Height filled in? Shouldn't apply() be iterating through all the rows of the df1 and returning the Height from df2 where df2 matches the name in the current row of df1?

Quang Hoang

The problem is that df2[df2.Name == row.Name]['Height'] returns a series with different indexes. You when Pandas concatenate these series, it yields different columns. In particular:

df1.apply(lambda row: df2[df2.Name == row.Name]['Height'], axis = 1)

returns:

     0    1
0  5.0  NaN
1  NaN  6.0

and it looks like Pandas takes the first column to assign when you do:

df['Height'] = ...

To fix your code, you need to extract the single value:

df1['Height'] = df1.apply(lambda row: df2[df2.Name == row.Name]['Height'].iloc[0], axis = 1)

However, this is certainly not the best way to approach the problem. You should either take a look at map or merge. For example:

df1['Height'] = df1['Name'].map(df2.set_index('Name')['Height'])

この記事はインターネットから収集されたものであり、転載の際にはソースを示してください。

侵害の場合は、連絡してください[email protected]

編集2021-06-13

コメントを追加

サインイン

分類Dev

Related 関連記事

記事

pandas apply only returning first value when using logical indexing

pandas apply only returning first value when using logical indexing

regex scan only returning first value

setState Concat storing/returning only First value-- React Native

Array value only returning first character and extra values

Logical indexing in JAVA

redissonClient.poll() only returning the first 8 characters of String type value

The parameter in a custom function when using pandas.Series.apply

Using RXJS to issue two sequential http calls and returning the result of the first one only

Select statement only returning first row in result

scraper only returning results for first 2 inputs

Masking/modifying values using advanced indexing with pandas

specify a first column when exporting csv using pandas

Several errors in Swift when using || or && logical operators

apply prepends space for logical

Why does the cell style index returning wrong CellFormat value when parsing excel cells using OpenXml SDK?

Using QualifierFilter leads to only returning matched columns

Example to clarify fill_value when using add() on dataframes (pandas)

IBM Watson Speech to Text Only Returning First Word With Java SDK

Python's Popen + communicate only returning the first line of stdout

Compiling Sass with Scout doesn't apply correct alpha value when using rgba(r,g,b,a)

Checkbox not returning value when not checked PHP Codeigniter

How to rename a logical value (TRUE or FALSE) with "Yes" or "No" and apply distinct() to FALSE values

python pandas - groupby.first() returning NaT values

How to set a defaultChecked to only first input in radio button when using map function in reactjs

Pandas read_csv only first comma

ListView using findViewById is returning null value

Correct Use of Matcher Groups in Java Regex when using logical OR

returning the first column value with two independent row criteria in excel

R: passing by parameter to function and using apply instead of nested loop and recursive indexing failed

pandas using apply method and sending column names?