Merging multiple values into one row in a new column Pandas Python

debugcn Published at Dev

Umar.H

Greetings Beautiful People!

I'm putting together a visualization for some survey data. Unfortunately, there is no data modeling or end-to-end process behind it.

I have multiple columns as follows:

    What Role : Teacher   What Role : Engineer   What Role : Doctor
1   Yes                   Yes                    No
2   No                    No                     Yes
3   Yes                   No                     Yes

So what I want to do is create a new column that replaces each Yes with the value from the matching header. For example, if Doctor is Yes, then Doctor would be entered into the new column:

    What Role?
1   Teacher, Engineer
2   Doctor
3   Teacher, Doctor

Could this be done by creating a dictionary and then a for loop?

For example:

import pandas as pd

df = pd.read_csv("file.csv")

Dictionary_File = {'What Role?' : 'What Role : Teacher', 
'What Role?': 'What Role : Engineer', 'What Role?' : 'What Role : Doctor'}

for k,v in Dictionary_File.items():
   (df[k] = df[k] == 'Yes', 'Unsure here' + df[v])

df = df.drop(list(Dictionary_File.values()), axis=1)

When it comes to the for loop, I couldn't think of or find a way to merge the values into something new (other than manually changing every Yes into a new value and then merging?).

Any help would be much appreciated!

Cheers,

jezrael

First, you need to remove the What Role : prefix from the column names with split.
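
As a quick illustration of that first step, here is a small sketch. The column names are copied from the question, and it assumes the role is always the last whitespace-separated word of the header:

```python
import pandas as pd

# Column headers as given in the question.
columns = pd.Index(
    ["What Role : Teacher", "What Role : Engineer", "What Role : Doctor"]
)

# Split each header on whitespace and keep only the last word (the role name).
roles = columns.str.split().str[-1]
print(list(roles))
```

If a role name itself contained a space, this would keep only its last word, so splitting on the colon instead might be safer in that case.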

Then use the boolean mask df == 'Yes' with numpy.where to build the joined values:

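import numpy as np
# keep only the last word of each header, e.g. 'Teacher'
c = df.columns.str.split().str[-1]
# role name plus ', ' where the cell is 'Yes', otherwise ''
s = np.where(df == 'Yes', ['{}, '.format(x) for x in c], '')
print(s)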
[['Teacher, ' 'Engineer, ' '']
 ['' '' 'Doctor, ']
 ['Teacher, ' '' 'Doctor, ']]

df['new'] = pd.Series([''.join(x).strip(', ') for x in s], index=df.index)
print (df)
  What Role : Teacher What Role : Engineer What Role : Doctor  \
1                 Yes                  Yes                 No   
2                  No                   No                Yes   
3                 Yes                   No                Yes   

                 new  
1  Teacher, Engineer  
2             Doctor  
3    Teacher, Doctor
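
For anyone who wants to try this without file.csv, below is a self-contained sketch that rebuilds the sample data from the question. It is a variation on the approach above rather than the exact same code: it joins the role names per row from the boolean mask instead of going through numpy.where. The helper name combine_roles and its parameters are made up for illustration. It also names the new column What Role? and drops the original columns, which is what the question asked for.

```python
import pandas as pd

# Sample data rebuilt from the question (the real data would come from file.csv).
df = pd.DataFrame(
    {
        "What Role : Teacher": ["Yes", "No", "Yes"],
        "What Role : Engineer": ["Yes", "No", "No"],
        "What Role : Doctor": ["No", "Yes", "Yes"],
    },
    index=[1, 2, 3],
)


def combine_roles(frame, yes="Yes", sep=", "):
    """Hypothetical helper: join the role names of all columns equal to `yes`."""
    # Last word of each header, e.g. 'Teacher'.
    roles = frame.columns.str.split().str[-1].to_numpy()
    # Boolean array: True where the cell equals `yes`.
    mask = frame.eq(yes).to_numpy()
    # For each row, keep the role names where the mask is True and join them.
    return pd.Series([sep.join(roles[row]) for row in mask], index=frame.index)


role_cols = list(df.columns)
df["What Role?"] = combine_roles(df[role_cols])
df = df.drop(columns=role_cols)
print(df)
```

A row with no Yes at all would end up with an empty string in the new column.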

Edited at 2021-08-09
