Need to split variable length data in a pandas dataframe column into multiple columns

Nexxsys

I have a two-column dataframe like this:

    ITEM        REFNUMS
1   00000299    0036701923024762922029229294652954429569295832...
2   00000655    NaN
24  00001791    00016027123076000158004563065131972
25  00001805    00016027123076000158004563065131972
26  00001813    00016027123076000158004563065131972
27  00001821    00016027123076000158004563065131972
28  00001937    0142530521316303164702509000510012201310027820...

I would like to split the REFNUMS column into 5-character chunks and add them to the existing dataframe as new columns, since I need to retain the row index and the matching ITEM number. When it is not NaN, the string in REFNUMS always has a length divisible by 5; for example, row 1 contains 78 sets of 5.

data_len = data['REFNUMS'].str.len() / 5

This gives:

0         NaN
1        78.0
2         NaN

I'd appreciate any suggestions on how to do this.

DSM

IIUC, you can use str.extractall to get the groups of 5 digits, clean up the columns, and then join:

In [168]: r = df.REFNUMS.str.extractall(r"(\d{1,5})").unstack()

In [169]: r.columns = r.columns.droplevel(0)

In [170]: df.join(r)
Out[170]: 
    ITEM                                            REFNUMS      0      1      2      3      4      5      6      7      8     9
1    299  0036701923024762922029229294652954429569295832...  00367  01923  02476  29220  29229  29465  29544  29569  29583     2
2    655                                                NaN    NaN    NaN    NaN    NaN    NaN    NaN    NaN    NaN    NaN   NaN
24  1791                00016027123076000158004563065131972  00016  02712  30760  00158  00456  30651  31972   None   None  None
25  1805                00016027123076000158004563065131972  00016  02712  30760  00158  00456  30651  31972   None   None  None
26  1813                00016027123076000158004563065131972  00016  02712  30760  00158  00456  30651  31972   None   None  None
27  1821                00016027123076000158004563065131972  00016  02712  30760  00158  00456  30651  31972   None   None  None
28  1937  0142530521316303164702509000510012201310027820...  01425  30521  31630  31647  02509  00051  00122  01310  02782     0
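For reference, the same idea can be sketched as a self-contained script. This is only a minimal sketch under a few assumptions: the sample frame below is made up (shortened strings modelled on the question), the `REF_` column prefix is a hypothetical naming choice, and the pattern is the strict `\d{5}` because the question states that every non-NaN length is divisible by 5. The `\d{1,5}` pattern used above would also capture a shorter trailing group, which is why the truncated sample strings produce single digits in column 9.

```python
import pandas as pd

# Hypothetical sample data modelled on the question; ITEM is kept as strings
# so the leading zeros survive.
df = pd.DataFrame(
    {
        "ITEM": ["00000299", "00000655", "00001791"],
        "REFNUMS": ["003670192302476", None, "0001602712"],
    },
    index=[1, 2, 24],
)

# extractall returns one row per 5-digit match, indexed by
# (original row index, match number). Rows with missing REFNUMS yield no matches.
matches = df["REFNUMS"].str.extractall(r"(\d{5})")

# Select the single capture group (column 0) and pivot the match number
# into columns, giving one column per chunk position.
parts = matches[0].unstack()
parts.columns = [f"REF_{i}" for i in parts.columns]  # hypothetical column names

# A left join on the index keeps every original row, including the NaN one.
result = df.join(parts)
print(result)
```

Because the join is on the original index, rows whose REFNUMS is missing are kept, and rows with fewer chunks than the longest row are padded with missing values on the right.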
