Separating a text document by specific lines of text using Python

CiaranWelsh

I'm writing a Python function that takes a chunk of text, read from a text file with f.read(), and splits it into sections. The text contains dividers, and I want to split it at exactly those locations. Below is an example of the text file in question.

@model:2.4.0=Skeleton "Skeleton"
@compartments
 Cell=1.0 "Cell"
@species
 Cell:[A]=100.0 "A"
 Cell:[B]=1.0 "B"
 Cell:[C]=0.0 "C"
 Cell:[D]=0.0 "D"
@parameters
kcat=4000
km = 146
v2_k = 88
@reactions
@r=v1 "v1"
 A -> C : B
 Cell * kcat * B * A / (km + A) 
@r=v2 "v2"
 C -> C+D
 Cell * v2_k * C

My desired output is a Python dictionary with the divider names as keys and all the content between each divider and the next as values. For example, the '@model' entry of the sections dictionary should be:

sections['@model'] = ':2.4.0=Skeleton "Skeleton"'

Current Code

def split_sections(SBshorthand_file):
    '''
    Takes a SBshorthand file and returns a dictionary of each of the sections. 
    Keys of the dictionary are the dividers.
    Values of dictionary are the content between dividers. 
    '''
    SBfile=parse_SBshorthand_read(SBshorthand_file) #simple parsing function. uses f.read()
    dividers=["@model", "@units", "@compartments", "@species", "@parameters", "@rules", "@reactions", "@events"]
    sections={}
    for i in  dividers:
        pattern=re.compile(i)
        if re.findall(pattern,SBfile) == []:
            pass
#            print 'Section \'{}\' not present in {}'.format(i,SBshorthand_file)
        else:
            SBfile2=re.sub(pattern,'\n'+i,SBfile)
            print SBfile2

However, this does not do what I want. Does anybody have any ideas on how to fix my code? Thanks.

-----------------Edit--------------------

Please note that the '@reactions' section contains a number of reactions, each of which starts with @r, and they all need to be grouped under the '@reactions' key.

vks

import re

x="""@model:2.4.0=Skeleton "Skeleton"
@compartments
Cell=1.0 "Cell"
@species
Cell:[A]=100.0 "A"
Cell:[B]=1.0 "B"
Cell:[C]=0.0 "C"
Cell:[D]=0.0 "D"
@parameters
kcat=4000
km = 146
v2_k = 88
@reactions
@r=v1 "v1"
A -> C : B
Cell * kcat * B * A / (km + A)
@r=v2 "v2"
C -> C+D
Cell * v2_k * C"""


# Each match is a (divider, content) pair; "@r" lines are not treated as dividers.
print(dict(re.findall(r"(?:^|(?<=\n))(@\w+)([\s\S]*?)(?=\n@(?!r\b)\w+|$)",x)))

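You can use re.findall directly to get what you want. The pattern captures each divider together with the text that follows it, up to the next divider that is not an @r line, and dict() turns the resulting pairs into the dictionary.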
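To make the pattern easier to read, here is a minimal sketch that spells out the same regex with re.VERBOSE and wraps it in a function. The names SECTION_RE and split_sections are only illustrative, and stripping the surrounding whitespace from each value is my assumption about what is wanted, not something the question asks for.

```python
import re

# Same pattern as above, written with re.VERBOSE so each part can be commented.
SECTION_RE = re.compile(r"""
    (?:^|(?<=\n))        # a divider starts the text or follows a newline
    (@\w+)               # group 1: the divider name
    ([\s\S]*?)           # group 2: what follows it, as little as possible
    (?=\n@(?!r\b)\w+|$)  # stop before the next divider that is not "@r", or at the end
""", re.VERBOSE)


def split_sections(text):
    """Return a dict mapping each divider to the text that follows it."""
    # Assumption: surrounding whitespace in each value is not wanted.
    return {name: body.strip() for name, body in SECTION_RE.findall(text)}


# Shortened version of the example file from the question.
sample = '''@model:2.4.0=Skeleton "Skeleton"
@compartments
 Cell=1.0 "Cell"
@species
 Cell:[A]=100.0 "A"
@parameters
kcat=4000
@reactions
@r=v1 "v1"
 A -> C : B
@r=v2 "v2"
 C -> C+D
'''

sections = split_sections(sample)
print(sections["@model"])
print(sections["@reactions"])
```

Both @r blocks end up inside sections["@reactions"], because the lookahead (?!r\b) refuses to treat a bare @r as the start of a new section, while dividers such as @rules still count.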

Edited at 2021-07-14
