我有一个脚本,该脚本最终应从文件中提取文本的一部分并将其删除,然后对目录中的每个文件重复此操作。但是我在下面编写的代码不包含delete语句,因为我不确定该怎么做。这篇文章使我认为我应该改为grep我想要保留的所有内容,并将其写入新文件。我尝试del p
在该p = re.compile(ur'(.)(\D\D\D\D\D..........)')
行之后插入,这似乎删除了我的一个输入文件。有什么建议吗?
import sys
import re
import os
PSGDirectory = "~/TestFolder"
for path, dirs, files in os.walk(PSGDirectory):
print path
print dirs
print files
def process (files):
p = re.compile(ur'(.)(\D\D\D\D\D..........)')
return re.search(p, files)
for root, dirs, files in os.walk(PSGDirectory):
for filename in files:
fullpath = os.path.join(root,filename)
print process (fullpath)
使用re.sub()
从文档中:
import os
import re
def process(file):
with open(file, 'w') as f:
text = f.read()
# Use your regex to grab all matching text:
# For example, to remove all non-digits from a string:
filtered = re.sub(r'\D', "", text)
f.write(filtered)
return filtered
def main():
PSGDirectory = "~/TestFolder"
for path, dirs, files in os.walk(PSGDirectory):
print path
print dirs
print files
for root, dirs, files in os.walk(PSGDirectory):
for filename in files:
fullpath = os.path.join(root, filename)
print process(full path)
本文收集自互联网,转载请注明来源。
如有侵权,请联系[email protected] 删除。
我来说两句