I have a pandas DataFrame:
categories
['business $ finance'\n 'business $currency']
['entertainment $movies' 'entertainment &music']
['sports& volleyball' 'sports& football'\n 'sports$ baseball']
...............
I just want to convert the strings inside the brackets into proper list format. The expected output is:
categories
['business $ finance','business $currency']
['entertainment $movies','entertainment &music']
['sports&volleyball','sports& football','sports$ baseball']
....................
What I have tried so far:
array_cat = []
for i in df['categories']:
    s = i[0].split()
    array_cat.append(s)
df['categories'][0] looks like this:
"['business finance#mergers & acquisitions' 'business#industries#telecom'\n 'diseases#kidney' 'gaming #offline #console#xbox'\n 'gaming #offline #playstation' 'lifestyle#shopping' 'movies#genres#drama'\n 'music#genres#hiphop/rap' 'personal finance#tools' 'technology#trending'\n 'theme#historicalsites']"
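The outer double quotes suggest that each cell is one long string rather than a list. If so, that would explain why the loop above does not help: i[0] is only the first character of the string, the opening bracket. A minimal sketch, assuming the cells are strings like the one shown (the sample value below is shortened and hypothetical):

```python
import pandas as pd

# Hypothetical, shortened sample; the real column is assumed to hold strings like this.
df = pd.DataFrame({"categories": ["['business $ finance'\n 'business $currency']"]})

cell = df["categories"][0]
cell_type = type(cell)      # the cell is a str, not a list
first_char = cell[0]        # indexing a str returns its first character
attempt = cell[0].split()   # splitting that single character gives a one-item list

print(cell_type, first_char, attempt)
```

Checking type(df['categories'][0]) on the real data is a quick way to confirm whether this assumption holds.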
Use str.findall to extract all values between the single quotes '':
df['categories'] = df['categories'].str.findall(r"'(.*?)'")
print(df)
categories
0 [business finance#mergers & acquisitions, busi...
1 [entertainment$movies, entertainment&music]
2 [sports&volleyball, sports&football, sports$ba...
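For reference, the same approach can be tried end to end on a small frame. This is only a sketch: the DataFrame below is rebuilt by hand from the sample rows in the question, on the assumption that every cell is a string.

```python
import pandas as pd

# Sample rows copied from the question; each cell is assumed to be one string.
df = pd.DataFrame({
    "categories": [
        "['business $ finance'\n 'business $currency']",
        "['entertainment $movies' 'entertainment &music']",
        "['sports& volleyball' 'sports& football'\n 'sports$ baseball']",
    ]
})

# Non-greedy capture of everything between each pair of single quotes.
df["categories"] = df["categories"].str.findall(r"'(.*?)'")

result = df["categories"].tolist()
print(result)
```

Note that the pattern relies on single quotes as delimiters, so a category that itself contains an apostrophe would be split incorrectly.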