I have a DataFrame like this:
item  tags
1     awesome, awesome, great
2     cool, fun
3     boring, boring, average
4     ok, expensive
How can I remove the duplicate tags to get:
item  tags
1     awesome, great
2     cool, fun
3     boring, average
4     ok, expensive
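If I understand correctly, try the following. dict.fromkeys removes duplicates while keeping each tag in first-seen order; set would also deduplicate, but it does not guarantee any order, so the result could differ from the output you want:
df['new_tags'] = df['tags'].apply(lambda x: ', '.join(dict.fromkeys(x.split(', '))))
Output:
   item                     tags         new_tags
0     1  awesome, awesome, great   awesome, great
1     2                cool, fun        cool, fun
2     3  boring, boring, average  boring, average
3     4            ok, expensive    ok, expensive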
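The one-liner above assumes every cell is a string whose tags are separated by exactly a comma and one space. If the real data might contain missing values or uneven spacing, a slightly more tolerant version could look like the sketch below. The helper name dedupe_tags is hypothetical (it is not part of pandas), and the sample DataFrame is rebuilt from the question so the snippet runs on its own.

```python
import pandas as pd


def dedupe_tags(cell):
    """Drop repeated tags from a comma-separated string, keeping first-seen order.

    Hypothetical helper for illustration; not a pandas function.
    """
    if not isinstance(cell, str):
        return cell  # leave NaN / None untouched
    # Split on commas, trim whitespace around each tag, skip empty pieces.
    tags = [t.strip() for t in cell.split(',') if t.strip()]
    # dict.fromkeys keeps insertion order (Python 3.7+), unlike set.
    return ', '.join(dict.fromkeys(tags))


# Sample data taken from the question.
df = pd.DataFrame({
    'item': [1, 2, 3, 4],
    'tags': ['awesome, awesome, great', 'cool, fun',
             'boring, boring, average', 'ok, expensive'],
})

df['new_tags'] = df['tags'].apply(dedupe_tags)
print(df)
```

Whether the extra handling is worth it depends on how clean the tags column is; for well-formed data the one-liner is enough.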