Pandas DataFrame基于列表的部分字符串匹配

debugcn 发表于 Dev

飞利浦

我有DataFrame以下内容。

import pandas as pd
import numpy as np

df = pd.DataFrame([['One person has died after two motorbikes crashed in the Bay of Plenty.', 'The crash occurred at 3.15pm on Bell Rd in Nukuhou south of Whakatāne police said.', 'Another person suffered minor injuries in the crash.', 'WorkSafe had been advised and the Serious Crash Unit was in attendance police said.',
 'One person has died and another has minor injuries following a serious crash in Nukuhou near Whakatāne in Eastern Bay of Plenty.', 'The crash involved two motorcycles and was reported around 3.15pm.', 'The Serious Crash Unit and WorkSafe are attending the scene.']]).T
df.columns = ['col1']

我想查找以下字符串列表是否在中的“ col1”可用DataFrame。

loc_list = ['Bay of Plenty', 'Bell Rd', 'Nukuhou']

以下是我尝试过的。

df['location_mapped_title'] = (df.col1.str
                             .findall('|'.join(loc_list))
                             .str[0])
    col1    location_mapped_title
0   One person has died after two motorbikes crash...   Bay of Plenty
1   The crash occurred at 3.15pm on Bell Rd in Nuk...   Bell Rd
2   Another person suffered minor injuries in the ...   NaN
3   WorkSafe had been advised and the Serious Cras...   NaN
4   One person has died and another has minor inju...   Nukuhou
5   The crash involved two motorcycles and was rep...   NaN
6   The Serious Crash Unit and WorkSafe are attend...   NaN

但是它不会打印所有匹配的子字符串。例如，在索引4处有另一个“丰盛湾”。如何找到所有匹配项？

桑迪普·卡达帕（Sandeep Kadapa）

尝试findall或extractall：

df.col1.str.findall(f'({"|".join(loc_list)})', flags=re.IGNORECASE)

df.col1.str.extractall(f'({"|".join(loc_list)})', flags=re.IGNORECASE)

本文收集自互联网，转载请注明来源。

如有侵权，请联系[email protected] 删除。

编辑于2021-04-5

我来说两句

0条评论

登录后参与评论

来自分类Dev

删除部分字符串pandas DataFrame

来自分类Dev

从列表中搜索数据框列中的部分字符串匹配-Pandas-Python

来自分类Dev

从列表中搜索数据框列中的部分字符串匹配-Pandas-Python

来自分类常见问题

从Pandas DataFrame的列内的列表中选择部分字符串

来自分类Dev

从Pandas DataFrame的列内的列表中选择部分字符串

来自分类Dev

从Pandas DataFrame删除引号中的字符串部分

来自分类Dev

将Pandas DataFrame值从列表更改为字符串

来自分类Dev

在Pandas DataFrame中获取字符串而不是列表

来自分类Dev

在Pandas DataFrame中将列表转换为字符串

来自分类Dev

如何基于多列中的字符串匹配在Pandas数据框中选择行

来自分类Dev

Pandas 错误匹配字符串

来自分类Dev

如何检查字符串列表中的字符串是否在pandas dataframe列中

来自分类Dev

从字符串列表中删除某些字符串作为 pandas.DataFrame 中的列

来自分类Dev

仅打印与正则表达式Pandas匹配的字符串部分

来自分类Dev

如何在pandas.MultiIndex中找到部分字符串匹配

来自分类Dev

根据列表中的部分匹配字符串过滤DataFrame

来自分类Dev

用pandas DataFrame制作字符串

来自分类Dev

获取带有预定义列表的pandas列字符串中匹配单词的计数

来自分类Dev

动态屏蔽pandas df以查找与列表中的字符串匹配的行

来自分类Dev

如何将Pandas Dataframe中的字符串转换为列表或字符数组？

来自分类Dev

从pandas DataFrame中选择部分字符串，保存在列中

来自分类Dev

如果列包含任何指定的部分字符串，Pandas Dataframe 保持行

来自分类Dev

如何从字符串的选定部分创建一个 pandas DataFrame

来自分类Dev

从python pandas的dataframe列中搜索匹配的字符串模式

来自分类Dev

根据名称在列表中包含字符串的方式选择pandas dataframe列

来自分类Dev

Pandas DF：替换字符串的中间部分

来自分类Dev

Pandas 包含不匹配的完整字符串

来自分类Dev

python pandas根据前缀匹配字符串

来自分类Dev

如果有来自另一列的项目匹配，如何迭代 Pandas DataFrame 并替换字符串

Related 相关文章

文章