熊猫数据框将N个小数后的数字替换为空字符串

debugcn 发表于 Dev

astro123

我正在学习如何在熊猫数据框替换中使用正则表达式。我遇到以下问题：

我正在尝试将N个小数点后的字符串替换为空。例如12.349 ==> 12.35。

MWE

import numpy as np
import pandas as pd
import seaborn as sns

df1 = pd.DataFrame({'A': ['hello','wold'],
                   'B': [12.346789, 12.223344]})
df1 = df1.astype(str)
round_ = 2
to_replace = r"(^\d+\." + r"\d" * round_ + r")(.*)"
repl = lambda m: m.group(0)

df1 = df1.replace(to_replace,repl,regex=True)
df1

Pandas文档说我可以使用正则表达式替换字符串，但是当我使用它时，得到的是函数repr而不是值。该如何解决？

更新资料

我试图将格式应用于数据帧的转置。（当然，我可以在转换之前设置样式，但是出于某些原因，我需要应用格式进行转置）。

df1 = pd.DataFrame({'A': ['hello','wold'],
                   'B': [12.349, 12.22]})
df1 = df1.T
df1.style.format({'B': "{:.2f}"}, axis=0)

参考文献

广晃

您可以尝试以下模式：

df1 = df1.replace(r"^(\d+\.\d{," + rf"{round_}" + r'})(\d*)',r'\1',regex=True)

输出：

      survived pclass    age  sibsp  parch    fare
count    891.0  891.0  714.0  891.0  891.0   891.0
mean      0.38   2.30  29.69   0.52   0.38   32.20
std       0.48   0.83  14.52   1.10   0.80   49.69
min        0.0    1.0   0.42    0.0    0.0     0.0
25%        0.0    2.0  20.12    0.0    0.0    7.91
50%        0.0    3.0   28.0    0.0    0.0   14.45
75%        1.0    3.0   38.0    1.0    0.0    31.0
max        1.0    3.0   80.0    8.0    6.0  512.32

本文收集自互联网，转载请注明来源。

如有侵权，请联系[email protected] 删除。