pandas to_csv(): TypeError: coercing to Unicode: need string or buffer, list found

Posted by debugcn in Dev

Cyclotron3x3

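I am trying to merge multiple CSV files into a single large CSV for my dataset. What I want is to take some columns from several CSV files and build a dataset from them. I don't want every column in the final dataset, only a few selected ones. I used the names attribute in pandas when reading the CSV, and reading works fine, but I cannot create a new CSV from the CSV I read. What am I doing wrong here? I have added the stack trace at the bottom.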

import glob
import pandas as pd
import os
import time
from datetime import datetime
import numpy as np

path = "C:\Users\lenovo\Downloads\Compressed\LoanStats3a.csv_2\csv"
class MergeCsvFiles:
    def MergeCsv(self):
        allFiles = glob.glob(os.path.join(path, "LoanStats3a.csv"))
        print 'allFiles',allFiles

        for file_ in allFiles:
            print 'file_ ######### ',file_

            # merge_df = pd.DataFrame.from_csv(file_)
            # print merge_df
            fileToSave = glob.glob(os.path.join(path, "merge.csv"))
            print 'filrToSave #### ', fileToSave
            np_array_list = []

            df = pd.read_csv(file_, skipinitialspace=True,low_memory=False,header=0,index_col=None)
            np_array_list.append(df.as_matrix())
            comb_np_array = np.vstack(np_array_list)
            big_frame = pd.DataFrame(comb_np_array)
            # big_frame.columns = fields
            print 'big_frame#### ', big_frame
            big_frame.to_csv(fileToSave)

            # See the keys
            print 'df.keys########',df.keys()
            print 'df @@@@@', df
            frame = pd.DataFrame()
            list_ = []

            list_.append(df)
            frame = pd.concat(list_)
            # print 'frame#### ',frame

            frame.to_csv(fileToSave)

if __name__ == "__main__":
    s =  MergeCsvFiles()
    s.MergeCsv()

Stack trace:

Traceback (most recent call last):
  File "C:/Users/lenovo/Downloads/Video/Machine Learning/MLPredictiveAnalysis/MergeCsv.py", line 59, in <module>
    s.MergeCsv()
  File "C:/Users/lenovo/Downloads/Video/Machine Learning/MLPredictiveAnalysis/MergeCsv.py", line 39, in MergeCsv
    big_frame.to_csv(fileToSave)
  File "C:\Python27\lib\site-packages\pandas\core\frame.py", line 1344, in to_csv
    formatter.save()
  File "C:\Python27\lib\site-packages\pandas\formats\format.py", line 1526, in save
    compression=self.compression)
  File "C:\Python27\lib\site-packages\pandas\io\common.py", line 426, in _get_handle
    f = open(path, mode)
TypeError: coercing to Unicode: need string or buffer, list found

Ted Petrou

glob.glob returns a list. You need to pass a string path name to big_frame.to_csv. Why do you even need glob? big_frame.to_csv(os.path.join(path, "merge.csv")) should work.
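
To make the difference concrete, here is a minimal sketch using only the standard library. It is written for Python 3 (the question uses Python 2.7, where the same mistake produces the "coercing to Unicode" message), and the temporary directory is a hypothetical stand-in for the asker's path.

```python
import glob
import os
import tempfile

# Hypothetical scratch directory standing in for the asker's `path`.
path = tempfile.mkdtemp()

# glob.glob always returns a list of matches, even for a literal file name.
# merge.csv does not exist yet, so the list is empty here.
matches = glob.glob(os.path.join(path, "merge.csv"))
print(type(matches).__name__, matches)

# os.path.join returns the single path string that open() and to_csv expect.
file_to_save = os.path.join(path, "merge.csv")
print(type(file_to_save).__name__)

# Passing the list where a path is expected reproduces the failure.
try:
    open(matches, "w")
except TypeError as exc:
    error_message = str(exc)
    print("TypeError:", error_message)
```

The traceback in the question ends in open(path, mode) inside pandas, which is the same call that fails here once it is handed a list.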

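You are also writing to the same file with frame.to_csv(fileToSave) at the bottom of the loop. Each iteration overwrites the file, so only the last iteration's data will be saved.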
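
One way to avoid the overwrite is to collect the frames inside the loop and write a single time after it. The sketch below is an illustration under assumptions, not the asker's actual data: the file names, the column names, and the use of usecols to keep only selected columns are all hypothetical choices, and it targets Python 3 with a current pandas.

```python
import glob
import os
import tempfile

import pandas as pd

# Hypothetical sample files so the sketch runs on its own.
path = tempfile.mkdtemp()
pd.DataFrame(
    {"loan_amnt": [1000, 2000], "term": [36, 60], "grade": ["A", "B"]}
).to_csv(os.path.join(path, "LoanStats_part1.csv"), index=False)
pd.DataFrame(
    {"loan_amnt": [3000], "term": [36], "grade": ["C"]}
).to_csv(os.path.join(path, "LoanStats_part2.csv"), index=False)

# Assumed names of the few columns wanted in the final dataset.
wanted_columns = ["loan_amnt", "grade"]

# Read only the selected columns from every input file.
frames = []
for file_ in sorted(glob.glob(os.path.join(path, "LoanStats_*.csv"))):
    frames.append(pd.read_csv(file_, usecols=wanted_columns, skipinitialspace=True))

# Concatenate once, after the loop, and write once to a plain string path.
big_frame = pd.concat(frames, ignore_index=True)
file_to_save = os.path.join(path, "merge.csv")
big_frame.to_csv(file_to_save, index=False)

print(big_frame)
```

Here glob is still useful for finding the input files, while the output path is built with os.path.join alone, so to_csv receives a string.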

Edited on 2021-07-5
