스크래피를 텍스트 파일로 내보내기

debugcn 에 게시 Dev

오징어

스크래피 데이터를 텍스트 파일로 내 보내서 파이썬 스크립트가 실행될 때 스크래피를 실행하기 위해 터미널을 거치지 않고 텍스트 파일을 생성하는 방법이 있습니까?

코드 예

class NameListSpider(CrawlSpider):
    name = 'namelist'
    allowed_domains = ['namelist.com']
    start_urls = ['http://www.namelist.com']

    rules = (
        Rule(LinkExtractor(restrict_xpaths='//div[@class="post-outer"]/a'), callback='parse_item', follow=True),
    )

    def parse_item(self, response):
        yield {
            'name': response.xpath('//div[@class="alt"]/span/span[2]/text()').get()
        }

# have added the below as an example
with open("file.txt", "a") as file: 
    file.write(name)

패트릭 클라인

이 결과를 얻을 수있는 방법은 여러 가지가 있습니다.
프로젝트를 실행 scrapy crawl하려면 설정에서 피드를 구성 할 수 있습니다 .
함께 실행 python your_python_script.py하려면 설정도 전달해야합니다.
다른 항목을 다른 파일로 내보낼 수도 있습니다. 이를 위해 github에서이 파이프 라인을 확인하세요.

이제 스파이더를 사용하여 실행하려면 python your_script.py다음과 같이하십시오.

# -*- coding: utf-8 -*-
from scrapy.settings import Settings
from scrapy.crawler import CrawlerRunner
from twisted.internet import reactor
from scrapy.linkextractors import LinkExtractor
from scrapy.spiders import Rule, CrawlSpider

class NameListSpider(CrawlSpider):
    name = 'namelist'
    allowed_domains = ['namelist.com']
    start_urls = ['http://www.namelist.com']
    rules = (
        Rule(LinkExtractor(restrict_xpaths='//div[@class="post-outer"]/a'), callback='parse_item', follow=True),
    )

    def parse_item(self, response):
        yield {
            'name': response.xpath('//div[@class="alt"]/span/span[2]/text()').get()
        }

def get_settings():
    settings = Settings()
    settings.set('FEED_URI', 'file.txt')
    settings.set('FEED_FORMAT', 'csv')
    return settings

if __name__ == '__main__':
    settings = get_settings()
    runner = CrawlerRunner(settings)
    d = runner.crawl(NameListSpider)
    d.addBoth(lambda _: reactor.stop())
    reactor.run()

이 기사는 인터넷에서 수집됩니다. 재 인쇄 할 때 출처를 알려주십시오.

침해가 발생한 경우 연락 주시기 바랍니다[email protected] 삭제

에서 수정2021-04-2

몇 마디 만하겠습니다

0리뷰

로그인참여 후 검토

Related 관련 기사

기사

스크래피를 텍스트 파일로 내보내기

스크래피를 텍스트 파일로 내보내기

일부 용어로 텍스트 파일 내보내기

Python Selenium 목록을 텍스트 파일로 내보내기

Outlook에서 텍스트 파일로 규칙 내보내기

사전을 텍스트 파일로 내보내기

Hadoop 출력을 텍스트 파일로 내보내기

이 형식의 텍스트 파일로 사전 내보내기

sendgrid에 html 텍스트를 첨부 파일로 보내기

여러 저장 프로 시저를 텍스트 파일로 내보내기

CSV로 가져 오기 및 텍스트 파일로 내보내기

HTML 텍스트 상자는 txt 파일로 보내기

`time` 명령을 텍스트 파일로 보내기

스크래피 : 이메일로 로그 보내기

Unity Gameobject 좌표를 추출하고 텍스트 파일로 내보내기

조직 내보내기 : 조직 테이블을 HTML 또는 텍스트 파일로 내보내기

루프의 Google 시트에서 텍스트 파일로 내보내기

해당 사이트의 값을 새 텍스트 파일로 내보내기

텍스트 파일 내보내기 엑셀

기능 프롬프트를 텍스트로 내보내기

jquery로 동적 텍스트 보내기

MySQL 내보내기 테이블을 텍스트 파일 필드 이름으로

Powershell 내보내기의 두 문자열을 텍스트 파일로 결합

Python 2 차원 목록 : 구분 된 텍스트 파일로 내보내기

각 html 클래스를 pdf 파일로 내보내기

zip 파일로드, ".docx"를 텍스트 파일로 변환, R로 다시 내보내기

스크래피 내보내기 빈 csv

스크래피 내보내기 빈 csv

CSV의 지정된 텍스트를 HTML로 내보내기

이모티콘으로 텍스트 이메일 첨부 파일 보내기

웹 사이트에서 pdf로 텍스트 내보내기