当前位置：首页 > 资讯 > 技术文档

Python使用scrapy采集时伪装成HTTP/1.1的方法

时间：2021-11-25 17:33 编辑：来源：阅读：
扫一扫，手机访问

摘要：Python使用scrapy采集时伪装成HTTP/1.1的方法

本文实例讲述了Python使用scrapy采集时伪装成HTTP/1.1的方法。分享给大家供大家参考。具体如下：添加下面的代码到 settings.py 文件

DOWNLOADER_HTTPCLIENTFACTORY = 'myproject.downloader.HTTPClientFactory'

保存以下代码到单独的.py文件

from scrapy.core.downloader.webclient import ScrapyHTTPClientFactory, ScrapyHTTPPageGetter

class PageGetter(ScrapyHTTPPageGetter):

    def sendCommand(self, command, path):

        self.transport.write('%s %s HTTP/1.1\r\n' % (command, path))

class HTTPClientFactory(ScrapyHTTPClientFactory):

     protocol = PageGetter

希望本文所述对大家的Python程序设计有所帮助。

全部评论(0)

上一篇：python 3.5实现检测路由器流量并写入txt的方法实例
下一篇：python通过邮件服务器端口发送邮件的方法

资讯排行榜
更多>>