测试积点老人 发表于 2020-7-13 11:32:41

selenium爬虫被检测到 该如何破?

你好, 我现在用selenium抓取一个网站的时候,被识别为爬虫,请问有什么破解的方法么? 代码如下
import time
from selenium import webdriver
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC
from selenium.common.exceptions import TimeoutException

browser = webdriver.Chrome()
browser.implicitly_wait(40)
browser.get("https://www.crunchbase.com/app/search/companies/")

time.sleep(60)页面返回:
Pardon Our Interruption...
As you were browsing crunchbase accelerates innovation by bringing together data on companies and the people behind them. something about your browser made us think you were a bot. There are a few reasons this might happen:

You're a power user moving through this website with super-human speed.
You've disabled JavaScript in your web browser.
A third-party browser plugin, such as Ghostery or NoScript, is preventing JavaScript from running. Additional information is available in this support article.
To request an unblock, please fill out the form below and we will review it as soon as possible.

该网站使用了http://distilnetworks.com的反爬服务.

海海豚 发表于 2020-7-14 09:38:55

https://blog.csdn.net/Python1996/article/details/99709167 看下这个

郭小贱 发表于 2020-7-14 09:49:01

参考看看呢 https://www.zhihu.com/question/50738719

bellas 发表于 2020-7-14 09:54:26

https://www.jianshu.com/p/5e34a8f95512参考下这个链接

qqq911 发表于 2020-7-14 10:19:25

加上头信息
页: [1]
查看完整版本: selenium爬虫被检测到 该如何破?