selenium爬虫被检测到 该如何破?
你好, 我现在用selenium抓取一个网站的时候,被识别为爬虫,请问有什么破解的方法么? 代码如下import time
from selenium import webdriver
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC
from selenium.common.exceptions import TimeoutException
browser = webdriver.Chrome()
browser.implicitly_wait(40)
browser.get("https://www.crunchbase.com/app/search/companies/")
time.sleep(60)页面返回:
Pardon Our Interruption...
As you were browsing crunchbase accelerates innovation by bringing together data on companies and the people behind them. something about your browser made us think you were a bot. There are a few reasons this might happen:
You're a power user moving through this website with super-human speed.
You've disabled JavaScript in your web browser.
A third-party browser plugin, such as Ghostery or NoScript, is preventing JavaScript from running. Additional information is available in this support article.
To request an unblock, please fill out the form below and we will review it as soon as possible.该网站使用了http://distilnetworks.com的反爬服务.
参考下这个链接https://www.zhihu.com/question/50738719 自己搭建一个网站 https://blog.csdn.net/Python1996/article/details/99709167看下这个 参考这篇文章看下呢:https://blog.csdn.net/Python1996/article/details/99709167 参考楼上的
页:
[1]