Selenium的使用 (2)获取页面的内容
1.启动driver:
chromedriver
成功的话看到这样的内容:
Starting ChromeDriver 108.0.5359.94 (713576b895246504ccc6b92c2fb8ce2d60194074-refs/branch-heads/5359_71@{#3}) on port 9515<br />
Only local connections are allowed.<br />
Please see https://chromedriver.chromium.org/security-considerations for suggestions on keeping ChromeDriver safe.<br />
ChromeDriver was started successfully.2.写一个脚本,类似这样:(可以获取页面的p,h3等信息,)
require 'selenium-webdriver'<br />
#driver = Selenium::WebDriver.for :firefox(如果你使用的是Firefox,就替换这一行)<br />
driver = Selenium::WebDriver.for :chrome#, service: service(如果你需要service,就在上边进行设置)<br />
driver.manage.timeouts.implicit_wait = 10
begin<br />
#driver.get 'http://somedomain/url_that_delays_loading'((官方给的页面,但是无法打开)<br />
driver.get 'https://zhuanlan.kanxue.com/article-20043.htm'<br />
wait = Selenium::WebDriver::Wait.new(:timeout => 10)<br />
search_form = driver.find_element(:id,'header')<br />
ele = wait.until { driver.find_element(css: 'p')}<br />
title = wait.until { driver.find_element(css: 'h3')}<br />
puts "title.text is: "<br />
puts title.text<br />
puts "ele.text is: "<br />
foo = ele.text<br />
puts "foo is: "<br />
puts foo<br />
puts "search_form is: "<br />
puts search_form.inspect<br />
puts "begin hihi"<br />
ensure<br />
driver.quit<br />
end3.运行脚本就可以获得需要的内容:
title.text is:<br />
2022游戏安全行业峰会直播预约通道开启!<br />
ele.text is:<br />
foo is:<br />
近年来,随着游戏用户规模不断扩大、游戏厂商全球化布局加速,游戏安全问题也变得愈加复杂。外挂、黑产、内容安全、DDoS攻击等问题层出不穷!安全问题已经成为游戏运营过程中的重中之重,除了传统的游戏大厂,越来越多的游戏厂商开始着手布局游戏安全的业务模块!<br />
search_form is:<br />
#<Selenium::WebDriver::Element:0x45bbdf7170916ac id="97b5e53e-6294-45b2-a547-d4c2d5df9d40"><br />
begin hihi