如何在python中使用Selenium和Beautifulsoup解析网站？

Z时代
2024-01-10
分类：问答

编程新手，并弄清楚了如何使用Selenium导航到我需要去的地方。我想立即解析数据，但不确定从哪里开始。有人可以握我的手几秒钟，并朝正确的方向指点我吗？

任何帮助表示赞赏-

回答：

假设您在要解析的页面上，Selenium将源HTML存储在驱动程序的page_source属性中。这样，你会加载page_source到BeautifulSoup如下：

In [8]: from bs4 import BeautifulSoup
In [9]: from selenium import webdriver
In [10]: driver = webdriver.Firefox()
In [11]: driver.get('http://news.ycombinator.com')
In [12]: html = driver.page_source
In [13]: soup = BeautifulSoup(html)
In [14]: for tag in soup.find_all('title'):
   ....:     print tag.text
   ....:     
   ....:     
Hacker News

以上是如何在python中使用Selenium和Beautifulsoup解析网站？的全部内容，来源链接： utcz.com/qa/419066.html

回到顶部