使用Python进行网页抓取

Z时代
2024-01-10
分类：问答

我想从网站上获取每天的日出/日落时间。是否可以使用Python抓取网络内容？使用什么模块？有没有可用的教程？

回答：

结合使用urllib2和出色的BeautifulSoup库：

import urllib2
from BeautifulSoup import BeautifulSoup
# or if you're using BeautifulSoup4:
# from bs4 import BeautifulSoup
soup = BeautifulSoup(urllib2.urlopen('http://example.com').read())
for row in soup('table', {'class': 'spad'})[0].tbody('tr'):
    tds = row('td')
    print tds[0].string, tds[1].string
    # will print date and sunrise

以上是使用Python进行网页抓取的全部内容，来源链接： utcz.com/qa/405578.html

回到顶部