无法从BeautifulSoup获取表

from BeautifulSoup import BeautifulSoup
import urllib2

url = 'http://www.data.jma.go.jp/obd/stats/etrn/view/monthly_s3_en.php?block_no=47401&view=1'
html = urllib2.urlopen(url).read()        
soup = BeautifulSoup(html)
table = soup.find('table')
print table

预期表未生成。

我想抓住下表:

在此处输入图片说明

帕德拉克·坎宁安(Padraic Cunningham)

首先,使用BS4 beaufifulsoup3不再维持,也希望表有类*data2_s*,称find("table")只是得到了第一台这是不是你想要的网页上:

from bs4 import BeautifulSoup
import urllib2

url = 'http://www.data.jma.go.jp/obd/stats/etrn/view/monthly_s3_en.php?block_no=47401&view=1'
html = urllib2.urlopen(url).read()
soup = BeautifulSoup(html)
table = soup.select_one("table.data2_s") # or table = soup.find("table", class_="data2_s")
print table

这给你:

<table class="data2_s"><caption class="m">WAKKANAI   WMO Station ID:47401 Lat 45<sup>o</sup>24.9'N  Lon 141<sup>o</sup>40.7'E</caption><tr><th scope="col">Year</th><th scope="col">Jan</th><th scope="col">Feb</th><th scope="col">Mar</th><th scope="col">Apr</th><th scope="col">May</th><th scope="col">Jun</th><th scope="col">Jul</th><th scope="col">Aug</th><th scope="col">Sep</th><th scope="col">Oct</th><th scope="col">Nov</th><th scope="col">Dec</th><th scope="col">Annual</th></tr><tr class="mtx" style="text-align:right;"><td style="text-align:center">1938</td><td class="data_0_0_0_0">-5.2</td><td class="data_0_0_0_0">-4.9</td><td class="data_0_0_0_0">-0.6</td><td class="data_0_0_0_0">4.7</td><td class="data_0_0_0_0">9.5</td><td class="data_0_0_0_0">11.6</td><td class="data_0_0_0_0">17.9</td><td class="data_0_0_0_0">22.2</td><td class="data_0_0_0_0">16.5</td><td class="data_0_0_0_0">10.7</td><td class="data_0_0_0_0">3.3</td><td class="data_0_0_0_0">-4.7</td><td class="data_0_0_0_0">6.8</td></tr>
<tr class="mtx" style="text-align:right;"><td style="text-align:center">1939</td><td class="data_0_0_0_0">-7.5</td><td class="data_0_0_0_0">-6.6</td><td class="data_0_0_0_0">-1.4</td><td class="data_0_0_0_0">4.0</td><td class="data_0_0_0_0">7.5</td><td class="data_0_0_0_0">13.0</td><td class="data_0_0_0_0">17.4</td><td class="data_0_0_0_0">20.0</td><td class="data_0_0_0_0">17.4</td><td class="data_0_0_0_0">9.7</td><td class="data_0_0_0_0">3.0</td><td class="data_0_0_0_0">-2.5</td><td class="data_0_0_0_0">6.2</td></tr>
<tr class="mtx" style="text-align:right;"><td style="text-align:center">1940</td><td class="data_0_0_0_0">-6.0</td><td class="data_0_0_0_0">-5.7</td><td class="data_0_0_0_0">-0.5</td><td class="data_0_0_0_0">3.5</td><td class="data_0_0_0_0">8.5</td><td class="data_0_0_0_0">11.0</td><td class="data_0_0_0_0">16.6</td><td class="data_0_0_0_0">19.7</td><td class="data_0_0_0_0">15.6</td><td class="data_0_0_0_0">10.4</td><td class="data_0_0_0_0">3.7</td><td class="data_0_0_0_0">-1.0</td><td class="data_0_0_0_0">6.3</td></tr>
<tr class="mtx" style="text-align:right;"><td style="text-align:center">1941</td><td class="data_0_0_0_0">-6.5</td><td class="data_0_0_0_0">-5.8</td><td class="data_0_0_0_0">-2.6</td><td class="data_0_0_0_0">3.6</td><td class="data_0_0_0_0">8.1</td><td class="data_0_0_0_0">11.4</td><td class="data_0_0_0_0">12.7</td><td class="data_0_0_0_0">16.5</td><td class="data_0_0_0_0">16.0</td><td class="data_0_0_0_0">10.0</td><td class="data_0_0_0_0">4.0</td><td class="data_0_0_0_0">-2.9</td><td class="data_0_0_0_0">5.4</td></tr>
<tr class="mtx" style="text-align:right;"><td style="text-align:center">1942</td><td class="data_0_0_0_0">-7.8</td><td class="data_0_0_0_0">-8.2</td><td class="data_0_0_0_0">-0.8</td><td class="data_0_0_0_0">3.5</td><td class="data_0_0_0_0">7.1</td><td class="data_0_0_0_0">12.0</td><td class="data_0_0_0_0">17.4</td><td class="data_0_0_0_0">18.4</td><td class="data_0_0_0_0">15.7</td><td class="data_0_0_0_0">10.5</td><td class="data_0_0_0_0">2.5</td><td class="data_0_0_0_0">-2.9</td><td class="data_0_0_0_0">5.6</td></tr>
etc...................................

本文收集自互联网,转载请注明来源。

如有侵权,请联系[email protected] 删除。

编辑于
0

我来说两句

0条评论
登录后参与评论

相关文章

来自分类Dev

无法获取相关表的属性

来自分类Dev

使用beautifulsoup从Wikipedia表中获取列

来自分类Dev

无法从BeautifulSoup的页面获取实际的标记

来自分类Dev

在BeautifulSoup中获取表的内容

来自分类Dev

如何在BeautifulSoup中从表的td中获取价值?

来自分类Dev

无法通过beautifulsoup webscraping python获取标签“ rel”

来自分类Dev

使用BeautifulSoup进行Web抓取-无法提取表行

来自分类Dev

python BeautifulSoup无法从网页获取文本

来自分类Dev

无法使用BeautifulSoup获取span属性的文本

来自分类Dev

BeautifulSoup无法解析表中的内容

来自分类Dev

Selenium和BeautifulSoup无法获取所有HTML内容

来自分类Dev

使用BeautifulSoup无法在</ span>标记后立即获取文本

来自分类Dev

beautifulsoup无法获取所有数据

来自分类Dev

无法使用BeautifulSoup获取锚标签

来自分类Dev

使用BeautifulSoup获取表内容

来自分类Dev

我的for循环无法在BeautifulSoup中获取imgae src

来自分类Dev

BeautifulSoup:如何获取此表的特定文本?

来自分类Dev

无法识别BeautifulSoup评论表中的差异

来自分类Dev

python:无法使用BeautifulSoup从html获取特定数据

来自分类Dev

使用BeautifulSoup从html表中获取数据

来自分类Dev

Python BeautifulSoup无法获取完整表

来自分类Dev

无法使用BeautifulSoup获取某些数据

来自分类Dev

无法从HTML获取表值

来自分类Dev

无法使用beautifulsoup提取表数据

来自分类Dev

无法从表中获取数据

来自分类Dev

使用 BeautifulSoup find 方法从表行中获取数据

来自分类Dev

无法使用 BeautifulSoup 从 html 获取日期

来自分类Dev

无法解析来自网站 bseindia.com 的表数据 beautifulsoup

来自分类Dev

BeautifulSoup:无法获取文本

Related 相关文章

热门标签

归档