[問題] BeautifulSoup反選擇
不知BeautifulSoup可否反選擇呢?
以下是問題範例...
##################################
HTML2 = """
<table>
<tr>
<td class>a</td>
<td class>b</td>
<td class>c</td>
<td class>d</td>
</tr>
<tr>
<td class>e</td>
<td class>f</td>
<td class>g</td>
<td class>h</td>
</tr>
</table>
<table cellpadding="0">
<tr>
<td class>111</td>
<td class>222</td>
<td class>333</td>
<td class>444</td>
</tr>
<tr>
<td class>555</td>
<td class>666</td>
<td class>777</td>
<td class>888</td>
</tr>
"""
soup2 = BeautifulSoup(HTML2, 'html.parser')
f2 = soup2.select('table[cellpadding!="0"]') #<---關鍵在此
for div in f2:
row = ''
rows = div.findAll('tr')
for row in rows:
if(row.text.find('td') != False):
print(row.text)
##################################
我想把英文內容個table的td全都取出來,
按如下的形式存到excel當中.
a b c d
e f g h
但怎麼都是取到數字內容的表格.
Is there a hint?
Thanks!
--
※ 發信站: 批踢踢實業坊(ptt.cc), 來自: 118.160.98.32
※ 文章網址: https://www.ptt.cc/bbs/Python/M.1500774608.A.113.html
推
07/23 10:20, , 1F
07/23 10:20, 1F
→
07/23 12:15, , 2F
07/23 12:15, 2F
→
07/23 12:15, , 3F
07/23 12:15, 3F
→
07/23 12:16, , 4F
07/23 12:16, 4F
→
07/23 15:13, , 5F
07/23 15:13, 5F
推
07/23 15:48, , 6F
07/23 15:48, 6F
→
07/23 15:48, , 7F
07/23 15:48, 7F
→
07/23 15:48, , 8F
07/23 15:48, 8F
→
07/23 15:48, , 9F
07/23 15:48, 9F
→
07/23 15:48, , 10F
07/23 15:48, 10F

→
07/23 19:00, , 11F
07/23 19:00, 11F
→
07/23 19:09, , 12F
07/23 19:09, 12F
→
07/23 19:11, , 13F
07/23 19:11, 13F