[問題] 自動載入網頁且分析問題
hi 最近寫python遇到幾個問題,想跟大家請教
1.我有一個txt檔,內有多筆網址,這是我爬蟲下來的隨機網站,如下範例
http://goo.gl/hZM42U
http://goo.gl/fjJ0lG
http://goo.gl/N9HjLw
..........................等多筆資料
我希望在下面的url中,可以自動載入上述的網址,
import xml
from __future__ import division
import nltk, re, pprint
from urllib import urlopen
url = "http://goo.gl/hZM42U"
text = urlopen(url).read()
你可以觀察url = "http://goo.gl/hZM42U ",這一行就可。
我想請問要如何自動把網址加入url這一行中呢?我原本是打算一次open這個txt檔,可以是馬上就出錯了,
只好一筆筆手動貼上,還請大家幫我解答,謝謝。
--
Sent from my Windows
--
※ 發信站: 批踢踢實業坊(ptt.cc), 來自: 123.110.158.25
※ 文章網址: https://www.ptt.cc/bbs/Python/M.1459524000.A.B07.html
→
04/01 23:23, , 1F
04/01 23:23, 1F
→
04/01 23:23, , 2F
04/01 23:23, 2F
※ 編輯: busystudent (42.72.191.2), 04/01/2016 23:28:06
推
04/01 23:56, , 3F
04/01 23:56, 3F
→
04/01 23:58, , 4F
04/01 23:58, 4F
→
04/02 00:00, , 5F
04/02 00:00, 5F
→
04/02 00:00, , 6F
04/02 00:00, 6F
→
04/02 00:00, , 7F
04/02 00:00, 7F
→
04/02 00:07, , 8F
04/02 00:07, 8F
→
04/02 00:07, , 9F
04/02 00:07, 9F
→
04/02 09:48, , 10F
04/02 09:48, 10F
→
04/02 12:08, , 11F
04/02 12:08, 11F
→
04/02 12:20, , 12F
04/02 12:20, 12F
→
04/02 13:38, , 13F
04/02 13:38, 13F
→
04/02 14:24, , 14F
04/02 14:24, 14F
→
04/02 14:49, , 15F
04/02 14:49, 15F
→
04/02 14:50, , 16F
04/02 14:50, 16F
推
04/02 16:11, , 17F
04/02 16:11, 17F
→
04/02 16:11, , 18F
04/02 16:11, 18F