Re: [問題] 網路爬蟲抓不到標籤<img>的src屬性

看板Python作者rexyeah (ccccccc)時間7年前 (2018/11/11 21:30)推噓1(1推 0噓 3→)

留言4則, 3人參與討論串1/1

不在意速度的話... from selenium import webdriver from bs4 import BeautifulSoup url = 'https://v.comicbus.com/online/comic-103.html?ch=924' browser = webdriver.PhantomJS() browser.get(url) html = browser.page_source soup = BeautifulSoup(html, 'html.parser') img_url = 'https:%s' % soup.find('img', {'id': 'TheImg'})['src'] print img_url ==== 不過其實phantomjs已經deprecated了，但還是可以用。上面那段我自己跑過，可以抓到，只是真的很慢 ※ 引述《bugbug777 (sil)》之銘言： : 大家好，小魯是個網路爬蟲新手 : 最近想來寫一個下載圖片的網路爬蟲 : 這裡附上簡短的程式碼 : <img border="0" id="TheImg" name="TheImg"/> : 似乎抓不到src的這個屬性，請問這是為什麼？ : 圖示8comic的海賊王924話圖片 : https://imgur.com/ccnRjKr