Re: [請益] parser 文字

看板PHP作者woominin (沒事就好)時間9年前 (2014/09/28 13:46)推噓2(2推 0噓 9→)

留言11則, 5人參與討論串4/4 (看更多)

原文恕刪想請問前輩們小弟在parser網頁遇到一個新的問題就是用原本的 simple_parser_dom的工具來parser http://tour.taitung.gov.tw/zh-tw/Home/Index 會出錯問題1 : 如何解再來小弟到處研究了一下用了另一個 curl <?php # Use the Curl extension to query Google and get back a page of results $url = "http://tour.taitung.gov.tw/zh-tw/Home/Index"; $ch = curl_init(); $timeout = 5; curl_setopt($ch, CURLOPT_URL, $url); curl_setopt($ch, CURLOPT_RETURNTRANSFER, 1); curl_setopt($ch, CURLOPT_CONNECTTIMEOUT, $timeout); $html = curl_exec($ch); curl_close($ch); # Create a DOM parser object $dom = new DOMDocument(); # Parse the HTML from Google. # The @ before the method call suppresses any warnings that # loadHTML might throw because of invalid HTML in the page. @$dom->loadHTML($html); # Iterate over all the <a> tags foreach($dom->getElementsByTagName('a') as $link) { # Show the <a href> echo $link->getAttribute('href'); echo "<br />"; } foreach($dom->getElementsByTagName('a') as $v) { echo $v->getAttribute('title'); echo "<br />"; } ?> 用上面的語法是parser出來了，不過parser回來的字是亂碼試著加入 $v = mb_convert_encoding($v,"BIG5","UTF-8"); 結果會出錯請教這如何解呢 ? -- ※ 發信站: 批踢踢實業坊(ptt.cc), 來自: 49.158.112.110 ※ 文章網址: http://www.ptt.cc/bbs/PHP/M.1411883164.A.113.html

→

bibo9901

09/28 14:02, , 1^F

09/28 14:02, 1^F

→

woominin

09/28 14:14, , 2^F

09/28 14:14, 2^F

推

bency

09/28 15:25, , 3^F

09/28 15:25, 3^F