python3.3 web parser - plonee's blog

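I've wanted to write an updater program for a while,

to make it easier to pull MLB data (I'd also heard Python is strong at this sort of thing, so I'm giving it a try).

As a first step, I got a bare-bones program working that fetches a web page, parses out a specific region, and writes the result to a CSV file.

Jotting it down here.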

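#coding: utf-8
import csv
import urllib.request
from html.parser import HTMLParser

# Fetch the page and decode the bytes as UTF-8
data = urllib.request.urlopen('http://tw.movies.yahoo.com/movieinfo_main.html/id=4569')
content = data.read().decode('utf_8')
data.close()

# newline='' is what the csv module expects; without it Windows gets blank rows
f = open('example.csv', 'wt', newline='', encoding='utf-8')
writer = csv.writer(f)

class myparser(HTMLParser):
    def __init__(self):
        HTMLParser.__init__(self)
        self.isNumber = 0    # set to 1 right after a <span class="dta"> opens
        self.numbers = []    # not used yet

    def handle_data(self, data):
        # Write the first text chunk after a matching <span> to the CSV
        if self.isNumber == 1:
            writer.writerow([data])
            print(data)
            self.isNumber = 0

    def handle_starttag(self, tag, attrs):
        # Matches only when class="dta" is the span's sole attribute
        if tag == 'span' and attrs == [('class','dta')]:
            self.isNumber = 1

Parser = myparser()
Parser.feed(content)
f.close()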
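
The program above only catches a span whose one and only attribute is class="dta", and it keeps just the first chunk of text inside it. Below is a slightly sturdier sketch of the same idea, offered as one possible variation rather than a tested replacement: it matches "dta" anywhere in the class list, joins text from nested tags, and collects the values before writing them out. The names SpanTextParser, extract_to_csv and SAMPLE_HTML are made up for illustration, and the sample markup is invented so the example runs without touching the network; I have not checked it against the real Yahoo page.

```python
import csv
import io
from html.parser import HTMLParser


class SpanTextParser(HTMLParser):
    """Collect the text of every <span> whose class list contains target_class."""

    def __init__(self, target_class):
        super().__init__()
        self.target_class = target_class
        self.depth = 0      # > 0 while inside a matching <span>
        self.buffer = []    # text fragments of the span being read
        self.values = []    # finished span texts, in document order

    def handle_starttag(self, tag, attrs):
        if tag != 'span':
            return
        if self.depth > 0:
            # A nested <span> inside a match: track it so its end tag
            # does not close the outer span too early.
            self.depth += 1
            return
        # attrs is a list of (name, value) pairs; value may be None
        classes = (dict(attrs).get('class') or '').split()
        if self.target_class in classes:
            self.depth = 1
            self.buffer = []

    def handle_endtag(self, tag):
        if tag == 'span' and self.depth > 0:
            self.depth -= 1
            if self.depth == 0:
                self.values.append(''.join(self.buffer).strip())

    def handle_data(self, data):
        if self.depth > 0:
            self.buffer.append(data)


def extract_to_csv(html, target_class, out):
    """Parse html, write one CSV row per matching span to out, return the values."""
    parser = SpanTextParser(target_class)
    parser.feed(html)
    parser.close()
    writer = csv.writer(out)
    for value in parser.values:
        writer.writerow([value])
    return parser.values


# Invented sample markup standing in for the downloaded page (an assumption)
SAMPLE_HTML = (
    '<div><span class="dta">2013-05-01</span>'
    '<span class="dta big" id="x">120 <b>min</b></span>'
    '<span class="other">skip me</span></div>'
)

if __name__ == '__main__':
    buf = io.StringIO()
    print(extract_to_csv(SAMPLE_HTML, 'dta', buf))
    print(buf.getvalue())
```

To write a real file instead of an in-memory buffer, pass a file opened with open('example.csv', 'w', newline='', encoding='utf-8') as the out argument, ideally inside a with block so it gets closed even if parsing fails.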


Posted by plonee on PIXNET
