Python Urllib2 Parse Html Problem

October 27, 2023

I am using mechanize to parse the HTML of a website, but with this particular site I get a strange result.

from mechanize import Browser
br = Browser()
r = br.open('http://www.heavenplaza.com')
re

Solution 1:

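import urllib2, StringIO, gzip

# Fetch the raw response body (this site appears to send it gzip-compressed).
f = urllib2.urlopen("http://www.heavenplaza.com")
# Wrap the raw bytes in a file-like object so GzipFile can read them.
data = StringIO.StringIO(f.read())
gzipper = gzip.GzipFile(fileobj=data)
# Print the decompressed HTML.
print gzipper.read()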
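
For readers on Python 3, where urllib2 and StringIO no longer exist, the same idea can be sketched with gzip and io.BytesIO. This is only a sketch under the assumption that the "strange result" is a gzip-compressed body; the helper name maybe_gunzip and the magic-byte check are illustrative and not part of the original answer. The sample data is generated locally so the snippet runs without touching the network.

```python
import gzip
import io

# Every gzip stream starts with these two bytes.
GZIP_MAGIC = b"\x1f\x8b"


def maybe_gunzip(body):
    """Return body decompressed if it looks like gzip, otherwise unchanged."""
    if body[:2] == GZIP_MAGIC:
        # Same approach as the answer above: wrap the bytes in a file-like
        # object and let GzipFile do the decompression.
        with gzip.GzipFile(fileobj=io.BytesIO(body)) as gz:
            return gz.read()
    return body


# Local stand-ins for a response body (hypothetical sample data).
compressed = gzip.compress(b"<html>hello</html>")
print(maybe_gunzip(compressed))
print(maybe_gunzip(b"<html>plain</html>"))
```

With a real response, the bytes returned by urllib.request.urlopen(url).read() would be passed to maybe_gunzip in place of the sample data.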

Solution 2:

I quickly ran the script in the console, and the site was returning garbage instead of readable HTML. You probably need to change your HTTP User-Agent header to something that does not make the site think the request comes from a robot.

The same script works against http://www.google.com.
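
One way to try the User-Agent suggestion in Python 3 is to build the request by hand with urllib.request. This is a minimal sketch: the helper name build_request and the browser-like User-Agent string are assumptions chosen for illustration, and whether this particular site accepts them is untested. The snippet only constructs the request, so it runs without network access.

```python
import urllib.request

# Hypothetical browser-like value; any string a real browser sends would do.
DEFAULT_USER_AGENT = "Mozilla/5.0 (X11; Linux x86_64)"


def build_request(url, user_agent=DEFAULT_USER_AGENT):
    """Build a Request that carries a custom User-Agent header."""
    return urllib.request.Request(url, headers={"User-Agent": user_agent})


req = build_request("http://www.heavenplaza.com")
# Request stores header names capitalized as "User-agent".
print(req.get_header("User-agent"))

# To actually fetch the page, pass the request to urlopen:
# html = urllib.request.urlopen(req).read()
```

With mechanize itself, the equivalent is to set the browser's addheaders attribute to a list containing a ('User-agent', ...) tuple before calling open.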
