Data Scraping With Python And Beautiful Soup

I am taking my first steps with Python and Beautiful Soup, trying to scrape data from the Russian statistics website.

Judging by the examples I have found here on Stack Overflow, my code looks correct, yet this simple query returns nothing from the site. When I run it, the Python command line stays blank and no error is raised.

What's wrong here?

My (very simple) code:

from bs4 import BeautifulSoup
import urllib2
url = "http://www.gks.ru/bgd/free/B00_25/IssWWW.exe/Stg/d000/000715.HTM"
page = urllib2.urlopen(url)
soup = BeautifulSoup(page.read())
print(soup)

Answer

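You need to specify a parser when you construct the BeautifulSoup object. Without one, Beautiful Soup picks whichever parser it finds installed, so results can differ between environments: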

soup = BeautifulSoup(page.read(), 'html.parser')
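
For context, here is a minimal sketch of how that one-line fix might fit into a complete script. It assumes Python 3, where urllib.request.urlopen takes the place of urllib2.urlopen from the question. The helper names make_soup and fetch_soup and the SAMPLE string are made up for illustration and are not part of Beautiful Soup. The top-level demo parses an inline string so it runs without network access. Whether the page from the question parses cleanly this way has not been verified here.

```python
from urllib.request import urlopen  # Python 3 counterpart of urllib2.urlopen

from bs4 import BeautifulSoup

# Tiny inline document (hypothetical) so the demo needs no network access.
SAMPLE = "<html><head><title>Demo</title></head><body><p>Hi</p></body></html>"


def make_soup(markup):
    """Parse markup (str or bytes) with the parser named explicitly."""
    # 'html.parser' ships with Python, so nothing extra has to be installed.
    return BeautifulSoup(markup, "html.parser")


def fetch_soup(url):
    """Download a page and parse it (hypothetical helper, needs network access)."""
    with urlopen(url) as page:
        return make_soup(page.read())


# Offline demo: print the text of the <title> element of the sample document.
print(make_soup(SAMPLE).title.string)
```

To try it on the page from the question, you could call fetch_soup(url) with the same url string and print the result, just as the original code does.
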
source: stackoverflow.com