Screen Scraping a Javascript based webpage in Python

I am working on a screen scraping tool in Python. But, as I look through the source of the webpage, I noticed that most of the data is coming through Javascript.

Any idea, how to scrape javascript based webpage ? Any tool in Python ?

Thanks


Scraping javascript-based webpages is possible with selenium. In particular, try the Selenium WebDriver.


I use webkit, which is the browser renderer behind Chrome and Safari. There are Python bindings to webkit through Qt.

And here is a full Python example to execute JavaScript and extract the final HTML.


You can use the QtWebKit module of the PyQt4 library