Screen scraping: getting around "HTTP Error 403: request disallowed by robots.txt"

oh you need to ignore the robots.txt

br = mechanize.Browser()
br.set_handle_robots(False)

You can try lying about your user agent (e.g., by trying to make believe you're a human being and not a robot) if you want to get in possible legal trouble with Barnes & Noble. Why not instead get in touch with their business development department and convince them to authorize you specifically? They're no doubt just trying to avoid getting their site scraped by some classes of robots such as price comparison engines, and if you can convince them that you're not one, sign a contract, etc, they may well be willing to make an exception for you.

A "technical" workaround that just breaks their policies as encoded in robots.txt is a high-legal-risk approach that I would never recommend. BTW, how does their robots.txt read?

The code to make a correct request:

br = mechanize.Browser()
br.set_handle_robots(False)
br.addheaders = [('User-agent', 'Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.0.1) Gecko/2008071615 Fedora/3.0.1-1.fc9 Firefox/3.0.1')]
resp = br.open(url)
print resp.info()  # headers
print resp.read()  # content

By Emacs, how to join two lines into one?

android.support.v4.content.FileProvider not found

How important are Design Patterns really? [closed]

Xcode 4 shortcut for swapping focus between editor and assistant editor

Weak' must not be applied to non-class-bound consider adding a protocol conformance that has a class bound [duplicate]

Removed operator!= in C++20 standard library [duplicate]

How can I parse out points and draw a route on a Google Map in Android?

How to make TRichEdit behave like WordPad on Windows 7 when changing font for certain non-text characters?

Is it possible to directly apply an affine transformation matrix to a Mayavi ImageActor object?

VectorKit crash reports with MKMapSnapshotter on iOS

NPE in ChangeCurrentByOneFromLongPressCommand (on Samsung devices w/ Android 4.3)

'Repa' performance for planetary simulation