Website downloader (cache?) to view sites offline

Solution 1:

I use HTTrack.

It allows you to download a World Wide Web site from the Internet to a local directory, building recursively all directories, getting HTML, images, and other files from the server to your computer

Solution 2:

I use wget with these options to mirror a site for offline use

wget -cmkE -np http://example.com/a/section/i/like

where

-m turns on mirroring options for mirroring a site locally

-c continues a previous download in case I have already downloaded some pages

-k converts absolute href to point to local ones for offline viewing

-E ensures files have .html extension after download.

-np only downloads objects under /a/section/i/ and does not cache the whole site.

For example I wanted to download south documentation but not south tickets, etc...

wget -cmkE -np http://south.aeracode.org/docs/index.html

I use Windows and run wget on cygwin but there is also a native windows wget port.

Although, in your case, you can download python offline docs from python docs section

Solution 3:

Try http://www.downthemall.net/ a Firefox plugin. I've used it to download 250 pages of PDFs in 20+ separate files. It is extremely powerful. It has a wildcard/query syntax that lets you surgically get only the files that you want and none of the irrelevant ones you don'.

Solution 4:

Some Firefox extensions that I know of:

  • ScrapBook

    helps you to save Web pages and easily manage collections. Key features are lightness, speed, accuracy and multi-language support. Major features are:

    • Save Web page
    • Save snippet of Web page
    • Save Web site
    • Organize the collection in the same way as Bookmarks
    • Full text search and quick filtering search of the collection
    • Editing of the collected Web page
    • Text/HTML edit feature resembling Opera's Notes
  • ScrapBook Plus

    Difference between ScrapBook Plus and ScrapBook:

    • faster sorting
    • faster import and export
    • open the window to manage your collection directly from the sidebar
    • simplified the handling of the "Combine Wizard"
    • new features for "Capture Multiple URLs" (filter to exclude links, use title of the web site or title of link as new title for the entry in the sidebar, specify waiting time between to download from 0 to 3 seconds, use UTF-8 or ISO-8859-1 as character set)
    • new "Capture" window (download needs to be started manually, automated scrolling turned off)
    • 6 highlighters in the editor
  • UnMHT

    allows you to view MHT (MHTML) web archive format files, and save complete web pages, including text and graphics, into a single MHT file

  • Pocket (not an extension; a built-in Firefox feature)

    lets you save web pages and videos to Pocket in just one click. Pocket strips away clutter and saves the page in a clean, distraction-free view and lets you access them on the go through the Pocket app.

    Note that:

    Saving to and accessing your Pocket list on Firefox requires an Internet connection. Offline viewing is possible on other devices with the Pocket app.