I'm looking for a web spider for Ubuntu like Webripper from Calluna Software. It lets you download a whole site, as you can with

wget -r -m example.com

but the feature I'm looking for is the ability to enter a search term like "Linux" and have it search the web and download the results. Are there any programs for Ubuntu like this?


Solution 1:

Give httrack (CLI) or webhttrack (web interface) a shot; they're in the universe repo. I'm not sure about the search-term feature you describe, but it does offer a bunch of easily configurable options.

http://packages.ubuntu.com/de/oneiric/webhttrack

HTTrack Website Copier - Free Software Offline Browser (GNU GPL)
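
For what it's worth, a minimal sketch of the CLI route might look like this. The mirror_site name, the URL and the output directory are placeholders of mine, not anything from httrack itself; only the -O (output path) option is httrack's own.

```shell
# Install the command-line tool first (webhttrack is the package with the web interface):
#   sudo apt-get install httrack

# Hypothetical helper: mirror one site into a local directory.
mirror_site() {
    url="$1"    # site to copy, e.g. http://example.com/
    dest="$2"   # directory the mirror is written to
    # -O tells httrack where to put the mirrored files.
    httrack "$url" -O "$dest"
}

# Example call (placeholder URL and path):
#   mirror_site "http://example.com/" "$HOME/mirrors/example"
```

This only copies a site you already know about; it does not cover the search-term part of the question.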

Solution 2:

You could use Google Alerts to set up a standing search whose results are delivered as a feed, and then read that feed in an RSS reader or Thunderbird.

I use Thunderbird for RSS. I don't know whether any RSS readers can export a feed to simple HTML.
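
To get from the feed to downloaded pages, something along these lines might work. It's a rough sketch, assuming the alert is set to deliver to a feed and that the feed exposes the result URLs in href="..." attributes; extract_links and FEED_URL are names I made up, and the grep/sed approach is a crude stand-in for a real XML parser.

```shell
# Hypothetical helper: read feed XML on stdin, print each href="..." value on its own line.
extract_links() {
    grep -o 'href="[^"]*"' |
        sed -e 's/^href="//' -e 's/"$//' -e 's/&amp;/\&/g'   # strip the attribute wrapper, unescape &amp;
}

# Example pipeline (FEED_URL is a placeholder for your own alert's feed address):
#   wget -q -O - "$FEED_URL" | extract_links | wget -p -k -i -
# -O - writes the feed to stdout, -i - reads the URL list from stdin,
# -p fetches page requisites (images, CSS) and -k rewrites links for offline viewing.
```

Note that this picks up every href in the feed, including any link back to the feed itself, and the links may point through a Google redirect rather than straight at the page, so the list probably needs filtering before you hand it to wget.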