Screenscraping the Senate (9 tags)
In Paul Ford's first Hacking Congress column, he shows us how to turn information on the U.S. Senate site into RDF.
Spidering Hacks (6 tags)
This week we offer two hacks from Spidering Hacks that save you time as well as extra trips to your favorite web sites. The first is on using Template::Extract, a Perl module that allows you to scrape a web page to generate RSS from its data structure. And the second is on using a program called dailystrips to grab all your favorite online comic strips and have them presented in one HTML file.