Dear subscribers,
I wanted to inform you about a new feature within the Internet Archive
(www.archive.org) that I believe you'll find interesting and useful. My
company, Advanced Software, has been working with Brewster, Brad and the
other good folks at the Archive to incorporate our DocuComp software into
the Wayback Machine. DocuComp is an advanced comparison engine that
identifies inserted, deleted, replaced and moved text (and code) in Web
pages and documents.
You can access it here--
http://web.archive.org/collections/web/advanced.html --by searching for a
specific URL, selecting 'List all pages that match search criteria' and
checking the Comparison box. A list will be produced, similar to this--
http://web.archive.org/web/20010101-20020904*dc_re_/http://www.cnet.com --wh
ere you can then select two dates to compare.
This feature is still in beta, so I would ask your help in trying out the
comparison tool, running any tests you see fit and reporting any issues
(there is an email link on the comparison's page itself), as we're always
working to fine-tune the DocuComp comparison system. There are many
malformed HTML pages on the web, and we're trying to make this comparison
tool work with as many of them as possible.
Some fascinating comparisons discovered by Brewster:
http://docucomp.archive.org/cgi-bin/dc_compare.cgi?urls=http%3A%2F%2Fweb.arc
hive.org%2Fweb%2F20010123224100%2Fhttp%3A%2F%2Fwww.whitehouse.gov%2Fprivacy.
html&urls=http%3A%2F%2Fweb.archive.org%2Fweb%2F20010331175115%2Fhttp%3A%2F%2
Fwww.whitehouse.gov%2Fprivacy.html
http://docucomp.archive.org/cgi-bin/dc_compare.cgi?urls=http://web.archive.o
rg/web/20000815053019/http://www.microsoft.com/info/privacy.htm&urls=http://
web.archive.org/web/20001018121323/http://www.microsoft.com/info/privacy.htm
Best regards,
Eric Quanstrom
Director, Business Development
Advanced Software Inc. (ASI)
eric@...
http://www.docucomp.com