Aaron - In response to your query regarding the Internet Archive collection,
the website is still accurate. We primarily collect ASCII text. In the past
four months we have begun image crawls and have about 1 terabyte. We
expected a larger collection, but a large number of URLs have robot
exclusions and cannot be crawled. As for accepting donations of other
Internet collections, we are always interested in considering proposals.
Thanks for your interest.
Gail Feldman
Policy and Communications Manager
-----Original Message-----
From: archivists-admin@...
[mailto:archivists-admin@...]On Behalf Of Aaron Swartz
Sent: Monday, July 24, 2000 8:39 AM
To: archivists@...
Subject: Re: [Archivists] A variety of fish in my net!
Electronic Information Systems Librarian <xlib@...> wrote:
> 1. Aaron Swartz wrote of "the work of the Internet Archive" Is that what this
> list is meant to be about? I had completely forgotten about it, but by
> plugging in www.archive.org into my web browser, was reminded that it was here
> that I joined this list! I notice also that the site still says that since
> 1998 they have only been collecting ASCII text. Is that really still the case?
>
> Aaron asked if we should "focus on more specialized archives rather than
> trying to archive the entire Web". Indeed my hope was to elicit help from
> other people in how to archive an extremely specialised subset of electronic
> documents (about or mentioning the Baha'i Faith) with extremely limited
> resources - only a part of my job, and just me with one lowly PC attached to a
> network, as part of a total library staff of 15 people.
Well, the website says the list is for "discussion on Internet libraries"
which is rather broad. Perhaps the Internet Archive could work out a
distributed system allowing people like you to work on smaller subsets
(Baha'i) of the Web and contribute your work to the archive. The archive
could provide you with the tools and technologies to spider and store the
information you'd like, and in return you could provide them with the data.
However, I haven't yet heard from anyone at the archive on this list, so I
don't know how feasible this is.
--
Aaron Swartz |"This information is top security.
<http://swartzfam.com/aaron/>| When you have read it, destroy yourself."
<http://www.theinfo.org/> | - Marshall McLuhan
_______________________________________________
Archivists mailing list
Archivists@...
http://www.archive.org/mailman/listinfo/archivists