Group Information- Members: 581
- Category: Cyberculture
- Founded: Dec 1, 2002
- Language: English
Yahoo! Groups Tips
Did you know...
Hear how Yahoo! Groups has changed the lives of others. Take me there.
|
Stay up to speed on the latest Groups news and updates, visit the
Groups blog today!
Description
|
Discussion group for the Heritrix open-source archival web crawler project.
|
Re: poor performance
Hi we have found what was the problem with speed we had set mirrorwriter processor, which caused a lot of disk operations which slowed disk down and used a lot
Posted - Sat Jul 4, 2009 7:23 am
|
nukleonrus
Offline Send Email
|
homepage content extraction
Hi, Is there any way of extracting contents of homepage given a URI under that domain/host at runtime in heritrix? ex: URI is,
Posted - Thu Jul 2, 2009 8:35 am
|
ramab1988
Offline Send Email
|
|
Posted - Wed Jul 1, 2009 6:01 pm
|
steve@...
stearcorg
Online Now Send Email
|
|
Posted - Wed Jul 1, 2009 7:35 am
|
Enrico Detoma
enrico.detoma@...
Send Email
|
Re: Crawl stops after some time
Hi Gordon, An instance pretty much stopped ( like 0 active toe threads of 50 ). But it continued after sometime. Regards Abin Varghese ... From: Gordon Mohr
Posted - Wed Jul 1, 2009 2:07 am
|
Ebin
mail2abin
Offline Send Email
|
Add archive-crawler to your personalized My Yahoo! page What's This?
|
Message History
Group Email Addresses
| Related Link: |
http://crawler.archive.org |
| Post message: |
archive-crawler@yahoogroups.com |
| Subscribe: |
archive-crawler-subscribe@yahoogroups.com |
| Unsubscribe: |
archive-crawler-unsubscribe@yahoogroups.com |
| List owner: |
archive-crawler-owner@yahoogroups.com |
|