pavuk · Pavuk Webgrabber Mailing List
pavuk-0.9.31 odd behaviour
Message #766 of 988
Greetings,
I need to download the contents of this website, except for certain
directories:
http://www.microwerks.net/~hugo/download/MondoCD
using pavuk-0.9.31 (on Gentoo):

pavuk -skip_url_pattern '*/RPMS/*,*/SRPMS/*,*.rpm, /old/*' -logfile
pavuk.log -dont_leave_site -dont_leave_dir
http://www.microwerks.net/~hugo/download/MondoCD
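
As a rough way to sanity-check the pattern list above, here is a minimal Python sketch that applies the same comma-separated patterns as shell-style wildcards. It assumes the patterns are matched against the full URL and that whitespace around the commas is kept; neither is confirmed in this thread, and pavuk's real matching may differ. The file names under RPMS/, SRPMS/ and old/ are hypothetical.

```python
from fnmatch import fnmatchcase

# Pattern list copied from the command above, split on commas as-is
# (so the fourth pattern keeps its leading space).
PATTERNS = "*/RPMS/*,*/SRPMS/*,*.rpm, /old/*".split(",")

BASE = "http://www.microwerks.net/~hugo/download/MondoCD"


def skipped_by(url, patterns=PATTERNS):
    """Return the first wildcard pattern that matches url, or None."""
    for pattern in patterns:
        if fnmatchcase(url, pattern):
            return pattern
    return None


if __name__ == "__main__":
    # Hypothetical paths below BASE, used only for illustration.
    for path in ("/RPMS/a.rpm", "/SRPMS/a.src.rpm", "/b.rpm", "/old/readme.txt"):
        print(path, "->", repr(skipped_by(BASE + path)))
```

Under these assumptions the first three patterns match, but ' /old/*' does not: it has no leading '*' and begins with a space, so a full URL can never match it.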

Even though I used -dont_leave_site and -dont_leave_dir, it tries to
download wrong URLs, as if it had started in /, and in the end it quits:

URL[ 2]: 261(60) of 277
http://www.microwerks.net/~hugo/download/about/about.html
download: ERROR: HTTP document not found
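
One possible explanation, not confirmed anywhere in this thread, is that the starting URL has no trailing slash, so a tool may treat MondoCD as a file and take its parent as the starting directory. The Python sketch below only illustrates that reading of a URL; start_directory and within are hypothetical helpers, not part of pavuk, and pavuk's actual -dont_leave_dir logic may work differently.

```python
from urllib.parse import urlsplit


def start_directory(url):
    """Directory part of the URL path, always ending in '/'.

    A path without a trailing slash is treated as a file, so its
    parent directory is returned (an assumption, see above).
    """
    path = urlsplit(url).path
    if path.endswith("/"):
        return path
    return path.rsplit("/", 1)[0] + "/"


def within(url, start_url):
    """True if url lies at or below the starting directory."""
    return urlsplit(url).path.startswith(start_directory(start_url))


if __name__ == "__main__":
    start = "http://www.microwerks.net/~hugo/download/MondoCD"
    failing = "http://www.microwerks.net/~hugo/download/about/about.html"
    for candidate in (start, start + "/"):
        print(candidate, "->", start_directory(candidate),
              "| about.html in scope:", within(failing, candidate))
```

Under this reading, the URL from the error message is in scope for the starting URL as given in the command, and out of scope once a trailing slash is added.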

What is the correct syntax? If it finds files that are already downloaded
(I have already downloaded 100 MB), does it compare the date and the size
to decide whether to re-download a file?
As I understand it, the default is analogous to --no-clobber in wget, but it
seems to skip files, reporting "File redirect".


TIA for any advice :)
Eli


Sun Dec 5, 2004 12:11 am

ml@...

Attachment: pavuk.log (type: text/x-log)
(Message over 64k, truncated.)
Eli Spizzichino, ml@..., Dec 6, 2004 9:43 am