Hi! I wonder if this is possible with pavuk. I never used it and seems the manpage is a bit overwhelming. I have a website I need to mirror for migration from...
Hello, sorry I forgot to inform this mailinglist: Pavuk 0.9.35 has been released at the 21st of february. Nothing really new included, only bug fixes. RPM...
Hello! Is it possible to retrieve only a single mime-type (application/pdf) from a whole page? When I use "-amimet", HTML-pages with the links to the PDFs are...
Stephan Windmüller
windy@...
Mar 6, 2007 12:39 pm
852
Hello, ... Actually this is what you want :-) You need to add html or you don't get any links. Maybe the -url_pattern switch is also interessting for you? ...
... Well, it works. Isn't that enough? I continously fix all the bugs I get noticed of and that's it. There was a new release short time ago. Ciao -- ...
Hello everybody. pavuk is very powerfull and i like it. but the number of options is totaly confusing me. Is there a way/mode to receive only the documents of...
Hi, Finally I got to taking up the project of porting pavuk to (native) Win32 again after a fat year of utter silence. :-) And it works! (single thread only...
Progress? yes. We've got a working EXEcutable of sorts. Results so far: win32 port now includes multithreading, using pthreads-Win32 (great stuff!), and also...
Thanks for reporting. I found a wicked bug in my new pavuk/chunky code thanks to this URL. Anyway, back to your problem. Currently pavuk does not support...
Correction to what I said just now: pavuk _does_ support 'chunked' content, but it looks like both pavuk and IIS 6.0 (the webserver version we're taking about...
... commandline argument '-nouse_http11' . Yes it work, thanks. ... Sorry, I forgot to specify. Version is 0.9.35. Compile from source on Debian 3.1. ...
Hi, i have a question. I want Pavuk to download 10 pages from one site. The URL is http://www.domain.com -lmax 1 -pattern http://www.domain.com/article* When...
You might want to try the additional -adomain commandline argument (= accept only these domains), e.g. -adomain www.domain.com in order to restrict pavuk to...
Hi, progress report: fixed a few bugs, including the one concerning RFC2616 chunked downloads (which related to this group's thread @ 2007-05-09, subject:...
Hi, thank you really much, but unfortunately that didn't help :-(, because the sites have the same domain. I can give you an example: I have the site...
Hi there, Think I found the reason, why pavuk was downloading so many site sometimes. (Pavuk is downloadung too many sites) If a site is redirected Pavuk does...
Hello, ... There are more options in pavuk to restrict download. Maybe you want to look at -url_pattern. Or any of the other options fits your needs. Ciao -- ...
Can pavuk do live two-way mirroring? I have been looking at pavuk's man page and have noticed that you can run pavuk in the background with the -bg option...
AFAIK pavuk (or wget or rsync) cannot perform bidirectional sync; of course there's always the option to have two unidirectional sync scripts run in parallel,...
Hello, I would like to download only the first level links given on a web page (for offline browsing purposes), including the styles and images on these links....
I am trying to locate information about the format of the Mozilla cache. I noticed that your project supports this format. Could you send me pointers to the...
Hi, Quick question: anyone out there using -fnrules macro %M? Why do I ask? Because I'm debugging the Win32 port, using a -fnrules "*" "%n/%b.%X" style...
Hi Steven, Extremely late answer (the usual excuses apply ;-) ), but I can only say that I have not found any documentation of this format yet. IIRC this was...
FYI, source code in CVS has been updated; now you can use commandline arguments like these: -fnrules F "*" "%d/%b%Y" -mime_types_file ../mime.types to grab...