I have a fairly simple task to do, of crawling a website for certain types of links. However, wget seems unable to do this efficiently (and honestly segfaults...
Greg Hazel
gah@...
Oct 20, 2003 8:07 am
731
Hi, I'm trying to use pavuk to mirror the cygwin distribution, but it fails to mirror the sub-directories, i.e.: pavuk -mode sync \ -ftp_proxy...
I believe that at least the first 4 or 5 items are easy to do. I'll have to experiment a bit on the 6th. I'm out of town on business just now, but as soon as I...
Hello! I'm just looking for help with some problems concearning PAVUK. So I found this place, and I wan't to test wether Yahoo accepts my mails. Intelligent...
Is there a way to add multiple "TrStrToStr:" conditions in the configuration file. I need to replace more than one string pattern in the local name of the...
"ftpdir" mode seems not going recursively through a ftp server with pavuk. is it correct? is it possible to recursively list the contents of a ftp server with...
Hello! I want to thank the pavuk community[1] for supporting this program[2], but I also want to protest[3] against the bad ... attention which seems to be ...
Hi, pavuk -store_name foo.html -singlepage http://www.theherald.co.uk/6123.shtml From what I understand from the pavuk documentation, this should download the ...
Jim Farrand
jim@...
Dec 15, 2003 11:40 am
740
Hi, ... What against sourceforge? http://pavuk.sourceforge.net/ ... Well, you always get the latest version when using the Sourceforge-CVS. ... According to...
it's a bug. I fixed it at Netli before they fired me. I've got two sets of updates to merge with the mainline tree. I'm sorry I haven't been very responsive...
Martin Fouts
lists@...
Jan 4, 2004 3:13 pm
743
Could someone please show me what type of regexps the Windows command line version of Pavuk uses to exclude binary files? This is my attempt: ./pavuk.exe...
Terrence Brannon
metaperl@...
Jan 4, 2004 3:13 pm
744
Hi, ... Under Unix I usually join multiple patterns, as I never got the (aaa|bbb|ccc)method to work (probably it is not supported - check source?) Use...
Hi,
I have a problem getting the -js_transform2 option to work properly. I
have
javascripts which have lines like this:
SHFTHZFIG = JSFIGDIR + "shh.gif";...
... Hash: SHA1 Hi, I have a problem getting the -js_transform2 option to work properly. I have javascripts which have lines like this: SHFTHZFIG = JSFIGDIR +...
If you set MaxDocs: 300 (for example) in the scenario file, and then go on and spider a site, pavuk correctly stops downloading documents once 300 are found. ...
[for some reason this did not get posted in 24 hours, so I'm trying again] The manpage says it works, but I can't get the following simple pattern: ...
Hi, Pavuk doesn't want to make. I can't figure out why. I am running Slackware with all the standard libraries and tools... I get the following.. make...
I have a question for the group (or the Pavuk authors.) I am considering using Pavuk for a project that would require a bit of modification on how it runs....
Hi New to this so forgive the boring question. Trying to replicate http://www.csszengarden.com/ website to my local machine Here's me issuing the command and...
... my local machine Hi Jason, It is only downloading one file, because you told it to (mostl ikely it's the mode parameter in your rc file). File redirect ...
Hello, I am wondering if the GUI of PAVUK is currently "compilable" and "runable". When compiling the source from pavuk-0.9pl30b.tar.bz2 I get some errors...
Hi, ... I have no errors at the moment. Please tell me, what problems you have. ... This has been fixed already. P.S. It will probably not build with some GTK2...
On Tue, 28 Sep 2004 11:54:25 +0200 (CEST), Dirk Stoecker ... OK, thanks to You, with the --disable-gtk2 option it compiled. It is much nicer to configure with...
Hi, I am trying to do something like filtering a certain string out of the document to stop Pavuk from loading a dynamically generated pseudo image. - - -...